Direct-to-consumer machine learning model incorrectly classifies rare, aggressive skin cancers

A new study has found that a direct-to-consumer machine learning model for detecting skin cancers incorrectly classified rare and aggressive cancers as low-risk. The findings, presented at today's 30th EADV Congress, suggest that making apps based on such models available directly to the public, without transparency on performance metrics for rare but potentially life-threatening skin cancers, is ethically questionable.

Researchers in London focused on two types of skin cancer, Merkel cell carcinoma (MCC) and amelanotic melanoma, both of which are rare but particularly aggressive cancers that tend to grow fast and require early treatment. They created a dataset of 116 images of these rare cancers and of two benign lesion types, seborrheic keratoses and hemangiomas, and assessed these images with two machine learning models.

The first model studied was a certified medical device, sold directly to the public via the App Store and advertised as being able to diagnose 95% of skin cancers (Model 1). The second model was available for research purposes only and was used as a reference (Model 2).

The results showed that Model 1 incorrectly classified 17.9% of MCCs and 22.9% of amelanotic melanomas as low-risk. Conversely, 62.2% of benign lesions were classified as high-risk. For detecting malignancy, Model 1's sensitivity was 79.4% [95% confidence interval (CI) 69.3-89.4%] and its specificity was 37.7% [95% CI 24.7-50.8%]. For Model 2, MCC was not included in the top 5 diagnoses for any of the 28 MCC images analyzed, raising the possibility that the model had not been trained to recognize this disease class.
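For readers who want to see how sensitivity, specificity, and their confidence intervals relate to raw image counts, the following is a minimal sketch in Python. The counts used are not stated in the study summary; they are assumptions back-calculated from the reported percentages and the 116-image total (63 malignant images, of which 50 were flagged high-risk, and 53 benign images, of which 20 were flagged low-risk). The normal-approximation (Wald) interval is also an assumption, since the interval method used by the researchers is not given here. Under these assumptions the sketch approximately reproduces the reported figures.

```python
import math


def proportion_with_wald_ci(successes, total, z=1.96):
    """Return (proportion, lower, upper) using the normal-approximation (Wald) interval."""
    p = successes / total
    half_width = z * math.sqrt(p * (1 - p) / total)
    # Clamp to [0, 1] because the Wald interval can overshoot near the boundaries.
    return p, max(0.0, p - half_width), min(1.0, p + half_width)


# ASSUMED counts: inferred from the reported percentages, not stated in the source.
malignant_total = 63              # assumed: 28 MCC + 35 amelanotic melanoma images
malignant_flagged_high_risk = 50  # assumed true positives
benign_total = 53                 # assumed: 116 images minus 63 malignant
benign_flagged_low_risk = 20      # assumed true negatives

# Sensitivity: share of malignant lesions correctly flagged as high-risk.
sensitivity, sens_lo, sens_hi = proportion_with_wald_ci(
    malignant_flagged_high_risk, malignant_total
)

# Specificity: share of benign lesions correctly flagged as low-risk.
specificity, spec_lo, spec_hi = proportion_with_wald_ci(
    benign_flagged_low_risk, benign_total
)

print(f"Sensitivity: {sensitivity:.1%} (95% CI {sens_lo:.1%} to {sens_hi:.1%})")
print(f"Specificity: {specificity:.1%} (95% CI {spec_lo:.1%} to {spec_hi:.1%})")
```

The width of both intervals reflects the small number of images available, which is the data scarcity problem the researchers highlight for rare skin cancers.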

The high false-positive rate of Model 1 has potentially negative consequences on a personal and societal level. The results pose a bigger question of the safety of other artificial intelligence (AI) models for detecting skin cancer available on the market.

"In order to improve, machine learning model evaluations should consider the spectrum of diseases that will be seen in practice. At the moment, most of the performance of those models is driven by the imaging data available, which is particularly scarce when it comes to rare skin cancers."

Lloyd Steele, Study Lead Author, Blizard Institute, Queen Mary University of London, UK

A global collaboration between research groups and hospitals could be a step towards closing the gap in skin cancer imaging data, which is crucial for building high-performing machine learning models.

Marie-Aleth Richard, EADV Board Member, and Professor at the University Hospital of La Timone, Marseille, said: "The number of skin cancer detection apps available for consumer use is growing, but as demonstrated in this research, there must be more transparency around the safety and efficacy of these apps. Furthermore, such devices detect only what they are shown to analyze and do not make a systematic analysis of all the skin's surface. Failure to be transparent could put lives at risk."
