Computer versus human diagnosis of melanoma: evaluation of the feasibility of an automated diagnostic system in a prospective clinical trial

The aim of this study was to evaluate the accuracy of a computer-based system for the automated diagnosis of melanoma in the hands of nonexpert physicians. We performed a prospective comparison between nonexperts using computer assistance and experts without assistance in the setting of a tertiary referral center at a University hospital. Between February and November 2004 we enrolled 511 consecutive patients. Each patient was examined by two nonexpert physicians with low to moderate diagnostic skills who were allowed to use a neural network-based diagnostic system at their own discretion. Every patient was also examined by an expert dermatologist using standard dermatoscopy equipment. The nonexpert physicians used the automatic diagnostic system in 3827 pigmented skin lesions. In their hands, the system achieved a sensitivity of 72% and a specificity of 82%. The sensitivity was significantly lower than that of the expert physician (72 vs. 96%, P = 0.001), whereas the specificity was significantly higher (82 vs. 72%, P<0.01). Three melanomas were missed because the physicians who operated the system did not choose them for examination. The system as a stand-alone device had an average discriminatory power of 0.87, as measured by the area under the receiver operating characteristic curve, with optimal sensitivities and specificities of 75 and 84%, respectively. The diagnostic accuracy achieved in this clinical trial was lower than that achieved in a previous experimental trial of the same system. In total, the performance of a decision-support system for melanoma diagnosis under real-life conditions is lower than that expected from experimental data and depends upon the physicians who are using the system.

[1]  E S Berner,et al.  Influence of case and physician characteristics on perceptions of decision support systems. , 1999, Journal of the American Medical Informatics Association : JAMIA.

[2]  Stephan Dreiseitl,et al.  Do physicians value decision support? A look at the effect of decision support systems on physician opinion , 2005, Artif. Intell. Medicine.

[3]  Jurg Ott,et al.  20 Applications of neural networks for gene finding , 2001 .

[4]  Lucila Ohno-Machado,et al.  A Comparison of Machine Learning Methods for the Diagnosis of Pigmented Skin Lesions , 2001, J. Biomed. Informatics.

[5]  J V Tu,et al.  Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. , 1996, Journal of clinical epidemiology.

[6]  J. Ott,et al.  Applications of neural networks for gene finding. , 2001, Advances in genetics.

[7]  Paulo J. G. Lisboa,et al.  A review of evidence of health benefit from artificial neural networks in medical intervention , 2002, Neural Networks.

[8]  Enrico Blanzieri,et al.  A multiple classifier system for early melanoma diagnosis , 2003, Artif. Intell. Medicine.

[9]  Zheng Rong Yang,et al.  Biological applications of support vector machines , 2004, Briefings Bioinform..

[10]  Staal A. Vinterbo,et al.  Applying a decision support system in clinical practice: Results from melanoma diagnosis , 2007, AMIA.

[11]  G Pellacani,et al.  Computer description of colours in dermoscopic melanocytic lesion images reproducing clinical assessment. , 2003, The British journal of dermatology.

[12]  P. Rajan,et al.  Artificial neural networks in urolithiasis , 2005, Current opinion in urology.

[13]  Phei Lang ChangHANG,et al.  Evaluation of a Decision-support System for Preoperative Staging of Prostate Cancer , 1999, Medical decision making : an international journal of the Society for Medical Decision Making.

[14]  Pietro Rubegni,et al.  Automated diagnosis of pigmented skin lesions , 2002, International journal of cancer.

[15]  H. Kittler,et al.  Follow-up of melanocytic skin lesions with digital epiluminescence microscopy: patterns of modifications observed in early melanoma, atypical nevi, and common nevi. , 2000, Journal of the American Academy of Dermatology.

[16]  Enrico Blanzieri,et al.  Clinical validation of an automated system for supporting the early diagnosis of melanoma , 2004, Skin research and technology : official journal of International Society for Bioengineering and the Skin (ISBS) [and] International Society for Digital Imaging of Skin (ISDIS) [and] International Society for Skin Imaging.

[17]  Lucila Ohno-Machado,et al.  Logistic regression and artificial neural network classification models: a methodology review , 2002, J. Biomed. Informatics.

[18]  Timothy M. Franz,et al.  Enhancement of clinicians' diagnostic reasoning by computer-based consultation: a multisite study of 2 systems. , 1999, JAMA.

[19]  Silvio Bicciato Artificial neural network technologies to identify biomarkers for therapeutic intervention. , 2004, Current opinion in molecular therapeutics.

[20]  G Pellacani,et al.  Practical color calibration for dermoscopy, applied to a digital epiluminescence microscope , 2005, Skin research and technology : official journal of International Society for Bioengineering and the Skin (ISBS) [and] International Society for Digital Imaging of Skin (ISDIS) [and] International Society for Skin Imaging.

[21]  H. Kittler,et al.  Diagnostic accuracy of dermoscopy. , 2002, The Lancet. Oncology.

[22]  K Wolff,et al.  Computer-aided epiluminescence microscopy of pigmented skin lesions: the value of clinical data for the classification process , 2000, Melanoma research.