Skin lesion diagnosis from images using novel ensemble classification techniques

Reduction of the error rate of melanoma diagnosis, a critical and very dangerous skin cancer that could be treated when early detected, is of major importance. Towards this direction, the present paper presents a novel ensemble classification technique, combining traditional Random Forests with the ‘Markov Blanket’ notion. The proposed algorithm performs an inherent feature selection phase where only truly informative features are carried forward, thus alleviating the curse of dimensionality and augmenting classification performance. It has been evaluated in a high-dimensional and imbalanced dataset of 1041 skin lesion images, which been preprocessed using the ABCD-rule of dermatology. The proposed ensemble classification technique exhibited a higher classification performance in comparison with the classical Random Forest algorithms, as well as other widely-used classification algorithms where standard feature reduction techniques, such as PCA and SVD, have been applied.

[1]  S. Dudoit,et al.  Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data , 2002 .

[2]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[3]  A. Venot,et al.  Assessment of healing kinetics through true color image processing , 1993, IEEE Trans. Medical Imaging.

[4]  R Marchesini,et al.  The invisible colours of melanoma. A telespectrophotometric diagnostic approach on pigmented skin lesions. , 1996, European journal of cancer.

[5]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[6]  D. Leotta,et al.  Image processing techniques for quantitative analysis of skin structures. , 1999, Computer methods and programs in biomedicine.

[7]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[8]  R Marchesini,et al.  Image Analysis in the RGB and HS Colour Planes for a Computer-Assisted Diagnosis of Cutaneous Pigmented Lesions , 1998, Tumori.

[9]  Ping Wang,et al.  A computer-aided MFCC-based HMM system for automatic auscultation , 2008, Comput. Biol. Medicine.

[10]  S. Chinn,et al.  The assessment of methods of measurement. , 1990, Statistics in medicine.

[11]  R. Pariser,et al.  Primary care physicians' errors in handling cutaneous disorders. A prospective survey. , 1987, Journal of the American Academy of Dermatology.

[12]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[13]  M A Weinstock,et al.  Epidemiology of melanoma. , 2017, Cancer treatment and research.

[14]  Paul Anthony Iaizzo,et al.  Wound status evaluation using color image processing , 1997, IEEE Transactions on Medical Imaging.

[15]  W. Lohmann,et al.  In situ detection of melanomas by fluorescence measurements , 1988, Naturwissenschaften.

[16]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[17]  Jae Won Lee,et al.  An extensive comparison of recent classification tools applied to microarray data , 2004, Comput. Stat. Data Anal..

[18]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .