Partial order label decomposition approaches for melanoma diagnosis

Abstract Melanoma is a type of cancer that develops from the pigment-containing cells known as melanocytes. Usually occurring on the skin, early detection and diagnosis is strongly related to survival rates. Melanoma recognition is a challenging task that nowadays is performed by well trained dermatologists who may produce varying diagnosis due to the task complexity. This motivates the development of automated diagnosis tools, in spite of the inherent difficulties (intra-class variation, visual similarity between melanoma and non-melanoma lesions, among others). In the present work, we propose a system combining image analysis and machine learning to detect melanoma presence and severity. The severity is assessed in terms of melanoma thickness, which is measured by the Breslow index. Previous works mainly focus on the binary problem of detecting the presence of the melanoma. However, the system proposed in this paper goes a step further by also considering the stage of the lesion in the classification task. To do so, we extract 100 features that consider the shape, colour, pigment network and texture of the benign and malignant lesions. The problem is tackled as a five-class classification problem, where the first class represents benign lesions, and the remaining four classes represent the different stages of the melanoma (via the Breslow index). Based on the problem definition, we identify the learning setting as a partial order problem, in which the patterns belonging to the different melanoma stages present an order relationship, but where there is no order arrangement with respect to the benign lesions. Under this assumption about the class topology, we design several proposals to exploit this structure and improve data preprocessing. In this sense, we experimentally demonstrate that those proposals exploiting the partial order assumption achieve better performance than 12 baseline nominal and ordinal classifiers (including a deep learning model) which do not consider this partial order. To deal with class imbalance, we additionally propose specific over-sampling techniques that consider the structure of the problem for the creation of synthetic patterns. The experimental study is carried out with clinician-curated images from the Interactive Atlas of Dermoscopy, which eases reproducibility of experiments. Concerning the results obtained, in spite of having augmented the complexity of the classification problem with more classes, the performance of our proposals in the binary problem is similar to the one reported in the literature.

[1]  Willem Waegeman,et al.  An ensemble of Weighted Support Vector Machines for Ordinal Regression , 2007 .

[2]  Angela Ferrari,et al.  Interactive atlas of dermoscopy , 2000 .

[3]  H. Lorentzen,et al.  Dermatoscopic prediction of melanoma thickness using latent trait analysis and likelihood ratios. , 2001, Acta dermato-venereologica.

[4]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[5]  James Bailey,et al.  Computer-Aided Diagnosis of Melanoma Using Border- and Wavelet-Based Texture Analysis , 2012, IEEE Transactions on Information Technology in Biomedicine.

[6]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[7]  Jaime S. Cardoso,et al.  Discriminative directional classifiers , 2016, Neurocomputing.

[8]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  María Pérez-Ortiz,et al.  Tackling the ordinal and imbalance nature of a melanoma image classification problem , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[10]  Pedro Antonio Gutiérrez,et al.  Projection-Based Ensemble Learning for Ordinal Regression , 2014, IEEE Transactions on Cybernetics.

[11]  Franck Marzani,et al.  Tackling the Problem of Data Imbalancing for Melanoma Classification , 2016, BIOIMAGING.

[12]  A Breslow,et al.  Thickness, Cross‐Sectional Areas and Depth of Invasion in the Prognosis of Cutaneous Melanoma , 1970, Annals of surgery.

[13]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[14]  Carlos S. Mendoza,et al.  Development and evaluation of perceptually adapted colour gradients , 2013, IET Image Process..

[15]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[16]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[17]  R Marchesini,et al.  The ABCD system of melanoma detection , 1999, Cancer.

[18]  Eyke Hüllermeier,et al.  Graded Multilabel Classification: The Ordinal Case , 2010, ICML.

[19]  M. Lens,et al.  Excision margins for primary cutaneous melanoma: updated pooled analysis of randomized controlled trials. , 2007, Archives of surgery.

[20]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[21]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[22]  Eyke Hüllermeier,et al.  Is an ordinal class structure useful in classifier learning? , 2008, Int. J. Data Min. Model. Manag..

[23]  Alan Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[24]  D. Coit,et al.  Sentinel lymph node evaluation in melanoma. , 1997, Archives of dermatology.

[25]  Giuseppe Argenziano,et al.  Dermoscopic criteria for melanoma in situ are similar to those for early invasive melanoma , 2001, Cancer.

[26]  Eibe Frank,et al.  A Simple Approach to Ordinal Classification , 2001, ECML.

[27]  María Pérez-Ortiz,et al.  Classification of Melanoma Presence and Thickness Based on Computational Image Analysis , 2016, HAIS.

[28]  Bruno Bauwens,et al.  From circular ordinal regression to multilabel classification , 2010 .

[29]  Aurora Sáez,et al.  Model-Based Classification Methods of Global Patterns in Dermoscopic Images , 2014, IEEE Transactions on Medical Imaging.

[30]  Pedro Antonio Gutiérrez,et al.  Metrics to guide a multi-objective evolutionary algorithm for ordinal classification , 2014, Neurocomputing.

[31]  Pedro Antonio Gutiérrez,et al.  Graph-Based Approaches for Over-Sampling in the Context of Ordinal Regression , 2015, IEEE Transactions on Knowledge and Data Engineering.

[32]  P. McCullagh Regression Models for Ordinal Data , 1980 .

[33]  M. Stella Atkins,et al.  A novel method for detection of pigment network in dermoscopic images using graphs , 2011, Comput. Medical Imaging Graph..

[34]  Lan Wu,et al.  Nonnegative Elastic Net and application in index tracking , 2014, Appl. Math. Comput..

[35]  C. Hervás-Martínez,et al.  An organ allocation system for liver transplantation based on ordinal regression , 2014, Appl. Soft Comput..

[36]  Pedro Antonio Gutiérrez,et al.  Ordinal Regression Methods: Survey and Experimental Study , 2016, IEEE Transactions on Knowledge and Data Engineering.

[37]  Costantino Grana,et al.  Computer description of colours in dermoscopic melanocytic lesion images reproducing clinical assessment , 2003 .

[38]  Pietro Rubegni,et al.  Evaluation of cutaneous melanoma thickness by digital dermoscopy analysis: a retrospective study , 2010, Melanoma research.

[39]  Randy H. Moss,et al.  A methodological approach to the classification of dermoscopy images , 2007, Comput. Medical Imaging Graph..

[40]  V. De Giorgi,et al.  Non-invasive analysis of melanoma thickness by means of dermoscopy: a retrospective study , 2001, Melanoma research.

[41]  Thore Graepel,et al.  Large Margin Rank Boundaries for Ordinal Regression , 2000 .

[42]  A. Simmons,et al.  Predicting Progression of Alzheimer’s Disease Using Ordinal Regression , 2014, PloS one.

[43]  Rainer Hofmann-Wellenhof,et al.  Color Atlas of Melanocytic Lesions of the Skin , 2008 .

[44]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[45]  Cila Herman,et al.  Emerging technologies for the detection of melanoma: achieving better outcomes , 2012, Clinical, cosmetic and investigational dermatology.

[46]  Amouroux Marine,et al.  Non-Invasive Determination of Breslow Index , 2011 .

[47]  Ilias Maglogiannis,et al.  Overview of Advanced Computer Vision Systems for Skin Lesions Characterization , 2009, IEEE Transactions on Information Technology in Biomedicine.

[48]  Hanqing Lu,et al.  A practical SVM-based algorithm for ordinal regression in image retrieval , 2003, MULTIMEDIA '03.

[49]  G. Argenziano,et al.  Clinical and dermatoscopic criteria for the preoperative evaluation of cutaneous melanoma thickness. , 1999, Journal of the American Academy of Dermatology.

[50]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[51]  Pedro Antonio Gutiérrez,et al.  A guided data projection technique for classification of sovereign ratings: The case of European Union 27 , 2014, Appl. Soft Comput..

[52]  Eyke Hüllermeier,et al.  Label ranking by learning pairwise preferences , 2008, Artif. Intell..

[53]  Hao Chen,et al.  Automated Melanoma Recognition in Dermoscopy Images via Very Deep Residual Networks , 2017, IEEE Transactions on Medical Imaging.

[54]  Ling Li,et al.  Reduction from Cost-Sensitive Ordinal Ranking to Weighted Binary Classification , 2012, Neural Computation.

[55]  Pedro Antonio Gutiérrez,et al.  Machine Learning Methods for Binary and Multiclass Classification of Melanoma Thickness From Dermoscopic Images , 2016, IEEE Transactions on Medical Imaging.

[56]  Qinghua Zheng,et al.  Ordinal extreme learning machine , 2010, Neurocomputing.

[57]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[58]  James Zijun Wang,et al.  Automatic diagnosis of melanoma using machine learning methods on a spectroscopic system , 2014, BMC Medical Imaging.

[59]  Wei Chu,et al.  Support Vector Ordinal Regression , 2007, Neural Computation.

[60]  Pedro Antonio Gutiérrez,et al.  Exploitation of Pairwise Class Distances for Ordinal Classification , 2013, Neural Computation.

[61]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[62]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[63]  M. Binder,et al.  Epiluminescence microscopy. A useful tool for the diagnosis of pigmented skin lesions for formally trained dermatologists. , 1995, Archives of dermatology.

[64]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[65]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[66]  N. Otsu A threshold selection method from gray level histograms , 1979 .