On the Influence of Class Noise in Medical Data Classification: Treatment Using Noise Filtering Methods

ABSTRACT Classification systems play an important role in medical decision support, because they allow automatizing and accelerating the data analysis process. However, their quality is based on that of the training dataset upon which the classification models are built. The labeling process of each training example is usually performed by domain experts or automatic systems. When a wrong assignment of class labels to examples is performed, the training process and, therefore, the classification performance, might be negatively affected. This problem is formally known as class label noise. One of the most used techniques to reduce the harmful consequences of mislabeled objects is noise filtering, which removes noisy examples from the training data. This article analyzes the usefulness of such methods in the context of medical data classification. The experiments carried out on several real-world datasets show the importance of noise filtering when class noise affects the data.

[1]  Francisco Herrera,et al.  Analyzing the presence of noise in multi-class problems: alleviating its influence with the One-vs-One decomposition , 2012, Knowledge and Information Systems.

[2]  Ray J. Hickey,et al.  Noise Modelling and Evaluating Learning from Examples , 1996, Artif. Intell..

[3]  Choh-Man Teng,et al.  Correcting Noisy Data , 1999, ICML.

[4]  Francisco Herrera,et al.  Tackling the problem of classification with noisy data using Multiple Classifier Systems: Analysis of the performance and robustness , 2013, Inf. Sci..

[5]  Bartosz Krawczyk,et al.  Hypertension Type Classification Using Hierarchical Ensemble of One-Class Classifiers for Imbalanced Data , 2014, ICT Innovations.

[6]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[7]  Tony R. Martinez,et al.  Improved Heterogeneous Distance Functions , 1996, J. Artif. Intell. Res..

[8]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[9]  Pierre A. Devijver On the editing rate of the Multiedit algorithm , 1986, Pattern Recognit. Lett..

[10]  Xingquan Zhu,et al.  Class Noise vs. Attribute Noise: A Quantitative Study , 2003, Artificial Intelligence Review.

[11]  Taghi M. Khoshgoftaar,et al.  Improving Software Quality Prediction by Noise Filtering Techniques , 2007, Journal of Computer Science and Technology.

[12]  M. Verleysen,et al.  Classification in the Presence of Label Noise: A Survey , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[13]  S. Cessie,et al.  Ridge Estimators in Logistic Regression , 1992 .

[14]  J. R. Quinlan Constructing Decision Trees , 1993 .

[15]  Filiberto Pla,et al.  Prototype selection for the nearest neighbour rule through proximity graphs , 1997, Pattern Recognit. Lett..

[16]  Gerald Schaefer,et al.  A hybrid classifier committee for analysing asymmetry features in breast thermograms , 2014, Appl. Soft Comput..

[17]  D. Wolpert The Supervised Learning No-Free-Lunch Theorems , 2002 .

[18]  Xindong Wu,et al.  Mining With Noise Knowledge: Error-Aware Data Mining , 2008, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[19]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Effect of label noise in the complexity of classification problems , 2015, Neurocomputing.

[20]  Igor Kononenko,et al.  Machine learning for medical diagnosis: history, state of the art and perspective , 2001, Artif. Intell. Medicine.

[21]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[22]  H. Finner On a Monotonicity Problem in Step-Down Multiple Test Procedures , 1993 .

[23]  Carla E. Brodley,et al.  Identifying Mislabeled Training Data , 1999, J. Artif. Intell. Res..

[24]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[25]  Francisco Herrera,et al.  Predicting noise filtering efficacy with data complexity measures for nearest neighbor classification , 2013, Pattern Recognit..

[26]  Nuno Pombo,et al.  Knowledge discovery in clinical decision support systems for pain management: A systematic review , 2014, Artif. Intell. Medicine.

[27]  Saso Dzeroski,et al.  Noise detection and elimination in data preprocessing: Experiments in medical domains , 2000, Appl. Artif. Intell..

[28]  Saso Dzeroski,et al.  Noise Elimination in Inductive Concept Learning: A Case Study in Medical Diagnosois , 1996, ALT.

[29]  Dennis L. Wilson,et al.  Asymptotic Properties of Nearest Neighbor Rules Using Edited Data , 1972, IEEE Trans. Syst. Man Cybern..

[30]  Aboul Ella Hassanien,et al.  Dimensionality reduction of medical big data using neural-fuzzy classifier , 2014, Soft Computing.

[31]  Roberto Alejo,et al.  Analysis of new techniques to obtain quality training sets , 2003, Pattern Recognit. Lett..

[32]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[33]  Nada Lavrac,et al.  Experiments with Noise Filtering in a Medical Domain , 1999, ICML.

[34]  Bartosz Krawczyk,et al.  Cytological image analysis with firefly nuclei detection and hybrid one-class classification decomposition , 2014, Eng. Appl. Artif. Intell..

[35]  G. McLachlan Discriminant Analysis and Statistical Pattern Recognition , 1992 .

[36]  Enrico Blanzieri,et al.  Detecting potential labeling errors in microarrays by data perturbation , 2006, Bioinform..

[37]  Francisco Herrera,et al.  Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power , 2010, Inf. Sci..