A Random Forest Based Approach for One Class Classification in Medical Imaging

In this paper, we address the problem of one-class classification for medical image classification. Indeed, in some situations, pathological samples may be difficult to acquire. In this case, one class classification (OCC) is a natural learning paradigm to be used. It consists in learning from only one class of objects, while two or more classes may be presented in prediction. We propose an original OCC method called One-Class Random Forest (OCRF), that combines ensemble learning principles from traditional Random Forest algorithm with an original outlier generation method. These two key processes complement each other for responding to OCC issues, and are shown to perform well on medical datasets in comparison to few other state-of-the-art OCC methods.

[1]  Christopher M. Bishop,et al.  Non-linear Bayesian Image Modelling , 2000, ECCV.

[2]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[3]  Ian H. Witten,et al.  One-Class Classification by Combining Density and Class Probability Estimation , 2008, ECML/PKDD.

[4]  Janaina Mourão Miranda,et al.  Patient classification as an outlier detection problem: An application of the One-Class Support Vector Machine , 2011, NeuroImage.

[5]  G. Bourg-Heckly,et al.  Human in vivo fluorescence microimaging of the alveolar ducts and sacs during bronchoscopy , 2009, European Respiratory Journal.

[6]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[7]  Michael Brady,et al.  Novelty detection for the identification of masses in mammograms , 1995 .

[8]  Matti Pietikäinen,et al.  Gray Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2000, ECCV.

[9]  Shehroz S. Khan,et al.  A Survey of Recent Trends in One Class Classification , 2009, AICS.

[10]  Daniel Rueckert,et al.  Random Forest-Based Manifold Learning for Classification of Imaging Data in Dementia , 2011, MLMI.

[11]  Robert P. W. Duin,et al.  Combining One-Class Classifiers , 2001, Multiple Classifier Systems.

[12]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[13]  Yongyi Yang,et al.  Machine Learning in Medical Imaging , 2010, IEEE Signal Processing Magazine.

[14]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[15]  Conor Ryan,et al.  Artificial Intelligence and Cognitive Science , 2002, Lecture Notes in Computer Science.

[16]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[17]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[18]  Laurent Heutte,et al.  Influence of Hyperparameters on Random Forest Accuracy , 2009, MCS.

[19]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Peter A. Flach,et al.  Evaluation Measures for Multi-class Subgroup Discovery , 2009, ECML/PKDD.

[21]  Laurent Heutte,et al.  Forest-RK: A New Random Forest Induction Method , 2008, ICIC.