Breast Cancer Histological Image Classification with Multiple Features and Random Subspace Classifier Ensemble

Histological image is important for diagnosis of breast cancer. In this paper, we present a novel automatic breast cancer classification scheme based on histological images. The image features are extracted using the Curvelet Transform, statistics of Gray Level Co-occurrence Matrix (GLCM) and the Completed Local Binary Patterns (CLBP), respectively. The three different features are combined together and used for classification. A classifier ensemble approach, called Random Subspace Ensemble (RSE), are used to select and aggregate a set of base neural network classifiers for classification. The proposed multiple features and random subspace ensemble offer the classification rate 95.22% on a publically available breast cancer image dataset, which compares favorably with the previously published result 93.4%.

[1]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[2]  Robert P. W. Duin,et al.  An experimental study on diversity for bagging and boosting with linear classifiers , 2002, Inf. Fusion.

[3]  Giorgio Valentini,et al.  Bio-molecular cancer prediction with random subspace ensembles of support vector machines , 2005, Neurocomputing.

[4]  Juan José Rodríguez Diez,et al.  Random Subspace Ensembles for fMRI Classification , 2010, IEEE Transactions on Medical Imaging.

[5]  Taxiarchis Botsis,et al.  Automatic detection of clustered microcalcifications in digital mammograms using mathematical morphology and neural networks , 2007, Signal Process..

[6]  Lars Kai Hansen,et al.  Neural Network Ensembles , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Nicolas Loménie,et al.  Time-efficient sparse analysis of histopathological whole slide images , 2011, Comput. Medical Imaging Graph..

[8]  Therese Hesketh,et al.  Effects of reproductive and demographic changes on breast cancer incidence in China: a modeling analysis. , 2008, Journal of the National Cancer Institute.

[9]  Vivek Aggarwal,et al.  Spectrum of Breast Cancer in Asian Women , 2007, World journal of surgery.

[10]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[11]  Leen-Kiat Soh,et al.  Texture analysis of SAR sea ice imagery using gray level co-occurrence matrices , 1999, IEEE Trans. Geosci. Remote. Sens..

[12]  Zhenhua Guo,et al.  A Completed Modeling of Local Binary Pattern Operator for Texture Classification , 2010, IEEE Transactions on Image Processing.

[13]  Ron Kimmel,et al.  Breast Cancer Diagnosis From Biopsy Images Using Generic Features and SVMs , 2006 .

[14]  Gerlind Plonka-Hoch,et al.  The Curvelet Transform , 2010, IEEE Signal Processing Magazine.

[15]  Laurent Demanet,et al.  Fast Discrete Curvelet Transforms , 2006, Multiscale Model. Simul..

[16]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[18]  Kevin W. Bowyer,et al.  Combination of Multiple Classifiers Using Local Accuracy Estimates , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[20]  E. Candès,et al.  New tight frames of curvelets and optimal representations of objects with piecewise C2 singularities , 2004 .

[21]  Mikhail Teverovskiy,et al.  Multifeature Prostate Cancer Diagnosis and Gleason Grading of Histological Images , 2007, IEEE Transactions on Medical Imaging.

[22]  Robert Sabourin,et al.  Combining Diversity and Classification Accuracy for Ensemble Selection in Random Subspaces , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[23]  B. S. Manjunath,et al.  Object- and spatial-level quantitative analysis of multispectral histopathology images for detection and characterization of cancer , 2008 .

[24]  Gabor Fichtinger,et al.  Medical Image Computing and Computer-Assisted Intervention - MICCAI 2008, 11th International Conference, New York, NY, USA, September 6-10, 2008, Proceedings, Part I , 2008, International Conference on Medical Image Computing and Computer-Assisted Intervention.

[25]  Metin Nafi Gürcan,et al.  Adaptive Discriminant Wavelet Packet Transform and Local Binary Patterns for Meningioma Subtype Classification , 2008, MICCAI.

[26]  Ludmila I. Kuncheva,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2004 .

[27]  David A Clausi An analysis of co-occurrence texture statistics as a function of grey level quantization , 2002 .

[28]  Robert P. W. Duin,et al.  Bagging, Boosting and the Random Subspace Method for Linear Classifiers , 2002, Pattern Analysis & Applications.

[29]  Bruno Apolloni,et al.  Biological and Artificial Intelligence Environments - 15th Italian Workshop on Neural Nets, WIRN VIETRI 2004, Vietri sul Mare, Italy, 2004 , 2005, WIRN.

[30]  Ping Zhang,et al.  Reliable recognition of handwritten digits using a cascade ensemble classifier system and hybrid features , 2006 .

[31]  A. Madabhushi,et al.  Histopathological Image Analysis: A Review , 2009, IEEE Reviews in Biomedical Engineering.

[32]  Juan José Rodríguez Diez,et al.  Rotation Forest: A New Classifier Ensemble Method , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  Jerome H. Friedman,et al.  On Bias, Variance, 0/1—Loss, and the Curse-of-Dimensionality , 2004, Data Mining and Knowledge Discovery.

[35]  Jacek M. Zurada,et al.  Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance , 2008, Neural Networks.