Classifier ensemble for an effective cytological image analysis

Breast cancer is the most common type of cancer among women. As early detection is crucial for the patient's health, much attention has been paid to the development of tools for effective recognition of this disease. This article presents an application of image analysis and classification methods for fine needle biopsy. In our approach, each patient is described by nine microscopic images taken from the biopsy sample. The images are related to regions of the biopsy that seem interesting to the physician who selects them arbitrarily. We propose four different hybrid segmentation algorithms dedicated to processing these images and examine their effectiveness for the nuclei feature extraction task. Classification is carried out with the usage of a classifier ensemble based on the Random Subspaces approach. To boost its effectiveness, we use a linear combination of the support functions returned by the individual classifiers in the ensemble. In the proposed medical support system, the final decision about the patient is delivered after a fusion of nine separate outputs of the classifier - each for a different image. Experimental results carried out on a diverse dataset collected by the authors prove that the proposed solution outperforms state-of-the-art classifiers and shows itself to be a valuable tool for supporting day-to-day cytologist's routine.

[1]  W. F. Clocksin,et al.  Automatic segmentation of overlapping nuclei with high background variation using robust estimation and flexible contour models , 2003, 12th International Conference on Image Analysis and Processing, 2003.Proceedings..

[2]  Soo-Hong Kim,et al.  Analysis of breast cancer using data mining & statistical techniques , 2005, Sixth International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing and First ACIS International Workshop on Self-Assembling Wireless Network.

[3]  F. Cloppet,et al.  Segmentation of overlapping/aggregating nuclei cells in biological images , 2008, 2008 19th International Conference on Pattern Recognition.

[4]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[5]  Kwang-Ryol Ryu,et al.  Wavelet based Feature Extraction of Human face , 2001 .

[6]  Ryszard S. Choraś Image Processing and Communications Challenges 3 - 3rd International Conference, IP&C 2011, Proceedings , 2011, IP&C.

[7]  Rafael C. González,et al.  Local Determination of a Moving Contrast Edge , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Bartosz Krawczyk,et al.  Combined classifier based on feature space partitioning , 2012, Int. J. Appl. Math. Comput. Sci..

[9]  Michal Wozniak,et al.  Combining classifiers under probabilistic models: experimental comparative analysis of methods , 2012, Expert Syst. J. Knowl. Eng..

[10]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Yousef Al-Kofahi,et al.  Improved Automatic Detection and Segmentation of Cell Nuclei in Histopathology Images , 2010, IEEE Transactions on Biomedical Engineering.

[12]  R. A. Osborn,et al.  Introduction to Biopsy Interpretation and Surgical Pathology , 1988 .

[13]  Michał Woźniak,et al.  Combining classifiers using trained fuser - analytical and experimental results / , 2010 .

[14]  Bogdan Gabrys,et al.  Classifier selection for majority voting , 2005, Inf. Fusion.

[15]  Christophoros Nikou,et al.  Combining shape, texture and intensity features for cell nuclei extraction in Pap smear images , 2011, Pattern Recognit. Lett..

[16]  S Issac Niwas,et al.  Wavelet based feature extraction method for breast cancer cytology images , 2010, 2010 IEEE Symposium on Industrial Electronics and Applications (ISIEA).

[17]  Marek Kowal,et al.  Fuzzy Clustering and Adaptive Thresholding Based Segmentation Method for Breast Cancer Diagnosis , 2011, Computer Recognition Systems 4.

[18]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[19]  Mira Park,et al.  Clustering nuclei using machine learning techniques , 2010, IEEE/ICME International Conference on Complex Medical Engineering.

[20]  Bartosz Krawczyk,et al.  Cost-Sensitive Splitting and Selection Method for Medical Decision Support System , 2012, IDEAL.

[21]  Elif Derya Übeyli,et al.  Computer recognition systems , 2009, Expert Syst. J. Knowl. Eng..

[22]  D. Wolpert The Supervised Learning No-Free-Lunch Theorems , 2002 .

[23]  Marek Kowal,et al.  Automatic Breast Cancer Diagnosis Based on K-Means Clustering and Adaptive Thresholding Hybrid Segmentation , 2011, IP&C.

[24]  Tadeusz M. Szuba,et al.  Computational Collective Intelligence , 2001, Lecture Notes in Computer Science.

[25]  Louis Vuurpijl,et al.  An overview and comparison of voting methods for pattern recognition , 2002, Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition.

[26]  C. Mathers,et al.  Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008 , 2010, International journal of cancer.

[27]  Ryszard Tadeusiewicz,et al.  Texture analysis in perfusion images of prostate cancer—A case study , 2010, Int. J. Appl. Math. Comput. Sci..

[28]  Marek Kowal,et al.  COMPUTER-AIDED DIAGNOSIS OF BREAST CANCER USING GAUSSIAN MIXTURE CYTOLOGICAL IMAGE SEGMENTATION , 2011 .

[29]  Emilio Corchado,et al.  Intelligent Data Engineering and Automated Learning - IDEAL 2006, 7th International Conference, Burgos, Spain, September 20-23, 2006, Proceedings , 2006, IDEAL.

[30]  Marek Kowal,et al.  Hybrid cytological image segmentation method based on competitive neural network and adaptive thresholding , 2011 .

[31]  Rached Tourki,et al.  Automated Breast Cancer Diagnosis Based on GVF-Snake Segmentation, Wavelet Features Extraction and Fuzzy Classification , 2009, J. Signal Process. Syst..

[32]  A. Madabhushi,et al.  Histopathological Image Analysis: A Review , 2009, IEEE Reviews in Biomedical Engineering.

[33]  Michal Wozniak,et al.  Some Remarks on Chosen Methods of Classifier Fusion Based on Weighted Voting , 2009, HAIS.

[34]  C Caldas,et al.  One-stop diagnostic breast clinics: how often are breast cancers missed? , 2009, British Journal of Cancer.

[35]  Robert A. Jacobs,et al.  Methods For Combining Experts' Probability Assessments , 1995, Neural Computation.

[36]  Aboul Ella Hassanien,et al.  Computational Intelligence in Biomedicine and Bioinformatics, Current Trends and Applications , 2008, Computational Intelligence in Biomedicine and Bioinformatics.

[37]  J. Ferlay,et al.  Global estimates of cancer prevalence for 27 sites in the adult population in 2008 , 2013, International journal of cancer.

[38]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[39]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Kevin W. Bowyer,et al.  Combination of Multiple Classifiers Using Local Accuracy Estimates , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  Andrzej Obuchowicz,et al.  Segmentation of Breast Cancer Fine Needle Biopsy Cytological Images , 2008, Int. J. Appl. Math. Comput. Sci..

[42]  Sameer Singh,et al.  Advanced Algorithmic Approaches to Medical Image Segmentation , 2002, Advances in Computer Vision and Pattern Recognition.

[43]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[44]  Anil K. Jain,et al.  Prostate cancer grading: Gland segmentation and structural features , 2012, Pattern Recognit. Lett..

[45]  Robert P. W. Duin,et al.  The combining classifier: to train or not to train? , 2002, Object recognition supported by user interaction for service robots.

[46]  Kevin W. Bowyer,et al.  Combination of multiple classifiers using local accuracy estimates , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[47]  H. Irshad,et al.  Image segmentation using fuzzy clustering: A survey , 2010, 2010 6th International Conference on Emerging Technologies (ICET).

[48]  Bartosz Krawczyk,et al.  Adaptive Splitting and Selection Algorithm for Classification of Breast Cytology Images , 2012, ICCCI.

[49]  G.B. Coleman,et al.  Image segmentation by clustering , 1979, Proceedings of the IEEE.

[50]  Daniel Hernández-Lobato,et al.  An Analysis of Ensemble Pruning Techniques Based on Ordered Aggregation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51]  Gerald Schaefer,et al.  Breast thermogram analysis using a cost-sensitive multiple classifier system , 2012, Proceedings of 2012 IEEE-EMBS International Conference on Biomedical and Health Informatics.

[52]  Fabio Roli,et al.  Bayesian Analysis of Linear Combiners , 2007, MCS.

[53]  Adam Krzyzak,et al.  Classification of Breast Cancer Malignancy Using Cytological Images of Fine Needle Aspiration Biopsies , 2008, Int. J. Appl. Math. Comput. Sci..

[54]  Ming Xie,et al.  Color clustering and learning for image segmentation based on neural networks , 2005, IEEE Trans. Neural Networks.

[55]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[56]  Ludmila I. Kuncheva,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2004 .

[57]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[58]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[59]  Gavin Brown,et al.  "Good" and "Bad" Diversity in Majority Vote Ensembles , 2010, MCS.

[60]  Marek Kowal,et al.  COMPUTER-AIDED DIAGNOSIS OF BREAST CANCER USING GAUSSIAN MIXTURE CYTOLOGICAL IMAGE SEGMENTATION 2 , 2011 .

[61]  Kotagiri Ramamohanarao,et al.  Breast-Cancer identification using HMM-fuzzy approach , 2010, Comput. Biol. Medicine.

[62]  Andrzej Obuchowicz,et al.  Computational Intelligence Techniques in Image Segmentation for Cytopathology , 2008, Computational Intelligence in Biomedicine and Bioinformatics.

[63]  Habibollah Haron,et al.  Image segmentation using fuzzy C-Means clustering , 2008 .

[64]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..