Breast Thermogram Analysis Using Classifier Ensembles and Image Symmetry Features

Breast cancer is the most commonly diagnosed form of cancer in women. Thermography, which uses cameras with sensitivities in the thermal infrared, has been shown to provide an interesting alternative to the standard modality of mammography for detecting breast cancer as it is able to detect smaller tumors and hence can lead to earlier diagnosis. In this paper, we present an approach to breast thermogram analysis that extracts features describing bilateral symmetries from an image and then utilizes a classifier ensemble for decision making. Importantly, our classification approach addresses the problem of imbalanced class distribution that is common in medical data analysis. We do this by constructing feature subspaces from balanced data subsets and train different classifiers on different subspaces. To combine the individual classifiers, we investigate two different strategies. The first dynamically assigns classifier weights based on an evolutionary algorithm, while the second uses a neural network for classifier fusion. Both approaches are shown to work well and to lead to significantly improved performance compared to canonical classification systems.

[1]  Robert P. W. Duin,et al.  The combining classifier: to train or not to train? , 2002, Object recognition supported by user interaction for service robots.

[2]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[3]  Gerald Schaefer Ant Colony Classification for Analysis of Image Features in Breast Thermograms , 2009, IPCV.

[4]  J. Flusser,et al.  Moments and Moment Invariants in Pattern Recognition , 2009 .

[5]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[6]  Ludmila I. Kuncheva,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2004 .

[7]  Hui Han,et al.  Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning , 2005, ICIC.

[8]  Longin Jan Latecki,et al.  Improving SVM classification on imbalanced time series data sets with ghost points , 2011, Knowledge and Information Systems.

[9]  Chumphol Bunkhumpornpat,et al.  DBSMOTE: Density-Based Synthetic Minority Over-sampling TEchnique , 2011, Applied Intelligence.

[10]  H. Qi,et al.  Detecting breast cancer from infrared images by asymmetry analysis , 2000, Proceedings of the 22nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (Cat. No.00CH37143).

[11]  Bryan F. Jones,et al.  A reappraisal of the use of infrared thermal image analysis in medicine , 1998, IEEE Transactions on Medical Imaging.

[12]  Yan-Shi Dong,et al.  Boosting SVM classifiers by ensemble , 2005, WWW '05.

[13]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[14]  Xin Yao,et al.  Diversity analysis on imbalanced data sets by using ensemble models , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[15]  Nathalie Japkowicz,et al.  Boosting Support Vector Machines for Imbalanced Data Sets , 2008, ISMIS.

[16]  Jing Peng,et al.  Classifying Unbalanced Pattern Groups by Training Neural Network , 2006, ISNN.

[17]  E. Yu,et al.  Functional infrared imaging of the breast , 2000, IEEE Engineering in Medicine and Biology Magazine.

[18]  Ethem Alpaydın,et al.  Combined 5 x 2 cv F Test for Comparing Supervised Classification Learning Algorithms , 1999, Neural Comput..

[19]  C. K. Chow,et al.  Statistical Independence and Threshold Functions , 1965, IEEE Trans. Electron. Comput..

[20]  D. Wolpert The Supervised Learning No-Free-Lunch Theorems , 2002 .

[21]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[22]  Gerald Schaefer,et al.  Thermography based breast cancer analysis using statistical features and fuzzy classification , 2009, Pattern Recognit..

[23]  M.,et al.  Statistical and Structural Approaches to Texture , 2022 .

[24]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[25]  James C. Bezdek,et al.  Decision templates for multiple classifier fusion: an experimental comparison , 2001, Pattern Recognit..

[26]  Francisco Herrera,et al.  A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[27]  T. Button,et al.  Detection of cancerous breasts by dynamic area telethermometry , 2001, IEEE Engineering in Medicine and Biology Magazine.

[28]  Masoud Nikravesh,et al.  Feature Extraction - Foundations and Applications , 2006, Feature Extraction.

[29]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[30]  M. Gautherie Thermobiological assessment of benign and malignant breast diseases. , 1983, American journal of obstetrics and gynecology.

[31]  B. Krawczyk,et al.  Improving minority class prediction using cost-sensitive ensembles , 2011 .

[32]  C. A. Lipari,et al.  The important role of infrared imaging in breast cancer , 2000, IEEE Engineering in Medicine and Biology Magazine.

[33]  Nitesh V. Chawla,et al.  SMOTEBoost: Improving Prediction of the Minority Class in Boosting , 2003, PKDD.

[34]  Michal Wozniak,et al.  Designing combining classifier with trained fuser — Analytical and experimental evaluation , 2010, 2010 10th International Conference on Intelligent Systems Design and Applications.

[35]  Kurt Hornik,et al.  kernlab - An S4 Package for Kernel Methods in R , 2004 .

[36]  Kurt Hornik,et al.  Support Vector Machines in R , 2006 .