Ensemble-Based Machine Learning Algorithms for Classifying Breast Tissue Based on Electrical Impedance Spectroscopy

The initial identification of breast cancer and the prediction of its category have become a requirement in cancer research because they can simplify the subsequent clinical management of patients. The application of artificial intelligence techniques (e.g., machine learning and deep learning) in medical science is becoming increasingly important for intelligently transforming all available information into valuable knowledge. Therefore, we aimed to classify six classes of freshly excised tissues from a set of electrical impedance measurement variables using five ensemble-based machine learning (ML) algorithms, namely, the random forest (RF), extremely randomized trees (ERT), decision tree (DT), gradient boosting tree (GBT) and AdaBoost (Adaptive Boosting) (ADB) algorithms, which can be subcategorized as bagging and boosting methods. In addition, the ranked order of the variables based on their importance differed across the ML algorithms. The results demonstrated that the three bagging ensemble ML algorithms, namely, RF ERT and DT, yielded better classification accuracies (78–86%) compared with the two boosting algorithms, GBT and ADB (60–75%). We hope that these our results would help improve the classification of breast tissue to allow the early prediction of cancer susceptibility.

[1]  Sean T. Osis,et al.  Fuzzy Inference System-based Recognition of Slow, Medium and Fast Running Conditions using a Triaxial Accelerometer , 2017 .

[2]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[3]  Christian A. Clermont,et al.  Using wearable sensors to classify subject-specific running biomechanical gait patterns based on changes in environmental weather conditions , 2018, PloS one.

[4]  J. Jossinet Variability of impedivity in normal and pathological breast tissue , 1996, Medical and Biological Engineering and Computing.

[5]  Zhenyu Guo,et al.  A review of electrical impedance techniques for breast cancer detection. , 2003, Medical engineering & physics.

[6]  David S. Wishart,et al.  Applications of Machine Learning in Cancer Prediction and Prognosis , 2006, Cancer informatics.

[7]  Robert Marti,et al.  A Novel Breast Tissue Density Classification Methodology , 2008, IEEE Transactions on Information Technology in Biomedicine.

[8]  J. Jossinet,et al.  Classification of breast tissue by electrical impedance spectroscopy , 2006, Medical and Biological Engineering and Computing.

[9]  Robert Marti,et al.  A Comparison of Breast Tissue Classification Techniques , 2006, MICCAI.

[10]  Yu-Bin Yang,et al.  Lung cancer cell identification based on artificial neural network ensembles , 2002, Artif. Intell. Medicine.

[11]  Kurt Hornik,et al.  The Design and Analysis of Benchmark Experiments , 2005 .

[12]  Mike Clarke,et al.  UK and USA breast cancer deaths down 25% in year 2000 at ages 20–69 years , 2000, The Lancet.

[13]  Denys Poshyvanyk,et al.  Machine Learning-Based Prototyping of Graphical User Interfaces for Mobile Apps , 2018, IEEE Transactions on Software Engineering.

[14]  Kenneth Sundaraj,et al.  Real-time robot-human interaction by tracking hand movement & orientation based on morphology , 2011, 2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA).

[15]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[16]  Kenneth Sundaraj,et al.  Rehabilitation systems for physically disabled patients: A brief review ofsensor-based computerised signal-monitoring systems. , 2013 .

[17]  Peter Bühlmann,et al.  Bagging, Boosting and Ensemble Methods , 2012 .

[18]  Christian A. Clermont,et al.  Subject-specific and group-based running pattern classification using a single wearable sensor. , 2019, Journal of biomechanics.

[19]  Guangtao Ge,et al.  Classification of premalignant pancreatic cancer mass-spectrometry data using decision tree ensembles , 2008, BMC Bioinformatics.

[20]  Ardalan Ghasemzadeh,et al.  Breast cancer detection based on Gabor-wavelet transform and machine learning methods , 2018, Int. J. Mach. Learn. Cybern..

[21]  Saliha Oukid,et al.  Breast cancer classification using machine learning , 2018, 2018 Electric Electronics, Computer Science, Biomedical Engineerings' Meeting (EBBT).

[22]  Rajkumar Palaniappan,et al.  Machine learning in lung sound analysis: a systematic review , 2013 .

[23]  Lam Chee Kiang,et al.  Artificial Intelligence Techniques Used for Wheeze Sounds Analysis: Review , 2016 .

[24]  Weijian Guo,et al.  Prediction of Pancreatic Cancer by Serum Biomarkers Using Surface-Enhanced Laser Desorption/Ionization-Based Decision Tree Classification , 2005, Oncology.

[25]  J Jossinet,et al.  The impedivity of freshly excised human breast tissue , 1998, Physiological measurement.

[26]  Kemal Polat,et al.  Breast cancer diagnosis using least square support vector machine , 2007, Digit. Signal Process..