Detecting Lung Abnormalities From X-rays Using an Improved SSL Algorithm

Abstract A significant component in computer-aided medical diagnosis is the automatic detection of lung abnormalities from digital chest X-rays; thus it constitutes a vital first step in radiologic image analysis. During the last decades, the rapid advances of digital technology and chest radiography have ultimately led to the development of large repositories with labeled and unlabeled images. Semi-supervised learning algorithms have become a hot topic of research, exploiting the explicit classification information of labeled images with the knowledge hidden in the unlabeled images. In the present work, we propose a new semi-supervised learning algorithm for the classification of lung abnormalities from X-rays based on an ensemble philosophy. The efficacy of the presented algorithm is demonstrated by numerical experiments, illustrating that reliable prediction models could be developed by incorporating ensemble methodologies in the semi-supervised framework.

[1]  J. L. Hodges,et al.  Rank Methods for Combination of Independent Experiments in Analysis of Variance , 1962 .

[2]  D. Mollura,et al.  Segmentation and Image Analysis of Abnormal Lungs at CT: Current Approaches, Challenges, and Future Trends. , 2015, Radiographics : a review publication of the Radiological Society of North America, Inc.

[3]  Sameer Antani,et al.  Automated Chest X-Ray Screening: Can Lung Region Symmetry Help Detect Pulmonary Abnormalities? , 2018, IEEE Transactions on Medical Imaging.

[4]  Yan Zhou,et al.  Democratic co-learning , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[5]  Panayiotis E. Pintelas,et al.  An Auto-Adjustable Semi-Supervised Self-Training Algorithm , 2018, Algorithms.

[6]  Panayiotis E. Pintelas,et al.  An Ensemble SSL Algorithm for Efficient Chest X-Ray Image Classification , 2018, J. Imaging.

[7]  Sylvie Ratté,et al.  An Unsupervised Approach for Combining Scores of Outlier Detection Techniques, Based on Similarity Measures , 2016, CLEI Selected Papers.

[8]  Nilanjan Dey,et al.  Adaptive Semi Supervised Support Vector Machine Semi Supervised Learning with Features Cooperation for Breast Cancer Classification , 2016 .

[9]  Tassos A. Mikropoulos,et al.  Predicting Secondary School Students' Performance Utilizing a Semi-supervised Learning Approach , 2019 .

[10]  Milan Petkovic,et al.  Hedaquin: A Reputation-based Health Data Quality Indicator , 2008, STM@ESORICS.

[11]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[12]  Clement J. McDonald,et al.  Lung Segmentation in Chest Radiographs Using Anatomical Atlases With Nonrigid Registration , 2014, IEEE Transactions on Medical Imaging.

[13]  Sameer Antani,et al.  Visualization and Interpretation of Convolutional Neural Network Predictions in Detecting Pneumonia in Pediatric Chest Radiographs , 2018, Applied sciences.

[14]  David W. Aha,et al.  Lazy Learning , 1997, Springer Netherlands.

[15]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[16]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[17]  Lior Rokach,et al.  Pattern Classification Using Ensemble Methods , 2009, Series in Machine Perception and Artificial Intelligence.

[18]  Tassos A. Mikropoulos,et al.  An Ensemble-Based Semi-Supervised Approach for Predicting Students’ Performance , 2018 .

[19]  Zhi-Hua Zhou,et al.  Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[20]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[21]  Zhi-Hua Zhou,et al.  Improve Computer-Aided Diagnosis With Machine Learning Techniques Using Undiagnosed Samples , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[23]  Francisco Herrera,et al.  Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study , 2015, Knowledge and Information Systems.

[24]  Friedhelm Schwenker,et al.  Combining committee-based semi-supervised learning and active learning , 2010 .

[25]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[26]  Sandro Etalle,et al.  Audit-Based Access Control for Electronic Health Records , 2007, Electron. Notes Theor. Comput. Sci..

[27]  H. Finner On a Monotonicity Problem in Step-Down Multiple Test Procedures , 1993 .

[28]  Kenji Suzuki Machine Learning in Computer-Aided Diagnosis: Medical Imaging Intelligence and Analysis , 2012 .

[29]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[30]  Alamgir Hossan,et al.  Multi-Stage Lung Cancer Detection and Prediction Using Multi-class SVM Classifie , 2018, 2018 International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2).

[31]  Hamideh Afsarmanesh,et al.  Semi-supervised self-training for decision tree classifiers , 2017, Int. J. Mach. Learn. Cybern..

[32]  Francisco Herrera,et al.  On the characterization of noise filters for self-training semi-supervised in nearest neighbor classification , 2014, Neurocomputing.

[33]  Stephen M. Moore,et al.  The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository , 2013, Journal of Digital Imaging.

[34]  Milan Sonka,et al.  Image Processing, Analysis and Machine Vision , 1993, Springer US.

[35]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[36]  Daniel S. Kermany,et al.  Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning , 2018, Cell.

[37]  Evelina Lamma,et al.  Rule-based Programming for Building Expert Systems: a Comparison in the Microbiological Data Validation and Surveillance Domain , 2001, Electron. Notes Theor. Comput. Sci..

[38]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[39]  Jesús Alcalá-Fdez,et al.  KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework , 2011, J. Multiple Valued Log. Soft Comput..

[40]  Pong C. Yuen,et al.  A Boosted Co-Training Algorithm for Human Action Recognition , 2011, IEEE Transactions on Circuits and Systems for Video Technology.