Conventional Machine Learning based on Feature Engineering for Detecting Pneumonia from Chest X-rays

Chest X-ray is the standard approach used to diagnose pneumonia and other chest diseases. Early diagnosis of the disease is very relevant in the life of people, but analyzing X-ray images can be complicated and needs the competence of a radiographer. In this paper, we demonstrate the potential of detecting the disease in chest X-rays using conventional machine learning classifiers. The principal component analysis is used for the data dimensionality reduction and features extraction then the extracted features are used to train several model classifiers. We obtained an accuracy of , using of the principal explained variance.

[1]  Zhihua Cai,et al.  Evaluation Measures of the Classification Performance of Imbalanced Data Sets , 2009 .

[2]  Wei Dai,et al.  A MapReduce Implementation of C4.5 Decision Tree Algorithm , 2014 .

[3]  Cesar H. Comin,et al.  Principal Component Analysis , 2018, ACM Comput. Surv..

[4]  Wei Dai,et al.  A MapReduce Implementation of C 4 . 5 Decision Tree Algorithm , 2014 .

[5]  Jonathon Shlens,et al.  A Tutorial on Principal Component Analysis , 2014, ArXiv.

[6]  April Tantillo Infant Mortality , 1923, Nature.

[7]  Raheel Siddiqi,et al.  Automated Pneumonia Diagnosis using a Customized Sequential Convolutional Neural Network , 2019, ICDLT.

[8]  Keeley A. Crockett,et al.  Soft decision trees: a new approach using non-linear fuzzification , 2000, Ninth IEEE International Conference on Fuzzy Systems. FUZZ- IEEE 2000 (Cat. No.00CH37063).

[9]  Andrew Y. Ng,et al.  CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning , 2017, ArXiv.

[10]  Armin Shmilovici,et al.  On the use of decision tree induction for discovery of interactions in a photolithographic process , 2003 .

[11]  Kazem Taghva,et al.  Cancer survivability with logistic regression , 2017, 2017 Computing Conference.

[12]  Can Jozef Saul,et al.  Early Diagnosis of Pneumonia with Deep Learning , 2019, ArXiv.

[13]  Alaa Tharwat,et al.  Classification assessment methods , 2020, Applied Computing and Informatics.

[14]  Yasen Jiao,et al.  Performance measures in evaluating machine learning based bioinformatics predictors for classifications , 2016, Quantitative Biology.

[15]  C. Grohé Pneumonie , 2020, DMW - Deutsche Medizinische Wochenschrift.

[16]  Darwin Prasetio,et al.  Predicting football match results with logistic regression , 2016, 2016 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA).

[17]  Hiroshi Yadohisa,et al.  F-measure Maximizing Logistic Regression , 2019, ArXiv.

[18]  Yang Wang,et al.  Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. , 2018, Cancer genomics & proteomics.

[19]  Okeke Stephen,et al.  An Efficient Deep Learning Approach to Pneumonia Classification in Healthcare , 2019, Journal of healthcare engineering.

[20]  Michael Grass,et al.  Comparison of Deep Learning Approaches for Multi-Label Chest X-Ray Classification , 2018, Scientific Reports.

[21]  Peng Li,et al.  Telecom Customer Churn Prediction Method Based on Cluster Stratified Sampling Logistic Regression , 2014, IOT 2014.

[22]  Kueh Lee Hui,et al.  A nonlinear decision tree based classification approach to predict the Parkinson's disease using different feature sets of voice data , 2018, 2018 20th International Conference on Advanced Communication Technology (ICACT).