A comparative analysis of machine learning methods for classification type decision problems in healthcare

Advanced analytical techniques are gaining popularity in addressing complex classification type decision problems in many fields including healthcare and medicine. In this exemplary study, using digitized signal data, we developed predictive models employing three machine learning methods to diagnose an asthma patient based solely on the sounds acquired from the chest of the patient in a clinical laboratory. Although, the performances varied slightly, ensemble models (i.e., Random Forest and AdaBoost combined with Random Forest) achieved about 90% accuracy on predicting asthma patients, compared to artificial neural networks models that achieved about 80% predictive accuracy. Our results show that non-invasive, computerized lung sound analysis that rely on low-cost microphones and an embedded real-time microprocessor system would help physicians to make faster and better diagnostic decisions, especially in situations where x-ray and CT-scans are not reachable or not available. This study is a testament to the improving capabilities of analytic techniques in support of better decision making, especially in situations constraint by limited resources.

[1]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[2]  L Pesu,et al.  Classification of respiratory sounds based on wavelet packet decomposition and learning vector quantization. , 1998, Technology and health care : official journal of the European Society for Engineering and Medicine.

[3]  Dursun Delen,et al.  Predicting the graft survival for heart-lung transplantation patients: An integrated data mining methodology , 2009, Int. J. Medical Informatics.

[4]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[5]  Z. Çatmakaş,et al.  Towards an ARM based low cost and mobile biomedical device test bed for improved multi-channel pulmonary diagnosis , 2009 .

[6]  Dursun Delen,et al.  A machine learning-based approach to prognostic analysis of thoracic transplantations , 2010, Artif. Intell. Medicine.

[7]  N. Malmurugan,et al.  Neural classification of lung sounds using wavelet coefficients , 2004, Comput. Biol. Medicine.

[8]  Y.P. Kahya,et al.  Classifying Respiratory Sounds with Different Feature Sets , 2006, 2006 International Conference of the IEEE Engineering in Medicine and Biology Society.

[9]  Noam Gavriely,et al.  Breath Sounds Methodology , 1995 .

[10]  Christopher J. James,et al.  Semi-blind source separation and extraction techniques applied to multi-channel electroencephalogram and magnetoencephalogram signals , 2009 .

[11]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[12]  Kilian Stoffel,et al.  Theoretical Comparison between the Gini Index and Information Gain Criteria , 2004, Annals of Mathematics and Artificial Intelligence.

[13]  H. Ridvan Öz,et al.  Analysis of Pulmonary Diseases Using Genetic Programming , 2009 .

[14]  小野 啓資 Evaluation of the usefulness of spectral analysis of inspiratory lung sounds recorded with phonopneumography in patients with interstitial pneumonia , 2009 .

[15]  Ingrid Daubechies,et al.  The wavelet transform, time-frequency localization and signal analysis , 1990, IEEE Trans. Inf. Theory.

[16]  Aintree Chest,et al.  Current methods used for computerized respiratory sound analysis , 2000 .

[17]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[18]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[19]  Steve Furber ARM System-on-Chip Architecture , 2000 .

[20]  A. Jensen,et al.  Ripples in Mathematics - The Discrete Wavelet Transform , 2001 .

[21]  Dursun Delen,et al.  Analysis of cancer data: a data mining approach , 2009, Expert Syst. J. Knowl. Eng..

[22]  R. Pauwels,et al.  GLOBAL STRATEGY FOR ASTHMA MANAGEMENT AND PREVENTION , 1996 .

[23]  Andrey Vyshedskiy,et al.  Automated lung sound analysis in patients with pneumonia. , 2004, Respiratory care.

[24]  S. Mallat A wavelet tour of signal processing , 1998 .

[25]  Y P Kahya,et al.  Comparison of AR-based algorithms for respiratory sounds classification. , 1994, Computers in biology and medicine.

[26]  Mohammed Bahoura,et al.  Pattern recognition methods applied to respiratory sounds classification into normal and wheeze classes , 2009, Comput. Biol. Medicine.

[27]  Walt Kester,et al.  The data conversion handbook , 2005 .

[28]  Avinash C. Kak,et al.  PCA versus LDA , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[30]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[31]  Christie M. Fuller,et al.  Analysis of healthcare coverage: A data mining approach , 2009, Expert Syst. Appl..