Principal component analysis-based features generation combined with ellipse models-based classification criterion for a ventricular septal defect diagnosis system

In this study, a simple and efficient diagnostic system is proposed for the diagnosis of a ventricular septal defect (VSD). It adopts a novel methodology consisting of principal component analysis (PCA)-based feature generation and an ellipse models-based classification criterion. The system is implemented in three stages. In stage 1, the heart sound is collected with a 3M-3200 electronic stethoscope and preprocessed using wavelet decomposition. In stage 2, the PCA-based diagnostic features, $$[y_{1}, y_{2}]$$, are generated from the time-frequency feature matrix ($${\text{TFFM}}$$). To build the matrix $${\text{TFFM}}$$, the time domain features $$[T_{12}, T_{11}]$$ are first extracted from the time domain envelope $$E_{\text{T}}$$ of the filtered heart sound signal $$X_{\text{T}}$$, and the frequency domain features $$[F_{\text{G}}, F_{\text{W}}]$$ are subsequently extracted from the frequency domain envelope $$E_{\text{F}}$$ of each heart sound cycle, which is segmented automatically via the short time modified Hilbert transform (STMHT). In stage 3, support vector machines-based classification boundary curves for the dataset $$[y_{1}, y_{2}]$$ are first generated, and least-squares-based ellipse models are subsequently built for these boundary curves. Finally, based on the ellipse models, the classification criterion is defined for the diagnosis of VSD sounds. The proposed diagnostic system is validated with sounds from the internet and with sounds from clinical heart disease cases. Moreover, to validate the usefulness of the proposed diagnostic system through comparative analysis, mitral regurgitation and aortic stenosis sounds are used as detection examples. As a result, this study achieves higher classification accuracy than the other methods: $$95.5\%$$, $$92.1\%$$, $$96.2\%$$ and $$99.0\%$$ for diagnosing small VSD, moderate VSD, large VSD and normal sounds, respectively.
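
The feature generation and ellipse steps can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`pca_features`, `fit_ellipse`, `inside_ellipse`), the general conic parameterisation, and the "inside the ellipse" decision rule are assumptions made for illustration, and the boundary points are synthetic rather than produced by a support vector machine. It assumes TFFM is stored with one heart sound cycle per row and the four features $$[T_{12}, T_{11}, F_{\text{G}}, F_{\text{W}}]$$ as columns.

```python
import numpy as np


def pca_features(tffm, n_components=2):
    """Project each row of TFFM onto its leading principal components.

    Returns an (n_samples, n_components) array, i.e. [y1, y2] by default.
    """
    centered = tffm - tffm.mean(axis=0)
    # Rows of vt are the principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


def fit_ellipse(boundary_points):
    """Least-squares fit of a*x^2 + b*x*y + c*y^2 + d*x + e*y = 1.

    The points are shifted to their centroid first, so the origin of the
    fitted coordinates lies inside the curve (assumed closed and convex).
    Returns (coeffs, centroid).
    """
    centroid = boundary_points.mean(axis=0)
    x, y = (boundary_points - centroid).T
    design = np.column_stack([x * x, x * y, y * y, x, y])
    coeffs, *_ = np.linalg.lstsq(design, np.ones(len(x)), rcond=None)
    return coeffs, centroid


def inside_ellipse(model, point):
    """Hypothetical criterion: True if the point lies inside the fitted ellipse."""
    coeffs, centroid = model
    x, y = np.asarray(point, dtype=float) - centroid
    a, b, c, d, e = coeffs
    return bool(a * x * x + b * x * y + c * y * y + d * x + e * y < 1.0)


# Synthetic stand-in for one class boundary curve in the (y1, y2) plane.
t = np.linspace(0.0, 2.0 * np.pi, 60, endpoint=False)
rot = np.array([[np.cos(0.5), -np.sin(0.5)], [np.sin(0.5), np.cos(0.5)]])
boundary = np.column_stack([2.0 * np.cos(t), np.sin(t)]) @ rot.T + [3.0, -1.0]
model = fit_ellipse(boundary)
```

With one such model per class, a new sample would first be mapped to $$[y_{1}, y_{2}]$$ with `pca_features` and then tested against each ellipse with `inside_ellipse`. The actual criterion defined in the paper may differ from this simple inside/outside test.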
