Characterization of $S_1$ and $S_2$ Heart Sounds Using Stacked Autoencoder and Convolutional Neural Network

This paper proposes a new technique for the identification of fundamental heart sounds (HSs), namely, <inline-formula> <tex-math notation="LaTeX">$S_{1}$ </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">$S_{2}$ </tex-math></inline-formula> from the cardiac cycle, the first and foremost step in the automated HSs analysis for the detection of pathological events, without incorporating time-interval informations between <inline-formula> <tex-math notation="LaTeX">$S_{1}$ </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">$S_{2}$ </tex-math></inline-formula> or electrocardiogram signal as reference. The motive of this paper is to demonstrate that the reliable <inline-formula> <tex-math notation="LaTeX">$S_{1}$ </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">$S_{2}$ </tex-math></inline-formula> classification performances based on the combinatory feature (CF) derived from higher order moments and cepstral-based domain can still be achieved, under the circumstances where the timing interval information might not be easily understood due to cardiac abnormalities. Using deep neural networks approach, a stacked autoencoder (SAE) based on the CF is proposed for the classification of fundamental HSs. Experiments are conducted on both publicly available and recorded HSs signals for the validation of the proposed method. The SAE using the proposed CF achieves better classification results in recognizing <inline-formula> <tex-math notation="LaTeX">$S_{1}$ </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">$S_{2}$ </tex-math></inline-formula> in comparison to well-known classifiers such as deep belief neural network, support vector machine, Naive Bayes, linear discriminant analysis, and boosting ensemble. The proposed method shows higher classification rate in terms of accuracy, sensitivity, and specificity by considering CF, which uses Mel-frequency cepstral coefficients and its derivative features. A second approach for addressing the problem of <inline-formula> <tex-math notation="LaTeX">$S_{1}$ </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">$S_{2}$ </tex-math></inline-formula> identifications is carried out by employing 1-D convolutional neural network that uses the signals directly to learn the relevant features by its own for the recognition.

[1]  F. Fleuret Fast Binary Feature Selection with Conditional Mutual Information , 2004, J. Mach. Learn. Res..

[2]  Jin Jiang,et al.  Heart sound analysis using the S transform , 2000, Computers in Cardiology 2000. Vol.27 (Cat. 00CH37163).

[3]  I. Hartimo,et al.  Heart sound segmentation algorithm based on heart sound envelogram , 1997, Computers in Cardiology 1997.

[4]  Bin Li,et al.  Detection of Double Compressed AMR Audio Using Stacked Autoencoder , 2017, IEEE Transactions on Information Forensics and Security.

[5]  Leontios J. Hadjileontiadis,et al.  Efficient Heart Sound Segmentation and Extraction Using Ensemble Empirical Mode Decomposition and Kurtosis Features , 2014, IEEE Journal of Biomedical and Health Informatics.

[6]  L. Senhadji,et al.  Analysis-synthesis of the phonocardiogram based on the matching pursuit method , 1998, IEEE Transactions on Biomedical Engineering.

[7]  Paul S. Wright,et al.  Short-time Fourier transforms and Wigner-Ville distributions applied to the calibration of power frequency harmonic analyzers , 1999, IEEE Trans. Instrum. Meas..

[8]  J J Struijk,et al.  Segmentation of heart sound recordings by a duration-dependent hidden Markov model , 2010, Physiological measurement.

[9]  Lionel Tarassenko,et al.  Logistic Regression-HSMM-Based Heart Sound Segmentation , 2016, IEEE Transactions on Biomedical Engineering.

[10]  I. Kamarulafizam,et al.  Heart sound segmentation algorithm based on instantaneous energy of electrocardiogram , 2003, Computers in Cardiology, 2003.

[11]  Rabab Kreidieh Ward,et al.  Semi-supervised Stacked Label Consistent Autoencoder for Reconstruction and Analysis of Biomedical Signals , 2017, IEEE Transactions on Biomedical Engineering.

[12]  Anirban Mukherjee,et al.  Segmentation and detection of first and second heart sounds (Si and S2) using variational mode decomposition , 2016, 2016 IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES).

[13]  Anirban Mukherjee,et al.  Grading heart sounds through variational mode decomposition and higher order spectral features , 2018, 2018 IEEE International Instrumentation and Measurement Technology Conference (I2MTC).

[14]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Jozef Juhar,et al.  Comparison of feature selection algorithms for acoustic event detection system , 2014, Proceedings ELMAR-2014.

[16]  Jane Labadin,et al.  Feature selection based on mutual information , 2015, 2015 9th International Conference on IT in Asia (CITA).

[17]  Yu Tsao,et al.  S1 and S2 Heart Sound Recognition Using Deep Neural Networks , 2017, IEEE Transactions on Biomedical Engineering.

[18]  A. Trotta,et al.  Application of Wigner-Ville distribution to measurements on transient signals , 1993, 1993 IEEE Instrumentation and Measurement Technology Conference.

[19]  Noureddine Zerhouni,et al.  Bearing Health Monitoring Based on Hilbert–Huang Transform, Support Vector Machine, and Regression , 2015, IEEE Transactions on Instrumentation and Measurement.

[20]  Yong Wang,et al.  Conditional Mutual Information‐Based Feature Selection Analyzing for Synergy and Redundancy , 2011, ETRI Journal.

[21]  Anirban Mukherjee,et al.  Detection of Third Heart Sound Using Variational Mode Decomposition , 2018, IEEE Transactions on Instrumentation and Measurement.

[22]  Gregorio Andria,et al.  Interpolated smoothed pseudo Wigner-Ville distribution for accurate spectrum analysis , 1996 .

[23]  Colas Schretter,et al.  Information-Theoretic Feature Selection in Microarray Data Using Variable Complementarity , 2008, IEEE Journal of Selected Topics in Signal Processing.

[24]  Aleks Jakulin Machine Learning Based on Attribute Interactions , 2005 .

[25]  Luca Fanucci,et al.  Sensing Devices and Sensor Signal Processing for Remote Monitoring of Vital Signs in CHF Patients , 2013, IEEE Transactions on Instrumentation and Measurement.

[26]  Dahua Lin,et al.  Conditional Infomax Learning: An Integrated Framework for Feature Extraction and Fusion , 2006, ECCV.

[27]  Gianluca Bontempi,et al.  On the Use of Variable Complementarity for Feature Selection in Cancer Classification , 2006, EvoWorkshops.

[28]  Roberto Battiti,et al.  Using mutual information for selecting features in supervised neural net learning , 1994, IEEE Trans. Neural Networks.

[29]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[30]  Feng Jiang,et al.  Measurement of Duration, Energy of Instantaneous Frequencies, and Splits of Subcomponents of the Second Heart Sound , 2015, IEEE Transactions on Instrumentation and Measurement.

[31]  L. Sakari,et al.  A heart sound segmentation algorithm using wavelet decomposition and reconstruction , 1997, Proceedings of the 19th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 'Magnificent Milestones and Emerging Opportunities in Medical Engineering' (Cat. No.97CH36136).

[32]  Carlos A. F. da Rocha,et al.  Equalization of Crackle Sounds to Compensate Thorax Attenuation , 2014, IEEE Transactions on Instrumentation and Measurement.

[33]  Gavin Brown,et al.  Conditional Likelihood Maximisation: A Unifying Framework for Information Theoretic Feature Selection , 2012, J. Mach. Learn. Res..