Audio-cough event detection based on moment theory

Abstract Cough detection has recently been identified as of paramount importance to fully exploit the potential of telemedicine in respiratory conditions and release some of the economic burden of respiratory care in national health systems. Current audio-based cough detection systems are either uncomfortable or not suitable for continuous patient monitoring, since the audio processing methods implemented therein fail to cope with noisy environments such as those where the acquiring device is carried in the pocket (e.g. smartphone). Moment theory has been widely applied in a number of complex problems involving image processing, computer vision, and pattern recognition. Their invariance properties and noise robustness make them especially suitable as “signature” features enabling character recognition or texture analysis. A natural extension of moment theory to one-dimensional signals is the identification of meaningful patterns in audio signals. However, to the best of our knowledge only marginal attempts have been made in this direction. This paper applies moment theory to perform cough detection in noisy audio signals. Our proposal adopts the first steps used to extract Mel frequency cepstral coefficients (time-frequency decomposition and application of a filter bank defined in the Mel scale) while the innovation is introduced in the second step, where energy patterns defined for specific temporal frames and frequency bands are characterised using moment theory. Our results show the feasibility of using moment theory to solve such a problem in a variety of noise conditions, with sensitivity and specificity values around 90%, significantly outperforming popular state-of-the-art feature sets.

[1]  Chong Chen,et al.  Cough detection by ensembling multiple frequency subband features , 2017, Biomed. Signal Process. Control..

[2]  Guihua Wen,et al.  Weighted spectral features based on local Hu moments for speech emotion recognition , 2015, Biomed. Signal Process. Control..

[3]  Paul Jones,et al.  COPD uncovered: an international survey on the impact of chronic obstructive pulmonary disease [COPD] on a working age population , 2011, BMC public health.

[4]  Nicolai Petkov,et al.  Reliable detection of audio events in highly noisy environments , 2015, Pattern Recognit. Lett..

[5]  R. P. Ramachandran,et al.  Robust speaker recognition: a feature-based approach , 1996, IEEE Signal Processing Magazine.

[6]  Thomas Drugman,et al.  Using mutual information in supervised temporal event detection: Application to cough detection , 2014, Biomed. Signal Process. Control..

[7]  Limin Luo,et al.  Moment-Based Approaches in Imaging. 1. Basic Features [A Look At ...] , 2007, IEEE Engineering in Medicine and Biology Magazine.

[8]  Khalid M. Hosny Refined translation and scale Legendre moment invariants , 2010, Pattern Recognit. Lett..

[9]  Gaël Richard,et al.  Musical instrument recognition by pairwise classification strategies , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  Theodoros Giannakopoulos,et al.  Chapter 6 – Audio Segmentation , 2014 .

[11]  S. Birring,et al.  ERS guidelines on the assessment of cough , 2007, European Respiratory Journal.

[12]  J. Flusser,et al.  Moments and Moment Invariants in Pattern Recognition , 2009 .

[13]  Francesc Alías,et al.  Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification , 2012, IEEE Transactions on Multimedia.

[14]  Huihui Wang,et al.  Novel feature extraction method for cough detection using NMF , 2017, IET Signal Process..

[15]  Luis Alejandro Sánchez-Pérez,et al.  Aircraft take-off noises classification based on human auditory’s matched features extraction , 2014 .

[16]  David H. Evans,et al.  Detection of cough signals in continuous audio recordings using hidden Markov models , 2006, IEEE Transactions on Biomedical Engineering.

[17]  Sim Heng Ong,et al.  Image Analysis by Tchebichef Moments , 2001, IEEE Trans. Image Process..

[18]  Huazhong Shu,et al.  Image analysis by discrete orthogonal dual Hahn moments , 2007, Pattern Recognit. Lett..

[19]  Tom J. Moir,et al.  An overview of applications and advancements in automatic sound recognition , 2016, Neurocomputing.

[20]  Raveendran Paramesran,et al.  Image analysis by Krawtchouk moments , 2003, IEEE Trans. Image Process..

[21]  Kofi Odame,et al.  Deep Neural Networks for Identifying Cough Sounds , 2016, IEEE Transactions on Biomedical Circuits and Systems.

[22]  Richard M. Stern,et al.  Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech Recognition , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[23]  Haizhou Li,et al.  Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions , 2011, IEEE Signal Processing Letters.

[24]  Tomasz P. Zielinski,et al.  Joint Application of Audio Spectral Envelope and Tonality Index in an E-Asthma Monitoring System , 2015, IEEE Journal of Biomedical and Health Informatics.

[25]  J. Widdicombe,et al.  What is cough and what should be measured? , 2007, Pulmonary pharmacology & therapeutics.

[26]  Changsheng Xu,et al.  The Era of Interactive Media , 2013, Springer New York.

[27]  Theodoros Giannakopoulos,et al.  Music Information Retrieval , 2014 .

[28]  M. Teague Image analysis via the general theory of moments , 1980 .

[29]  Jaakko Astola,et al.  Audio based solutions for detecting intruders in wild areas , 2012, Signal Process..

[30]  Theodoros Giannakopoulos,et al.  Appendix C – Audio Datasets , 2014 .

[31]  Dimitris E. Koulouriotis,et al.  A unified methodology for the efficient computation of discrete orthogonal image moments , 2009, Inf. Sci..

[32]  Goutam Saha,et al.  Lung sound classification using cepstral-based statistical features , 2016, Comput. Biol. Medicine.

[33]  Kuldip K. Paliwal,et al.  Robust speech recognition in noisy environments based on subband spectral centroid histograms , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[34]  Mita Nasipuri,et al.  A Study of Moment Based Features on Handwritten Digit Recognition , 2016, Appl. Comput. Intell. Soft Comput..

[35]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..

[36]  Nasser Kehtarnavaz,et al.  Real-Time Unsupervised Classification of Environmental Noise Signals , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[37]  Israel Cohen,et al.  Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging , 2003, IEEE Trans. Speech Audio Process..

[38]  Ruzena Bajcsy,et al.  Real-Time Tele-Monitoring of Patients with Chronic Heart-Failure Using a Smartphone: Lessons Learned , 2016, IEEE Transactions on Affective Computing.

[39]  P. Dhanalakshmi,et al.  Pattern classification models for classifying and indexing audio signals , 2011, Eng. Appl. Artif. Intell..

[40]  Samantha J Barry,et al.  The automatic recognition and counting of cough , 2006, Cough.

[41]  Raimon Jané,et al.  Correntropy-Based Spectral Characterization of Respiratory Patterns in Patients With Chronic Heart Failure , 2010, IEEE Transactions on Biomedical Engineering.

[42]  Theodoros Giannakopoulos,et al.  Chapter 5 – Audio Classification , 2014 .

[43]  Riccardo Leonardi,et al.  Analysis, Retrieval and Delivery of Multimedia Content , 2012 .

[44]  Marko Robnik-Sikonja,et al.  Overcoming the Myopia of Inductive Learning Algorithms with RELIEFF , 2004, Applied Intelligence.

[45]  Jamshid Shanbehzadeh,et al.  Fast Zernike wavelet moments for Farsi character recognition , 2007, Image Vis. Comput..

[46]  Miroslaw Pawlak,et al.  On Image Analysis by Moments , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Tom J. Moir,et al.  Noise robust audio surveillance using reduced spectrogram image feature and one-against-all SVM , 2015, Neurocomputing.

[48]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[49]  Steven Kay,et al.  Modern Spectral Estimation: Theory and Application , 1988 .

[50]  D. B. Keenan,et al.  Evaluation of an ambulatory system for the quantification of cough frequency in patients with chronic obstructive pulmonary disease , 2005, Cough.