Automatic Detection System for Cough Sounds as a Symptom of Abnormal Health Condition

The problem of attending to the health of the aged who live alone has became an important issue in developed countries. One way of solving the problem is to check their health condition by a remote-monitoring technique and support them with well-timed treatment. The purpose of this study is to develop an automatic system that can monitor a health condition in real time using acoustical information and detect an abnormal symptom. In this study, cough sound was chosen as a representative acoustical symptom of abnormal health conditions. For the development of the system distinguishing a cough sound from other environmental sounds, a hybrid model was proposed that consists of an artificial neural network (ANN) model and a hidden Markov model (HMM). The ANN model used energy cepstral coefficients obtained by filter banks based on human auditory characteristics as input parameters representing a spectral feature of a sound signal. Subsequently, an output of this ANN model and a filtered envelope of the signal were used for making an input sequence for the HMM that deals with the temporal variation of the sound signal. Compared with the conventional HMM using Mel-frequency cepstral coefficients, the proposed hybrid model improved recognition rates on low SNR from 5 dB down to -10 dB. Finally, a preliminary prototype of the automatic detection system was simply illustrated.

[1]  LinLin Shen,et al.  Combining Wavelets with HMM for Face Recognition , 2003, SGAI Conf..

[2]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[3]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[4]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[5]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[6]  Richard Lippmann,et al.  Neural Network Classifiers Estimate Bayesian a posteriori Probabilities , 1991, Neural Computation.

[7]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[8]  Michel Vacher,et al.  Information extraction from sound for medical telemonitoring , 2006, IEEE Transactions on Information Technology in Biomedicine.

[9]  Lawrence R. Rabiner,et al.  A tutorial on Hidden Markov Models , 1986 .

[10]  N. Cox Statistical Models in Engineering , 1970 .

[11]  E. Owens,et al.  An Introduction to the Psychology of Hearing , 1997 .

[12]  Stephen T. Neely,et al.  Signals, Sound, and Sensation , 1997 .

[13]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[14]  B. Moore An introduction to the psychology of hearing, 3rd ed. , 1989 .

[15]  William M. Hartmann,et al.  Psychoacoustics: Facts and Models , 2001 .

[16]  Paul R. Cohen,et al.  Segmenting time series with a hybrid neural networks - hidden Markov model , 2002, AAAI/IAAI.

[17]  Haikady N. Nagaraja,et al.  Inference in Hidden Markov Models , 2006, Technometrics.

[18]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[19]  Joydeep Ghosh,et al.  A neural network based hybrid system for detection, characterization, and classification of short-duration oceanic signals , 1992 .

[20]  A. Michel,et al.  Analysis and synthesis of a class of neural networks: linear systems operating on a closed hypercube , 1989 .

[21]  Joseph Picone,et al.  Signal modeling techniques in speech recognition , 1993, Proc. IEEE.

[22]  Jacek M. Zurada,et al.  Introduction to artificial neural systems , 1992 .

[23]  W. Cholewa,et al.  Fault Diagnosis: Models, Artificial Intelligence, Applications , 2004 .

[24]  Mark D Skowronski,et al.  Exploiting independent filter bandwidth of human factor cepstral coefficients in automatic speech recognition. , 2004, The Journal of the Acoustical Society of America.

[25]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..