Respiratory sounds classification using cepstral analysis and Gaussian mixture models

The cepstral analysis is proposed with Gaussian mixture models (GMM) method to classify respiratory sounds in two categories: normal and wheezing. The sound signal is divided in overlapped segments, which are characterized by a reduced dimension feature vectors using Mel-frequency cepstral coefficients (MFCC) or subband based cepstral parameters (SBC). The proposed schema is compared with other classifiers: vector quantization (VQ) and multi-layer perceptron (MLP) neural networks. A post processing is proposed to improve the classification results.

[1]  Tamer Ölmez,et al.  Classification of Respiratory Sounds by Using an Artificial Neural Network , 2003, Int. J. Pattern Recognit. Artif. Intell..

[2]  Bülent Sankur,et al.  Two-stage classi cation of respiratory sound patterns , 2003 .

[3]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[4]  M. Bahoura,et al.  New parameters for respiratory sound classification , 2003, CCECE 2003 - Canadian Conference on Electrical and Computer Engineering. Toward a Caring and Humane Technology (Cat. No.03CH37436).

[5]  E. H. Dooijes,et al.  Classification of Asthmatic Breath Sounds: Preliminary Results of the Classifying Capacity of Human Examiners versus Artificial Neural Networks , 1999, Comput. Biomed. Res..

[6]  Douglas A. Reynolds,et al.  Speaker identification and verification using Gaussian mixture speaker models , 1995, Speech Commun..

[7]  A. Noll Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.

[8]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[9]  T. Olmez,et al.  Classification of respiratory sounds by using an artificial neural network , 2001, 2001 Conference Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[10]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  N. Malmurugan,et al.  Neural classification of lung sounds using wavelet coefficients , 2004, Comput. Biol. Medicine.