Vocal fold pathology detection using modified wavelet-like features and support vector machines

Acoustic analysis is a perspective vocal pathology diagnostic method that can complement (and in some cases replace) other methods, based on direct vocal fold observation. There are different approaches and algorithms for feature extraction from acoustic speech signal and for making decision on their basis. While the second stage implies a choice of a variety of machine learning methods (SVMs, neural networks, etc), the first stage plays crucial part in performance and accuracy of the classification system, providing much more creativity in development of different feature extraction methods. In this paper we present initial study of feature extraction based on wavelets and pseudo-wavelets in the task of vocal pathology diagnostic. A new type of feature vector, based on continuous wavelet and wavelet-like transform of input audio data is proposed. Support vector machine was used as a classifier for testing the feature extraction procedure. The results of our experimental study are shown.

[1]  John H. L. Hansen,et al.  Vocal fold pathology assessment using AM autocorrelation analysis of the Teager energy operator , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[2]  Moncef Gabbouj,et al.  AUDIO-BASED MULTIMEDIA INDEXING AND RETRIEVAL SCHEME IN MUVIS FRAMEWORK , 2003 .

[3]  L. Gavidia-Ceballos,et al.  A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment , 1998, IEEE Transactions on Biomedical Engineering.

[4]  Marcelo de Oliveira Rosa,et al.  Adaptive estimation of residue signal for voice pathology diagnosis , 2000, IEEE Trans. Biomed. Eng..

[5]  Claudia Manfredi,et al.  Adaptive noise energy estimation in pathological speech signals , 2000, IEEE Transactions on Biomedical Engineering.

[6]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Pedro Gómez Vilda,et al.  Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors , 2004, IEEE Transactions on Biomedical Engineering.

[8]  Adnene Cherif,et al.  Analysis of pathological voices by speech processing , 2003, ISSPA.

[9]  Tao Li,et al.  A comparative study on content-based music genre classification , 2003, SIGIR.

[10]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[11]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[12]  John H. L. Hansen,et al.  A screening test for speech pathology assessment using objective quality measures , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[13]  Miguel Angel Ferrer-Ballester,et al.  Automatic Detection of Pathologies in The Voice by HOS Based Parameters , 2001, EURASIP J. Adv. Signal Process..

[14]  S. Mallat A wavelet tour of signal processing , 1998 .