Discrete wavelet transform and support vector machine applied to pathological voice signals identification

An algorithm able to classify pathological and normal voice signals based on Daubechies discrete wavelet transform (DWT-db) and support vector machines (SVM) classifier is presented. DWT-db is used for time-frequency analysis giving quantitative evaluation of signal characteristics to identify pathologies in voice signals, particularly nodules in vocal folds, of subjects with different ages for both male and female. After using a linear prediction coefficients (LPC) filter, the signals mean square values of a particular scale from wavelet analysis are entries to a nonlinear least square support vector machine (LS-SVM) classifier, which leads to an adequate larynx pathology classifier which over 95% of classification accuracy.

[1]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[2]  R. Guido,et al.  Trying different wavelets on the search for voice disorders sorting , 2005, Proceedings of the Thirty-Seventh Southeastern Symposium on System Theory, 2005. SSST '05..

[3]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[4]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[5]  Truong Q. Nguyen,et al.  Wavelets and filter banks , 1996 .

[6]  N. Isshiki,et al.  Differential diagnosis of hoarseness. , 1969, Folia phoniatrica.

[7]  Y. Koike Vowel amplitude modulations in patients with laryngeal diseases. , 1969, The Journal of the Acoustical Society of America.

[8]  John R. Williams,et al.  Introduction to wavelets in engineering , 1994 .

[9]  Marcelo de Oliveira Rosa,et al.  Adaptive estimation of residue signal for voice pathology diagnosis , 2000, IEEE Trans. Biomed. Eng..

[10]  D. Bless Measurement of vocal function. , 1991, Otolaryngologic clinics of North America.

[11]  Stefan Todorov Hadjitodorov,et al.  Laryngeal pathology detection by means of class-specific neural maps , 2000, IEEE Transactions on Information Technology in Biomedicine.

[12]  S. Mallat A wavelet tour of signal processing , 1998 .

[13]  Jian Liu,et al.  A new efficient SVM-based edge detection method , 2004, Pattern Recognit. Lett..

[14]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[15]  S. Iwata,et al.  Periodicities of pitch perturbations in normal and pathologic larynges , 1972, The Laryngoscope.

[16]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[17]  Jean Schoentgen,et al.  Time series analysis of jitter , 1995 .

[18]  Christopher J. C. Burges,et al.  Geometry and invariance in kernel based methods , 1999 .

[19]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[20]  Li Zhang,et al.  Wavelet support vector machine , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[21]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .