Wavelet transform and artificial neural networks applied to voice disorders identification

The amount of non-invasive methods of diagnosis has increased due to the need for simple, quick and painless tests. Due to the growth of technology that provides the means for extraction and signal processing, new analytical methods have been developed to understand the complexity of the voice signals. This paper presents a new idea to characterize signals of healthy and pathological voice based on two mathematical tools widely known in the literature, Wavelet Transform (WT) and Artificial Neural Networks. Four classes of samples were used: one from healthy individuals and three from people with vocal fold nodules, Reinke's edema and neurological dysphonia. All the samples were recorded using the vowel /a/ in Brazilian Portuguese. The work shows that the proposed approach using WT is a suitable technique to discriminate between healthy and pathological voices.

[1]  Miguel A. Ferrer,et al.  Using Nonlinear Features for Voice Disorder Detection , 2005 .

[2]  Joseph S. Perkell,et al.  Phonatory function associated with hyperfunctionally related vocal fold lesions , 1990 .

[3]  O. Rioul,et al.  Wavelets and signal processing , 1991, IEEE Signal Processing Magazine.

[4]  Virgilijus Uloza,et al.  Perceptual and acoustic assessment of voice pathology and the efficacy of endolaryngeal phonomicrosurgery. , 2005, Journal of voice : official journal of the Voice Foundation.

[5]  Jason Goodwin,et al.  Voice disorders. , 1991, Otolaryngologic clinics of North America.

[6]  F. Almasganj,et al.  Pathological voice classifcation using local discriminant basis and genetic algorithm , 2008, 2008 16th Mediterranean Conference on Control and Automation.

[7]  J.C. Pereira,et al.  Normal versus pathological voice signals , 2009, IEEE Engineering in Medicine and Biology Magazine.

[8]  Omar Farooq,et al.  Phoneme recognition using wavelet based features , 2003, Inf. Sci..

[9]  Paulo Pontes,et al.  Diferenciação histopatológica e imunoistoquímica das alterações epiteliais no nódulo vocal em relação aos pólipos e ao edema de laringe , 2004 .

[10]  Marcelo de Oliveira Rosa,et al.  Adaptive estimation of residue signal for voice pathology diagnosis , 2000, IEEE Trans. Biomed. Eng..

[11]  O. Kleinsasser [MICROSURGERY OF THE LARYNX]. , 1964, Archiv fur Ohren-, Nasen- und Kehlkopfheilkunde.

[12]  Thais Russomano,et al.  Digital signal processing in the differential diagnosis of benign larynx diseases [Abstract in English] , 2006 .

[13]  Minoru Hirano,et al.  Vocal mechanisms in singing: Laryngological and phoniatric aspects , 1988 .

[14]  Carlos Dias Maciel,et al.  Relative entropy measures applied to healthy and pathological voice characterization , 2009, Appl. Math. Comput..

[15]  Carlos Dias Maciel,et al.  Autoregressive decomposition and pole tracking applied to vocal fold nodule signals , 2007, Pattern Recognit. Lett..

[16]  R. Martins,et al.  Vocal fold nodules: morphological and immunohistochemical investigations. , 2010, Journal of voice : official journal of the Voice Foundation.