Analysis and Classification of Voice Pathologies Using Glottal Signal Parameters.

The classification of voice disorders has many applications in health care, in the treatment of disease, and in the design of new medical equipment that helps doctors diagnose voice-related pathologies. This work uses parameters of the glottal signal to help identify two voice disorders associated with pathologies of the vocal folds: nodule and unilateral paralysis. The glottal signal parameters are obtained through a known inverse filtering method and are used as inputs to an Artificial Neural Network, a Support Vector Machine, and a Hidden Markov Model. Each classifier assigns the voice signals to one of three groups, and the results are compared: speakers with a nodule in the vocal folds; speakers with unilateral paralysis of the vocal folds; and speakers with normal voices, that is, with neither a nodule nor unilateral paralysis of the vocal folds. The database is composed of 248 voice recordings (signals of vowel production) containing samples from the three groups. The database used here is larger than those of similar studies, and the classification rate, which reaches 97.2%, is higher than the rates those studies report.
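The three-group classification described above can be illustrated with a minimal sketch. It is not the method used in this work: a simple nearest-centroid classifier stands in for the Artificial Neural Network, Support Vector Machine, and Hidden Markov Model, and the feature vectors are synthetic placeholders, not glottal parameters obtained by inverse filtering. All function names, the number of features, and the 70/30 split are assumptions made for illustration only.

```python
import random
from statistics import mean

# The three groups named in the abstract.
CLASSES = ("normal", "nodule", "unilateral_paralysis")


def make_synthetic_features(n_per_class, n_features=4, seed=0):
    """Return shuffled (vector, label) pairs of placeholder data.

    These are NOT real glottal parameters: each class is an arbitrary
    Gaussian blob, used only so the sketch can run end to end.
    """
    rng = random.Random(seed)
    samples = []
    for k, label in enumerate(CLASSES):
        for _ in range(n_per_class):
            vec = [rng.gauss(3.0 * k, 1.0) for _ in range(n_features)]
            samples.append((vec, label))
    rng.shuffle(samples)
    return samples


def fit_centroids(train):
    """Compute the per-class mean feature vector (stand-in classifier)."""
    centroids = {}
    for label in CLASSES:
        rows = [vec for vec, lab in train if lab == label]
        centroids[label] = [mean(col) for col in zip(*rows)]
    return centroids


def predict(centroids, vec):
    """Assign a feature vector to the class with the nearest centroid."""
    def sq_dist(centroid):
        return sum((a - b) ** 2 for a, b in zip(vec, centroid))
    return min(centroids, key=lambda label: sq_dist(centroids[label]))


def classification_rate(centroids, test):
    """Fraction of held-out samples assigned to their true group."""
    hits = sum(predict(centroids, vec) == lab for vec, lab in test)
    return hits / len(test)


data = make_synthetic_features(n_per_class=30)
split = int(0.7 * len(data))  # assumed 70/30 train/test split
train, test = data[:split], data[split:]
centroids = fit_centroids(train)
rate = classification_rate(centroids, test)
print(f"classification rate on synthetic data: {rate:.1%}")
```

The classification rate here is simply the share of held-out samples assigned to the correct group. The value printed for the synthetic data says nothing about real voice recordings or about the 97.2% reported above.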
