Characterization of Healthy and Pathological Voice Through Measures Based on Nonlinear Dynamics

In this paper, we propose to quantify the quality of the recorded voice through objective nonlinear measures. Quantification of speech signal quality has been traditionally carried out with linear techniques since the classical model of voice production is a linear approximation. Nevertheless, nonlinear behaviors in the voice production process have been shown. This paper studies the usefulness of six nonlinear chaotic measures based on nonlinear dynamics theory in the discrimination between two levels of voice quality: healthy and pathological. The studied measures are first- and second-order Renyi entropies, the correlation entropy and the correlation dimension. These measures were obtained from the speech signal in the phase-space domain. The values of the first minimum of mutual information function and Shannon entropy were also studied. Two databases were used to assess the usefulness of the measures: a multiquality database composed of four levels of voice quality (healthy voice and three levels of pathological voice); and a commercial database (MEEI Voice Disorders) composed of two levels of voice quality (healthy and pathological voices). A classifier based on standard neural networks was implemented in order to evaluate the measures proposed. Global success rates of 82.47% (multiquality database) and 99.69% (commercial database) were obtained.

[1]  E. Mumolo,et al.  Analysis of normal and pathological voices via short-time fractal dimension , 1992, 1992 14th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[2]  Jack J. Jiang,et al.  Nonlinear dynamic analysis in signal typing of pathological human voices , 2003 .

[3]  Yu Zhang,et al.  Nonlinear dynamic analysis of speech from pathological subjects , 2002 .

[4]  Lingyun Gu,et al.  Disordered Speech Assessment Using Automatic Methods Based on Quantitative Measures , 2005, EURASIP J. Adv. Signal Process..

[5]  Stefan Hadjitodorov,et al.  Robust hybrid pitch detector , 1993 .

[6]  H. M. Teager,et al.  Evidence for Nonlinear Sound Production Mechanisms in the Vocal Tract , 1990 .

[7]  Pedro Gómez Vilda,et al.  Methodological issues in the development of automatic systems for voice pathology detection , 2006, Biomed. Signal Process. Control..

[8]  D. Jamieson,et al.  Acoustic discrimination of pathological voice: sustained vowels versus continuous speech. , 2001, Journal of speech, language, and hearing research : JSLHR.

[9]  Holger Kantz,et al.  Practical implementation of nonlinear time series methods: The TISEAN package. , 1998, Chaos.

[10]  H. Kasuya,et al.  Normalized noise energy as an acoustic measure to evaluate pathologic voice. , 1986, The Journal of the Acoustical Society of America.

[11]  Roland Linder,et al.  Artificial neural network-based classification to screen for dysphonia using psychoacoustic scaling of acoustic voice features. , 2008, Journal of voice : official journal of the Voice Foundation.

[12]  Hideki Kasuya,et al.  Novel acoustic measurements of jitter and shimmer characteristics from pathological voice , 1993, EUROSPEECH.

[13]  James Theiler,et al.  Lacunarity in a best estimator of fractal dimension , 1988 .

[14]  John H. L. Hansen,et al.  A screening test for speech pathology assessment using objective quality measures , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[15]  A. Giovanni,et al.  Experimental study of the effects of surface mucus viscosity on the glottic cycle. , 2004, Journal of voice : official journal of the Voice Foundation.

[16]  H. Abarbanel,et al.  Determining embedding dimension for phase-space reconstruction using a geometrical construction. , 1992, Physical review. A, Atomic, molecular, and optical physics.

[17]  J. Jiang,et al.  Modeling of chaotic vibrations in symmetric vocal folds. , 2001, The Journal of the Acoustical Society of America.

[18]  D. Berry,et al.  Interpretation of biomechanical simulations of normal and chaotic vocal fold oscillations with empirical eigenfunctions. , 1994, The Journal of the Acoustical Society of America.

[19]  Dirk Michaelis,et al.  Acoustic "breathiness measures" in the description of pathologic voices , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[20]  Stefan Hadjitodorov,et al.  A computer system for acoustic analysis of pathological voices and laryngeal diseases screening. , 2002, Medical engineering & physics.

[21]  Iasonas Kokkinos,et al.  Some advances in nonlinear speech modeling using modulations, fractals, and chaos , 2002, 2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628).

[22]  L. Gavidia-Ceballos,et al.  A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment , 1998, IEEE Transactions on Biomedical Engineering.

[23]  John H. L. Hansen,et al.  Vocal fold pathology assessment using AM autocorrelation analysis of the Teager energy operator , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[24]  Miguel A. Ferrer,et al.  Using Nonlinear Features for Voice Disorder Detection , 2005 .

[25]  F. Takens Detecting strange attractors in turbulence , 1981 .

[26]  Henry D I Abarbanel,et al.  False neighbors and false strands: a reliable minimum embedding dimension algorithm. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Miguel Angel Ferrer-Ballester,et al.  Automatic Detection of Pathologies in The Voice by HOS Based Parameters , 2001, EURASIP J. Adv. Signal Process..

[28]  Fraser,et al.  Independent coordinates for strange attractors from mutual information. , 1986, Physical review. A, General physics.

[29]  平野 実 Clinical examination of voice , 1981 .

[30]  H. Herzel,et al.  Bifurcations in an asymmetric vocal-fold model. , 1995, The Journal of the Acoustical Society of America.

[31]  D. Jamieson,et al.  Identification of pathological voices using glottal noise measures. , 2000, Journal of speech, language, and hearing research : JSLHR.

[32]  B. Boyanov,et al.  Robust hybrid pitch detector for pathologic voice analysis , 1997 .

[33]  Jacques Koreman,et al.  FINDING CORRELATES OF VOCAL FOLD ADDUCTION DEFICIENCIES , 1997 .

[34]  Bernard Widrow,et al.  Improving the learning speed of 2-layer neural networks by choosing initial values of the adaptive weights , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[35]  B. Boyanov,et al.  Analysis Of Voiced Speech By Means Of Bispectrum , 1990, IEEE South African Symposium on Communications and Signal Processing.

[36]  John H. L. Hansen,et al.  Recent advances in hypernasal speech detection using the nonlinear Teager energy operator , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[37]  J.H.L. Hansen,et al.  A noninvasive technique for detecting hypernasal speech using a nonlinear operator , 1996, IEEE Transactions on Biomedical Engineering.

[38]  A Giovanni,et al.  Objective voice analysis for dysphonic patients: a multiparametric protocol including acoustic and aerodynamic measurements. , 2001, Journal of voice : official journal of the Voice Foundation.

[39]  Steve McLaughlin,et al.  Dynamical modelling of vowel sounds as a synthesis tool , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[40]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Evaluation of neural classifiers using statistic methods for identification of laryngeal pathologies , 1998, Proceedings 5th Brazilian Symposium on Neural Networks (Cat. No.98EX209).

[41]  T. Baer,et al.  Harmonics-to-noise ratio as an index of the degree of hoarseness. , 1982, The Journal of the Acoustical Society of America.

[42]  K. Harris,et al.  Laryngeal function in phonation and respiration , 1987 .

[43]  B Boyanov,et al.  Acoustic analysis of pathological voices. A voice analysis system for the screening of laryngeal diseases. , 1997, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[44]  J. A. Stewart,et al.  Nonlinear Time Series Analysis , 2015 .