Detection of different voice diseases based on the nonlinear characterization of speech signals

A novel methodology to characterize voice diseases using nonlinear dynamics.Use of complexity measures based on the analysis of the time delay embedded space.Transformation of the feature space using a Discrete Hidden Markov Model.The methodology validated on three different datasets with different voice diseases. This work describes a novel methodology to characterize voice diseases by using nonlinear dynamics, considering different complexity measures that are mainly based on the analysis of the time delay embedded space. The feature space is represented with a DHMM and a further transformation of the DHMM states to a hyperdimensional space is performed. The discrimination between healthy and pathological speech signals is peformed by using a RBF-SVM which is trained following a K-fold cross-validation strategy. Results of around 99% of accuracy are obtained for three different voice disorders, disphonia due to laryngeal pathologies, hypernasality due to cleft lip and palate, and dysarthria due to Parkinson's disease.

[1]  J.H.L. Hansen,et al.  A noninvasive technique for detecting hypernasal speech using a nonlinear operator , 1996, IEEE Transactions on Biomedical Engineering.

[2]  Pedro Gómez Vilda,et al.  Automatic detection of voice impairments from text-dependent running speech , 2009, Biomed. Signal Process. Control..

[3]  A Giovanni,et al.  Objective voice analysis for dysphonic patients: a multiparametric protocol including acoustic and aerodynamic measurements. , 2001, Journal of voice : official journal of the Voice Foundation.

[4]  Fraser,et al.  Independent coordinates for strange attractors from mutual information. , 1986, Physical review. A, General physics.

[5]  Jesús Francisco Vargas-Bonilla,et al.  Analysis of Speech from People with Parkinson's Disease through Nonlinear Dynamics , 2013, NOLISP.

[6]  Jesús Francisco Vargas-Bonilla,et al.  Characterization Methods for the Detection of Multiple Voice Disorders: Neurological, Functional, and Laryngeal Diseases , 2015, IEEE Journal of Biomedical and Health Informatics.

[7]  H. Abarbanel,et al.  Determining embedding dimension for phase-space reconstruction using a geometrical construction. , 1992, Physical review. A, Atomic, molecular, and optical physics.

[8]  H. Kantz,et al.  Nonlinear time series analysis , 1997 .

[9]  H. Herzel,et al.  Bifurcations in an asymmetric vocal-fold model. , 1995, The Journal of the Acoustical Society of America.

[10]  Schuster,et al.  Easily calculable measure for the complexity of spatiotemporal patterns. , 1987, Physical review. A, General physics.

[11]  Miguel Angel Ferrer-Ballester,et al.  Automatic Detection of Pathologies in The Voice by HOS Based Parameters , 2001, EURASIP J. Adv. Signal Process..

[12]  Pedro Gómez Vilda,et al.  Methodological issues in the development of automatic systems for voice pathology detection , 2006, Biomed. Signal Process. Control..

[13]  Lu Wang,et al.  Gaussian kernel approximate entropy algorithm for analyzing irregularity of time-series , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[14]  M. Rosenstein,et al.  A practical method for calculating largest Lyapunov exponents from small data sets , 1993 .

[15]  Lingyun Gu,et al.  Disordered Speech Assessment Using Automatic Methods Based on Quantitative Measures , 2005, EURASIP J. Adv. Signal Process..

[16]  D. Jamieson,et al.  Identification of pathological voices using glottal noise measures. , 2000, Journal of speech, language, and hearing research : JSLHR.

[17]  Max A. Little,et al.  Accurate Telemonitoring of Parkinson's Disease Progression by Noninvasive Speech Tests , 2009, IEEE Transactions on Biomedical Engineering.

[18]  Stefan Hadjitodorov,et al.  A computer system for acoustic analysis of pathological voices and laryngeal diseases screening. , 2002, Medical engineering & physics.

[19]  B Boyanov,et al.  Acoustic analysis of pathological voices. A voice analysis system for the screening of laryngeal diseases. , 1997, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[20]  E. H. Lloyd,et al.  Long-Term Storage: An Experimental Study. , 1966 .

[21]  H. Kasuya,et al.  Normalized noise energy as an acoustic measure to evaluate pathologic voice. , 1986, The Journal of the Acoustical Society of America.

[22]  H. M. Teager,et al.  Evidence for Nonlinear Sound Production Mechanisms in the Vocal Tract , 1990 .

[23]  Jack J. Jiang,et al.  Using Rate of Divergence as an Objective Measure to Differentiate between Voice Signal Types Based on the Amount of Disorder in the Signal. , 2017, Journal of voice : official journal of the Voice Foundation.

[24]  B. Boyanov,et al.  Robust hybrid pitch detector for pathologic voice analysis , 1997 .

[25]  Farshad Almasganj,et al.  Pathological assessment of patients' speech signals using nonlinear dynamical analysis , 2010, Comput. Biol. Medicine.

[26]  Hideki Kasuya,et al.  Novel acoustic measurements of jitter and shimmer characteristics from pathological voice , 1993, EUROSPEECH.

[27]  Miguel Angel Ferrer-Ballester,et al.  Characterization of Healthy and Pathological Voice Through Measures Based on Nonlinear Dynamics , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[28]  Ping Yu,et al.  Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale , 2016, J. Signal Process. Syst..

[29]  Max A. Little,et al.  Exploiting Nonlinear Recurrence and Fractal Scaling Properties for Voice Disorder Detection , 2007 .

[30]  Jesús Francisco Vargas-Bonilla,et al.  Voice pathology detection in continuous speech using nonlinear dynamics , 2012, 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA).

[31]  M. Ramasubba Reddy,et al.  The analysis on band-limited hypernasal speech using group delay based formant extraction technique , 2005, INTERSPEECH.

[32]  Jack J Jiang,et al.  Optimized Nonlinear Dynamic Analysis of Pathologic Voices With Laryngeal Paralysis Based on the Minimum Embedding Dimension. , 2017, Journal of voice : official journal of the Voice Foundation.

[33]  J. Jiang,et al.  Modeling of chaotic vibrations in symmetric vocal folds. , 2001, The Journal of the Acoustical Society of America.

[34]  John H. L. Hansen,et al.  Recent advances in hypernasal speech detection using the nonlinear Teager energy operator , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[35]  Jack J Jiang,et al.  Chaos in voice, from modeling to measurement. , 2006, Journal of voice : official journal of the Voice Foundation.

[36]  Henry D. I. Abarbanel,et al.  Analysis of Observed Chaotic Data , 1995 .

[37]  Jesús Francisco Vargas-Bonilla,et al.  Spectral and cepstral analyses for Parkinson's disease detection in Spanish vowels and words , 2015, Expert Syst. J. Knowl. Eng..

[38]  Dirk Michaelis,et al.  Acoustic "breathiness measures" in the description of pathologic voices , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[39]  Erhan Demirhan,et al.  Acoustic Voice Analysis of Young Turkish Speakers. , 2016, Journal of voice : official journal of the Voice Foundation.

[40]  Germán Castellanos-Domínguez,et al.  An improved method for voice pathology detection by means of a HMM-based feature space transformation , 2010, Pattern Recognit..

[41]  Madalena Costa,et al.  Multiscale entropy analysis of biological signals. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[42]  Meng Joo Er,et al.  An effective semi-cross-validation model selection method for extreme learning machine with ridge regression , 2015, Neurocomputing.

[43]  C. Newman,et al.  The Voice Handicap Index (VHI)Development and Validation , 1997 .

[44]  Antanas Verikas,et al.  Fusing voice and query data for non-invasive detection of laryngeal disorders , 2015, Expert Syst. Appl..

[45]  Antanas Verikas,et al.  Combined Use of Standard and Throat Microphones for Measurement of Acoustic Voice Parameters and Voice Categorization. , 2015, Journal of voice : official journal of the Voice Foundation.

[46]  Jack J. Jiang,et al.  Nonlinear dynamic analysis in signal typing of pathological human voices , 2003 .

[47]  John H. L. Hansen,et al.  Detection of hypernasal speech using a nonlinear operator , 1994, Proceedings of 16th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[48]  Johan A. K. Suykens,et al.  Fixed-Size Least Squares Support Vector Machines: Scala Implementation for Large Scale Classification , 2015, 2015 IEEE Symposium Series on Computational Intelligence.

[49]  John H. L. Hansen,et al.  A screening test for speech pathology assessment using objective quality measures , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[50]  D. Berry,et al.  Analysis of vocal disorders with methods from nonlinear dynamics. , 1994, Journal of speech and hearing research.

[51]  T. Baer,et al.  Harmonics-to-noise ratio as an index of the degree of hoarseness. , 1982, The Journal of the Acoustical Society of America.

[52]  Jesús Francisco Vargas-Bonilla,et al.  Nonlinear Dynamics for Hypernasality Detection in Spanish Vowels and Words , 2012, Cognitive Computation.

[53]  F. Takens Detecting strange attractors in turbulence , 1981 .

[54]  Germán Castellanos-Domínguez,et al.  Automatic Detection of Pathological Voices Using Complexity Measures, Noise Parameters, and Mel-Cepstral Coefficients , 2011, IEEE Transactions on Biomedical Engineering.

[55]  D. Narayana Dutt,et al.  Nonlinear Dynamical Analysis of Speech Signals , 2013 .

[56]  Siegfried Piepenbrock,et al.  In vivo myograph measurement of muscle contraction at optimal length , 2007, Biomedical engineering online.

[57]  Jesús Francisco Vargas-Bonilla,et al.  New Spanish speech corpus database for the analysis of people suffering from Parkinson’s disease , 2014, LREC.

[58]  Jesús B. Alonso,et al.  Hand shape identification on multirange images , 2014, Inf. Sci..