Pathological assessment of patients' speech signals using nonlinear dynamical analysis

Acoustic analysis of voice features can complete the invasive observation-based methods for the diagnosis of vocal fold pathologies. Selection of an appropriate feature extraction method from the voice can significantly improve the diagnostic results for patients with vocal disorders. In this paper, the performance of nonlinear dynamics and acoustical perturbation features is evaluated in order to distinguish patients with vocal fold disorder and other normal cases. As a matter of fact, vocal fold pathology is one of the major causes of voice quality reduction or feature variation in patients with dysphonic voices. Due to the devastating impact of vocal folds dysfunction on the complex dynamical structure of the speech signals, spectral analysis methods are not suitable for characterizing such changes in disordered voices. Therefore, the using measures that can reflect the nonlinear nature of such changes in the acoustical signals is an efficient alternative for the conventional methods. In order to compare and contrast the effectiveness of such approaches, we exploit features such as correlation dimension, the largest Lyapunov exponent, approximate entropy, fractal dimension and Ziv-Lempel complexity, and we also evaluate their performance with respect to some conventional features like jitter and shimmer, in the voice diagnosis task. Using the support vector machine classifier, our simulation results show that correlation dimension and the largest Lyapunov exponent features with the highest recognition rates of 94.44% and 88.89% can be used as a highly reliable method for the clinical diagnosis of vocal folds pathologies and other relevant applications.

[1]  Zhi-zhong Wang,et al.  Characterization of surface EMG signals using improved approximate entropy , 2006, Journal of Zhejiang University SCIENCE B.

[2]  J Kreiman,et al.  Comparing internal and external standards in voice quality judgments. , 1993, Journal of speech and hearing research.

[3]  Steve McLaughlin,et al.  Speech characterization and synthesis by nonlinear methods , 1999, IEEE Trans. Speech Audio Process..

[4]  H. Kantz,et al.  Nonlinear time series analysis , 1997 .

[5]  Jing Hu,et al.  Analysis of Biomedical Signals by the Lempel-Ziv Complexity: the Effect of Finite Data Size , 2006, IEEE Transactions on Biomedical Engineering.

[6]  H.L. Rufiner,et al.  Study of complexity in normal and pathological speech signals , 2003, Proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (IEEE Cat. No.03CH37439).

[7]  Faruk Göloglu,et al.  On Lempel-Ziv Complexity of Sequences , 2006, SETA.

[8]  Stefan Hadjitodorov,et al.  A computer system for acoustic analysis of pathological voices and laryngeal diseases screening. , 2002, Medical engineering & physics.

[9]  Niels Wessel,et al.  Correlation dimension analysis of heart rate variability in patients with dilated cardiomyopathy , 2005, Comput. Methods Programs Biomed..

[10]  J. Lucero Dynamics of the Vocal Fold Oscillation , 2005 .

[11]  Farshad Almasganj,et al.  Pathological Assesment of Vocal Fold Nodules and Polyp Using Accoustic Perturbation and Phase Space Features , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[12]  Witold Kinsner,et al.  Characterizing chaos through Lyapunov metrics , 2003, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[13]  J. Echauz,et al.  Fractal dimension characterizes seizure onset in epileptic patients , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[14]  Jack J. Jiang,et al.  Perturbation and nonlinear dynamic analyses of voices from patients with unilateral laryngeal paralysis. , 2005, Journal of voice : official journal of the Voice Foundation.

[15]  Jan Svec,et al.  On vibration properties of human vocal folds: voice registers, bifurcations, resonance characteristics, development and application of videokymography , 2000 .

[16]  Shrikanth Narayanan,et al.  Feature analysis for automatic detection of pathological speech , 2002, Proceedings of the Second Joint 24th Annual Conference and the Annual Fall Meeting of the Biomedical Engineering Society] [Engineering in Medicine and Biology.

[17]  M Lampl,et al.  The use of regularity as estimated by approximate entropy to distinguish saltatory growth , 2001, Annals of human biology.

[18]  Brian Litt,et al.  A comparison of waveform fractal dimension algorithms , 2001 .

[19]  Miguel A. Ferrer,et al.  Using Nonlinear Features for Voice Disorder Detection , 2005 .

[20]  Svetlana Alfredovna Borovkova,et al.  Estimation and prediction for nonlinear time series , 1998 .

[21]  Yu Zhang,et al.  Nonlinear dynamic analysis of speech from pathological subjects , 2002 .

[22]  Isao Tokuda,et al.  Detecting synchronizations in an asymmetric vocal fold model from time series data. , 2005, Chaos.

[23]  Pavel Pudil,et al.  Introduction to Statistical Pattern Recognition , 2006 .

[24]  José M. F. Moura,et al.  Biomedical Signal Processing , 2018, Series in BioEngineering.

[25]  M. Rosenstein,et al.  A practical method for calculating largest Lyapunov exponents from small data sets , 1993 .

[26]  Robert X. Gao,et al.  Mechanical Systems and Signal Processing Approximate Entropy as a Diagnostic Tool for Machine Health Monitoring , 2006 .

[27]  Max A. Little,et al.  Nonlinear, Biophysically-Informed Speech Pathology Detection , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[28]  B Boyanov,et al.  Acoustic analysis of pathological voices. A voice analysis system for the screening of laryngeal diseases. , 1997, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[29]  K. Aihara,et al.  Surrogate analysis for detecting nonlinear dynamics in normal vowels. , 2001, The Journal of the Acoustical Society of America.

[30]  Jagadish Nayak,et al.  Identification of voice disorders using speech samples , 2003, TENCON 2003. Conference on Convergent Technologies for Asia-Pacific Region.

[31]  H.L. Rufiner,et al.  Acoustic analysis of speech for detection of laryngeal pathologies , 2000, Proceedings of the 22nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (Cat. No.00CH37143).

[32]  Stefan Hadjitodorov,et al.  ACOUSTIC ANALYSIS OF PATHOLOGICAL VOICES , 1997 .

[33]  Farshad Almasganj,et al.  Optimal selection of wavelet-packet-based features using genetic algorithm in pathological assessment of patients' speech signal with unilateral vocal fold paralysis , 2007, Comput. Biol. Medicine.

[34]  J. A. Stewart,et al.  Nonlinear Time Series Analysis , 2015 .

[35]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[36]  Marta Borowska,et al.  Application of the Lempel-Ziv complexity measure to the analysis of biosignals and medical images , 2005 .

[37]  Richard J. Povinelli,et al.  Speech recognition using reconstructed phase space features , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[38]  Hanspeter Herzel,et al.  Calls out of chaos: the adaptive significance of nonlinear phenomena in mammalian vocal production , 2002, Animal Behaviour.