Voice Pathology Recognition and Classification using Noise Related Features

Nowadays, the diseases of the voice increase because of bad social habits and the misuse of voice. These pathologies should be treated from the beginning. Indeed, it is no longer necessary that the diseases of the voice lead to affect the quality of the voice as heard by a listener. The most useful tool for diagnosing such diseases is the Acoustic analysis. We present in this work, new expression parameters in order to clarify the description of the vocal signal. These parameters help to classify the unhealthy voices. They describes essentially the fundamental frequency F0, the Harmonics-to-Noise report HNR, the report Noise to Harmonics Ratio NHR and Detrended Fluctuation Analysis (DFA). The classification is performed on two Saarbruecken Voice and MEEI pathological databases using HTK classifiers. We can classify them into two different type: the first classification is binary which is used for the normal and pathological voices, the second one is called a four-category classification used in spasmodic, polyp, nodule and normal female voices and male speakers. And we studied the effects of these new parameters when combined with the MFCC, Delta, Delta second and Energy coefficients.

[1]  Ghulam Muhammad,et al.  An Investigation of Multidimensional Voice Program Parameters in Three Different Databases for Voice Pathology Detection and Classification. , 2017, Journal of voice : official journal of the Voice Foundation.

[2]  I. Elamvazuthi,et al.  Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques , 2010, ArXiv.

[3]  Tim Ritchings,et al.  Pathological voice quality assesment using artificial neural networks , 2001, MAVEBA.

[4]  M. A. Anusuya,et al.  Front end analysis of speech recognition: a review , 2011, Int. J. Speech Technol..

[5]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[6]  Pedro Gómez Vilda,et al.  Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-Term Cepstral Parameters , 2006, IEEE Transactions on Biomedical Engineering.

[7]  Shigeo Abe Support Vector Machines for Pattern Classification , 2010, Advances in Pattern Recognition.

[8]  Eduardo Lleida,et al.  Voice Pathology Detection on the Saarbrücken Voice Database with Calibration and Fusion of Scores Using MultiFocal Toolkit , 2012, IberSPEECH.

[9]  Aslam Muhammad,et al.  A Speaker Identification System Using MFCC Features with VQ Technique , 2009, 2009 Third International Symposium on Intelligent Information Technology Application.

[10]  Max A. Little,et al.  Exploiting Nonlinear Recurrence and Fractal Scaling Properties for Voice Disorder Detection , 2007 .

[11]  Ghulam Muhammad,et al.  Voice Pathology Detection and Classification Using Auto-Correlation and Entropy Features in Different Frequency Regions , 2018, IEEE Access.

[12]  T.W. Berger,et al.  Pathological Voice Assessment , 2006, 2006 International Conference of the IEEE Engineering in Medicine and Biology Society.

[13]  Miguel Angel Ferrer-Ballester,et al.  Support Vector Machines Applied to the Detection of Voice Disorders , 2005, NOLISP.

[14]  João Paulo Teixeira,et al.  CENTERIS 2013-Conference on ENTERprise Information Systems / HCIST 2013-International Conference on Health and Social Care Information Systems and Technologies Vocal Acoustic Analysis-Jitter , Shimmer and HNR Parameters , 2013 .

[15]  Steve Young,et al.  The HTK book version 3.4 , 2006 .