New Feature Extraction from Electroglottographic Signals Applied to Automatic Detection of Laryngeal Pathologies

The objective of this report is to design a classification mechanism that uses electroglottography to help distinguish between healthy and pathological subjects, and to maximize the efficiency of electroglottography through an optimal configuration of the parameters of an SVM (Support Vector Machine) classifier. The proposed system parameterizes electroglottographic signals obtained from an open database, the Saarbruecken Voice Database, and extracts the most relevant features in the time, frequency and cepstral domains. The samples are then classified with an SVM. The study evaluates different combinations of parameters and features in order to identify the appropriate configuration, considering the recorded vowel, the type of windowing, the percentages of data used for SVM training and the values of the SVM parameters. The classification results are compared with the real data to obtain the performance values of the system (precision, sensitivity and specificity) for each feature configuration considered. The best results are obtained with the vowel I, 30 ms windowing with 50% overlap, training percentages of around 80–90% (PES higher than PEP), and γ and σ² values of 100 and 0.1, respectively. This study is expected to contribute to the knowledge of electroglottography-based classification methods as an aid in diagnosing laryngeal diseases.
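As an illustration of two steps named above, the following minimal Python sketch splits a signal into 30 ms windows with 50% overlap and computes performance values from predicted and true labels. It is not the authors' implementation: the function names `frame_signal` and `confusion_metrics`, the label convention (1 = pathological, 0 = healthy) and the sampling rate used in the example are assumptions made for illustration, and both accuracy and precision are reported because the abstract does not define "precision" formally.

```python
from typing import Dict, List, Sequence


def frame_signal(signal: Sequence[float], fs: int,
                 win_ms: float = 30.0, overlap: float = 0.5) -> List[List[float]]:
    """Split a signal into fixed-length frames (hypothetical helper).

    win_ms is the window length in milliseconds and overlap is the
    fraction shared by consecutive frames (0.5 means 50% overlap).
    Trailing samples that do not fill a whole frame are discarded.
    """
    win_len = int(round(fs * win_ms / 1000.0))
    hop = max(1, int(round(win_len * (1.0 - overlap))))
    frames = []
    for start in range(0, len(signal) - win_len + 1, hop):
        frames.append(list(signal[start:start + win_len]))
    return frames


def confusion_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute performance values, assuming 1 = pathological, 0 = healthy."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    def ratio(num: int, den: int) -> float:
        # Return 0.0 instead of failing when a class is absent.
        return num / den if den else 0.0

    return {
        "accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "precision": ratio(tp, tp + fp),
        "sensitivity": ratio(tp, tp + fn),   # true positive rate
        "specificity": ratio(tn, tn + fp),   # true negative rate
    }


if __name__ == "__main__":
    fs = 1000                                 # assumed sampling rate in Hz
    toy_signal = [float(i) for i in range(90)]
    frames = frame_signal(toy_signal, fs)     # 30-sample frames, 15-sample hop
    print(len(frames), len(frames[0]))
    print(confusion_metrics([1, 1, 1, 0, 0, 0, 0, 1],
                            [1, 1, 0, 0, 0, 1, 0, 1]))
```

In a full pipeline, features computed per frame would be passed to the classifier, and the metrics would be recomputed for each combination of vowel, windowing, training percentage and SVM parameter values.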
