Automatic detection of laryngeal pathologies using cepstral analysis in Mel and Bark scales

Problems in voice production can appear due to functional disorders and laryngeal pathologies. The presence of laryngeal pathologies can causes significant changes in the vibrational patterns of the vocal folds and it is demonstrated that the impact of such pathologies can be reduced through continuous speech therapy. We propose a methodology based on non-parametric cepstral coefficients in Mel and Bark scales. The most relevant features are automatically selected using two algorithms, one is based on Principal Components Analysis (PCA) and other is based on Sequential Floating Features Selection (SFFS). In order to decide whether a voice recording is healthy or pathological, four different classifiers are implemented: linear and quadratic Bayesian, K nearest neighbors and Parzen. The best result was 89.18%, it was obtained from the union between MFCC and BFCC.

[1]  Josef Kittler,et al.  Floating search methods in feature selection , 1994, Pattern Recognit. Lett..

[2]  D. Jamieson,et al.  Identification of pathological voices using glottal noise measures. , 2000, Journal of speech, language, and hearing research : JSLHR.

[3]  Jean-François Bonastre,et al.  Frequency study for the characterization of the dysphonic voices , 2007, INTERSPEECH.

[4]  B Boyanov,et al.  Acoustic analysis of pathological voices. A voice analysis system for the screening of laryngeal diseases. , 1997, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[5]  D. Jamieson,et al.  Acoustic discrimination of pathological voice: sustained vowels versus continuous speech. , 2001, Journal of speech, language, and hearing research : JSLHR.

[6]  Stefan Hadjitodorov,et al.  A computer system for acoustic analysis of pathological voices and laryngeal diseases screening. , 2002, Medical engineering & physics.

[7]  William Brent Perceptually Based Pitch Scales in Cepstral Techniques for Percussive Timbre Identification , 2009, ICMC.

[8]  Jesús Francisco Vargas-Bonilla,et al.  Automatic Selection of Acoustic and Non-Linear Dynamic Features in Voice Signals for Hypernasality Detection , 2011, INTERSPEECH.

[9]  R. Fraile,et al.  Automatic Detection of Laryngeal Pathologies in Records of Sustained Vowels by Means of Mel-Frequency Cepstral Coefficient Parameters and Differentiation of Patients by Sex , 2009, Folia Phoniatrica et Logopaedica.

[10]  Stefan Hadjitodorov,et al.  ACOUSTIC ANALYSIS OF PATHOLOGICAL VOICES , 1997 .

[11]  Mario Cannataro,et al.  Early detection of voice diseases via a web-based system , 2007, MAVEBA.

[12]  Pedro Gómez Vilda,et al.  Methodological issues in the development of automatic systems for voice pathology detection , 2006, Biomed. Signal Process. Control..