Dynamic Feature Extraction: an Application to Voice Pathology Detection

Abstract In pattern recognition, observations are often represented by the so called static features, that is, numeric values that represent some kind of attribute from observations, which are assumed constant with respect to an associated dimension or dimensions (e.g. time, space, and so on). Nevertheless, we can represent the objects to be classified by means of another kind of measurements that do change over some associated dimension: these are called dynamic features. A dynamic feature can be represented by either a vector or a matrix for each observation. The advantage of using such an extended form is the inclusion of new information that gives abetter representation of the object. The main goal in this work is to extend traditional Principal Component Analysis (normally applied on static features) to a classification task using a dynamic representation. The method was applied to detect the presence of pathology in the speech using two different voice disorders databases, obtaining high classificat...

[1]  Hans Werner Strube,et al.  Glottal-to-Noise Excitation Ratio - a New Measure for Describing Pathological Voices , 1997 .

[2]  B Boyanov,et al.  Acoustic analysis of pathological voices. A voice analysis system for the screening of laryngeal diseases. , 1997, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[3]  Pedro Gómez Vilda,et al.  Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-Term Cepstral Parameters , 2006, IEEE Transactions on Biomedical Engineering.

[4]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[5]  Douglas C. Montgomery,et al.  Applied Statistics and Probability for Engineers, Third edition , 1994 .

[6]  Kuldip K. Paliwal,et al.  Feature extraction and dimensionality reduction algorithms and their applications in vowel recognition , 2003, Pattern Recognit..

[7]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[8]  Andrew R. Webb,et al.  Statistical Pattern Recognition , 1999 .

[9]  T. Moon The expectation-maximization algorithm , 1996, IEEE Signal Process. Mag..

[10]  Pedro Gómez Vilda,et al.  Methodological issues in the development of automatic systems for voice pathology detection , 2006, Biomed. Signal Process. Control..

[11]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Guus de Krom,et al.  A Cepstrum-Based Technique for Determining a Harmonics-to-Noise Ratio in Speech Signals , 1993 .

[13]  Alvin F. Martin,et al.  The DET curve in assessment of detection task performance , 1997, EUROSPEECH.

[14]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[15]  H. Kasuya,et al.  Normalized noise energy as an acoustic measure to evaluate pathologic voice. , 1986, The Journal of the Acoustical Society of America.

[16]  María Victoria Rodellar Biarge,et al.  Principal component analysis of spectral perturbation parameters for voice pathology detection , 2005, 18th IEEE Symposium on Computer-Based Medical Systems (CBMS'05).

[17]  Tim Ritchings,et al.  Pathological voice quality assesment using artificial neural networks , 2001, MAVEBA.

[18]  Stefan Hadjitodorov,et al.  A computer system for acoustic analysis of pathological voices and laryngeal diseases screening. , 2002, Medical engineering & physics.

[19]  S. Feijóo,et al.  Short-term stability measures for the evaluation of vocal quality. , 1990, Journal of speech and hearing research.

[20]  C Manfredi,et al.  A comparative analysis of fundamental frequency estimation methods with application to pathological voices. , 2000, Medical engineering & physics.

[21]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[22]  Alex Pentland,et al.  Face recognition using eigenfaces , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23]  D. Jamieson,et al.  Identification of pathological voices using glottal noise measures. , 2000, Journal of speech, language, and hearing research : JSLHR.