Functional Feature Selection by Weighted Projections in Pathological Voice Detection

In this paper, we introduce an adaptation of a multivariate feature selection method to deal with functional features. In our case, observations are described by a set of functions defined over a common domain (e.g. a time interval). The feature selection method consists on combining variable weighting with a feature extraction projection. Although the employed method was primarily intended for observations described by vectors in *** n , we propose a simple extension that allows us to select a set of functional features, which is well suited for classification. This study is complemented by the incorporation of Functional Principal Component Analysis (FPCA) that project functions into a finite dimensional space were we can perform classification easily. Another remarkable property of FPCA is that it can provide insight about the nature of the functional features. The proposed algorithms are tested on a pathological voice detection task. Two databases are considered: Massachusetts Eye and Ear Infirmary Voice Laboratory voice disorders database and Universidad Politecnica de Madrid voice database. As a result, we obtain a canonical function whose time average is enough to reach similar performances to the ones reported in the literature.

[1]  Frédéric Ferraty,et al.  Nonparametric Functional Data Analysis: Theory and Practice (Springer Series in Statistics) , 2006 .

[2]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[3]  B. Silverman,et al.  Functional Data Analysis , 1997 .

[4]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[5]  Andrew R. Webb,et al.  Statistical Pattern Recognition , 1999 .

[6]  J. Friedman Regularized Discriminant Analysis , 1989 .

[7]  Germán Castellanos-Domínguez,et al.  Feature Extraction of Weighted Data for Implicit Variable Selection , 2007, CAIP.

[8]  Pedro Gómez Vilda,et al.  Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-Term Cepstral Parameters , 2006, IEEE Transactions on Biomedical Engineering.

[9]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[10]  Alexander J. Smola,et al.  Learning with Kernels: support vector machines, regularization, optimization, and beyond , 2001, Adaptive computation and machine learning series.

[11]  Germán Castellanos-Domínguez,et al.  Dynamic Feature Extraction: an Application to Voice Pathology Detection , 2009, Intell. Autom. Soft Comput..

[12]  I. Jolliffe Principal Component Analysis , 2002 .

[13]  Paul S. Bradley,et al.  Feature Selection via Mathematical Programming , 1997, INFORMS J. Comput..

[14]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[15]  Lior Wolf,et al.  Feature selection for unsupervised and supervised inference: the emergence of sparsity in a weighted-based approach , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.