Large scale feature selection using modified random mutation hill climbing

Feature selection is a critical component of many pattern recognition applications. There are two distinct mechanisms for feature selection, namely the wrapper methods and the filter methods. The filter methods are generally considered inferior to the wrapper method, however wrapper methods are computationally more demanding than filter methods. One of the popular methods for wrapper-based feature selection is random mutation hill climbing. It performs a random search over the feature space to derive the optimal set of features. We would describe two enhancements to this algorithm, one that would improve its convergence time, and the other that would allow us to bias the results towards either higher accuracy or lower final feature space dimensionality. We would apply the algorithm to a real-world massive-scale feature selection problem involving the image classification problem associated with suppressing automobile airbags for children. We would provide classification results on an image database of nearly 4,000 images that indicate the advantages of the proposed method.

[1]  Anil K. Jain,et al.  Feature Selection: Evaluation, Application, and Small Sample Performance , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  David B. Skalak,et al.  Prototype and Feature Selection by Sampling and Random Mutation Hill Climbing Algorithms , 1994, ICML.

[3]  Mineichi Kudo,et al.  Comparison of algorithms that select features for pattern classifiers , 2000, Pattern Recognit..

[4]  David W. Aha,et al.  A Comparative Evaluation of Sequential Feature Selection Algorithms , 1995, AISTATS.

[5]  Pavel Paclík,et al.  Adaptive floating search methods in feature selection , 1999, Pattern Recognit. Lett..

[6]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[7]  Ron Kohavi,et al.  The Wrapper Approach , 1998 .

[8]  Anil K. Jain,et al.  Occupant classification system for automotive airbag suppression , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[9]  Daphne Koller,et al.  Toward Optimal Feature Selection , 1996, ICML.