Towards an evolutionary algorithm: a comparison of two feature selection algorithms

In order to deal with a large number of attributes, probabilistic feature selection algorithms have been proposed. Pure random walk entails mediocre performance in terms of search time. Introducing adaptiveness into a probabilistic algorithm can lead to a more focused search that results in a better search time. We compare two algorithms in search of an efficient but not myopic algorithm for feature selection. Based on the comparative study, we suggest some ways of improvement towards an evolutionary feature selection algorithm for data mining.

[1]  Ron Kohavi,et al.  Wrappers for feature selection , 1997 .

[2]  Dorit S. Hochbaum,et al.  Approximation Algorithms for NP-Hard Problems , 1996 .

[3]  Hiroshi Motoda,et al.  Feature Extraction, Construction and Selection: A Data Mining Perspective , 1998 .

[4]  Haleh Vafaie,et al.  Evolutionary Feature Space Transformation , 1998 .

[5]  Jack Sklansky,et al.  On Automatic Feature Selection , 1988, Int. J. Pattern Recognit. Artif. Intell..

[6]  Kan Chen A general learning algorithm for solving optimization problems and its application to the spin glass problem , 1998 .

[7]  Huan Liu,et al.  A Monotonic Measure for Optimal Feature Selection , 1998, ECML.

[8]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[9]  Ron Kohavi,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998 .

[10]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[11]  Hiroshi Motoda,et al.  Feature Extraction, Construction and Selection , 1998 .

[12]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[13]  Thomas G. Dietterich,et al.  Learning Boolean Concepts in the Presence of Many Irrelevant Features , 1994, Artif. Intell..

[14]  Anil K. Jain,et al.  Feature Selection: Evaluation, Application, and Small Sample Performance , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[16]  Francesc J. Ferri,et al.  Comparative study of techniques for large-scale feature selection* *This work was suported by a SERC grant GR/E 97549. The first author was also supported by a FPI grant from the Spanish MEC, PF92 73546684 , 1994 .

[17]  Kan Chen,et al.  LEARNING ALGORITHM FOR THE UNIFORM GRAPH PARTITIONING PROBLEM , 1998 .

[18]  Gilles Brassard,et al.  Fundamentals of algorithms , 1996 .

[19]  Huan Liu,et al.  Hybrid Search of Feature Subsets , 1998, PRICAI.

[20]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[21]  Daphne Koller,et al.  Toward Optimal Feature Selection , 1996, ICML.

[22]  Rajeev Motwani,et al.  Randomized Algorithms , 1995, SIGA.

[23]  Huan Liu,et al.  A Probabilistic Approach to Feature Selection - A Filter Solution , 1996, ICML.

[24]  Jihoon Yang,et al.  Feature Subset Selection Using a Genetic Algorithm , 1998, IEEE Intell. Syst..

[25]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Feature Subset Selection , 1977, IEEE Transactions on Computers.