Ensemble Feature Selection Based on the Contextual Merit

Recent research has proved the benefits of using ensembles of classifiers for classification problems. Ensembles constructed by machine learning methods manipulating the training set are used to create diverse sets of accurate classifiers. Different feature selection techniques based on applying different heuristics for generating base classifiers can be adjusted to specific domain characteristics. In this paper we consider and experiment with the contextual feature merit measure as a feature selection heuristic. We use the diversity of an ensemble as evaluation function in our new algorithm with a refinement cycle. We have evaluated our algorithm on seven data sets from UCI. The experimental results show that for all these data sets ensemble feature selection based on the contextual merit and suitable starting amount of features produces an ensemble which with weighted voting never produces smaller accuracy than C4.5 alone with all the features.

[1]  Lars Kai Hansen,et al.  Neural Network Ensembles , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  David J. Hand,et al.  Advances in intelligent data analysis , 2000 .

[3]  Robert E. Schapire,et al.  A Brief Introduction to Boosting , 1999, IJCAI.

[4]  Se June Hong,et al.  Use of Contextaul Information for Feature Ranking and Discretization , 1997, IEEE Trans. Knowl. Data Eng..

[5]  David W. Opitz,et al.  Feature Selection for Ensembles , 1999, AAAI/IAAI.

[6]  Ron Kohavi,et al.  The Wrapper Approach , 1998 .

[7]  David W. Opitz,et al.  Generating Accurate and Diverse Members of a Neural-Network Ensemble , 1995, NIPS.

[8]  Thomas G. Dietterich Machine-Learning Research , 1997, AI Mag..

[9]  George H. John Enhancements to the data mining process , 1997 .

[10]  Edwin P. D. Pednault,et al.  Decomposition of Heterogeneous Classification Problems , 1997, IDA.

[11]  Ron Kohavi,et al.  Data Mining Using MLC a Machine Learning Library in C++ , 1996, Int. J. Artif. Intell. Tools.

[12]  Kagan Tumer,et al.  Dimensionality Reduction Through Classifier Ensembles , 1999 .

[13]  D. Opitz,et al.  Popular Ensemble Methods: An Empirical Study , 1999, J. Artif. Intell. Res..

[14]  Roberto Battiti,et al.  Democracy in neural nets: Voting schemes for classification , 1994, Neural Networks.

[15]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[16]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[17]  Salvatore J. Stolfo,et al.  Pruning Classifiers in a Distributed Meta-Learning System , 1998 .