Paper information - Feature Selection using Multiple Streams

Feature selection for supervised learning can be greatly improved by making use of the fact that features often come in classes. For example, in gene expression data, the genes which serve as features may be divided into classes based on their membership in gene families or pathways. When labeling words with senses for word sense disambiguation, features fall into classes including adjacent words, their parts of speech, and the topic and venue of the document the word is in. We present a streamwise feature selection method that allows dynamic generation and selection of features, while taking advantage of the different feature classes, and the fact that they are of different sizes and have different (but unknown) fractions of good features. Experimental results show that our approach provides significant improvement in performance and is computationally less expensive than comparable “batch” methods that do not take advantage of the feature classes and expect all features to be known in advance.
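As a rough illustration of the idea in the abstract, the sketch below runs one stream per feature class and gives each stream its own alpha-investing budget, so a class with many good features earns more chances to add features. It is a minimal sketch under stated assumptions, not the paper's algorithm: the round-robin ordering over classes, the wealth update rule (borrowed from the general alpha-investing scheme), the normal-approximation significance test, the stagewise residual update, and every name (`correlation_test`, `select_features`, `w0`, `alpha_delta`, the toy class names) are hypothetical choices made for the example.

```python
import math
import random


def correlation_test(residual, x):
    """Return (two-sided p-value, slope) for regressing residual on x.

    Assumption: a normal approximation to the t-statistic of the
    correlation coefficient is good enough for illustration.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_r = sum(residual) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    srr = sum((r - mean_r) ** 2 for r in residual)
    if sxx == 0 or srr == 0:
        return 1.0, 0.0
    sxr = sum((a - mean_x) * (r - mean_r) for a, r in zip(x, residual))
    rho = sxr / math.sqrt(sxx * srr)
    rho = max(min(rho, 0.999999), -0.999999)  # avoid division by zero
    t = rho * math.sqrt((n - 2) / (1 - rho * rho))
    p = math.erfc(abs(t) / math.sqrt(2))
    return p, sxr / sxx


def select_features(y, streams, w0=0.5, alpha_delta=0.5):
    """Streamwise selection with one wealth account per feature class.

    streams maps a class name to an iterable of (feature_name, values);
    the iterable may be a generator, so features can be produced lazily.
    """
    n = len(y)
    mean_y = sum(y) / n
    residual = [v - mean_y for v in y]
    state = {c: {"wealth": w0, "i": 0} for c in streams}
    iterators = {c: iter(feats) for c, feats in streams.items()}
    active = list(streams)
    selected = []
    while active:
        for c in list(active):  # assumed round-robin over classes
            item = next(iterators[c], None)
            if item is None:
                active.remove(c)
                continue
            name, x = item
            st = state[c]
            st["i"] += 1
            alpha = st["wealth"] / (2 * st["i"])  # amount of wealth bid
            p, beta = correlation_test(residual, x)
            if p < alpha:
                # Accept: simplified stagewise update of the residual.
                mean_x = sum(x) / n
                residual = [r - beta * (v - mean_x) for r, v in zip(residual, x)]
                selected.append((c, name))
                st["wealth"] += alpha_delta - alpha
            else:
                st["wealth"] -= alpha
    return selected


# Toy data: one informative feature in the first class, noise elsewhere.
random.seed(0)
n = 200
signal = [random.gauss(0, 1) for _ in range(n)]
y = [3 * s + random.gauss(0, 0.1) for s in signal]
streams = {
    "adjacent_words": [("signal", signal)]
    + [(f"aw_noise{k}", [random.gauss(0, 1) for _ in range(n)]) for k in range(5)],
    "parts_of_speech": [
        (f"pos_noise{k}", [random.gauss(0, 1) for _ in range(n)]) for k in range(20)
    ],
}
selected = select_features(y, streams)
```

Because each class keeps a separate wealth account, a long run of rejected features in one class shrinks only that class's future bids, which is one simple way to reflect classes of different sizes with different fractions of good features.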

Dean P. Foster | Lyle H. Ungar | Paramveer S. Dhillon
