Paper Information - Learning Combination Features with L1 Regularization

Learning Combination Features with L1 Regularization

When linear classifiers cannot successfully classify data, we often add combination features, which are products of several original features. Searching for effective combination features, a form of feature engineering, requires domain-specific knowledge and hard work. We present an efficient algorithm for learning an L1-regularized logistic regression model with combination features. We propose using the grafting algorithm with efficient computation of gradients, which enables us to find optimal weights without enumerating all combination features. Because of L1 regularization, the resulting model is very compact and supports very efficient inference. In experiments on NLP tasks, we show that the proposed method can extract effective combination features and achieves high performance with very few features.
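As a rough illustration of the grafting idea mentioned in the abstract, the sketch below grows an active feature set one feature at a time: it adds the inactive feature whose loss gradient has the largest magnitude, provided that magnitude exceeds the L1 strength, and then re-optimizes the active weights. Several parts are assumptions rather than the paper's method: the function names (`pairwise_products`, `fit_active`, `grafting`) are hypothetical, the pairwise products are enumerated explicitly (the paper's stated contribution is avoiding this enumeration, which is not reproduced here), and the inner solver is a plain proximal gradient loop chosen for brevity.

```python
import itertools
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def pairwise_products(X):
    """Original columns plus the product of every pair of columns.
    Assumption: brute-force enumeration, NOT the paper's efficient scheme."""
    d = X.shape[1]
    pairs = list(itertools.combinations(range(d), 2))
    cols = [X[:, j] for j in range(d)] + [X[:, a] * X[:, b] for a, b in pairs]
    names = [(j,) for j in range(d)] + pairs
    return np.column_stack(cols), names

def fit_active(F, y, w, active, C, steps=300, lr=0.5):
    """L1-regularized logistic regression on the active columns only,
    using proximal gradient steps (gradient step, then soft-thresholding)."""
    idx = np.flatnonzero(active)
    for _ in range(steps):
        p = sigmoid(F[:, idx] @ w[idx])
        g = F[:, idx].T @ (p - y) / len(y)
        v = w[idx] - lr * g
        w[idx] = np.sign(v) * np.maximum(np.abs(v) - lr * C, 0.0)
    return w

def grafting(F, y, C, max_features=50):
    """Add the inactive feature with the largest |gradient| while it exceeds C."""
    n, m = F.shape
    w = np.zeros(m)
    active = np.zeros(m, dtype=bool)
    while active.sum() < max_features:
        grad = F.T @ (sigmoid(F @ w) - y) / n
        scores = np.where(active, 0.0, np.abs(grad))
        best = int(np.argmax(scores))
        if scores[best] <= C:  # no inactive feature can lower the objective
            break
        active[best] = True
        w = fit_active(F, y, w, active, C)
    return w, active

# Toy data: all 8 sign patterns of three features; the label is 1 exactly
# when features 0 and 1 agree, so no single original feature is predictive.
X = np.array(list(itertools.product([-1.0, 1.0], repeat=3)))
y = (X[:, 0] * X[:, 1] > 0).astype(float)

F, names = pairwise_products(X)
w, active = grafting(F, y, C=0.05)
selected = [names[j] for j in np.flatnonzero(active)]
print(selected)
```

On this toy data the loop selects only the product of features 0 and 1 and then stops, because every other gradient stays below the threshold `C`. That mirrors the abstract's claim of a compact model, although only on a contrived example.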

Jun'ichi Tsujii | Daisuke Okanohara

[1] Manabu Sassano et al. Linear-Time Dependency Analysis for Japanese, 2004, COLING.

[2] Jianfeng Gao et al. A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing, 2007, ACL.

[3] Jianfeng Gao et al. Approximation Lasso Methods for Language Modeling, 2006, ACL.

[4] Yuji Matsumoto et al. A Boosting Algorithm for Classification of Semi-Structured Text, 2004, EMNLP.

[5] Miroslav Dudík et al. Maximum Entropy Density Estimation with Generalized Regularization and an Application to Species Distribution Modeling, 2007, J. Mach. Learn. Res.

[6] Evgeniy Gabrilovich et al. Parameterized generation of labeled datasets for text categorization based on a hierarchical directory, 2004, SIGIR '04.

[7] Takeaki Uno et al. Mining complex genotypic features for predicting HIV-1 drug resistance, 2007, Bioinform.

[8] A. Ng. Feature selection, L1 vs. L2 regularization, and rotational invariance, 2004, Twenty-first international conference on Machine learning - ICML '04.

[9] James Theiler et al. Online Feature Selection using Grafting, 2003, ICML.