Efficient Sparse Modeling With Automatic Feature Grouping

For high-dimensional data, it is often desirable to group similar features together during learning. This can reduce the estimation variance and improve the stability of feature selection, leading to better generalization; it can also aid in understanding and interpreting the data. The octagonal shrinkage and clustering algorithm for regression (OSCAR) is a recent sparse-modeling approach that places an ℓ1-regularizer and a pairwise ℓ∞-regularizer on the feature coefficients to encourage such feature grouping. Computationally, however, its optimization procedure is very expensive. In this paper, we propose an efficient solver based on the accelerated gradient method. We show that its key proximal step can be solved by a simple, highly efficient iterative group-merging algorithm. Given d input features, this reduces the empirical time complexity from between O(d^2) and O(d^5) for existing solvers to just O(d). Experimental results on a number of toy and real-world datasets demonstrate that OSCAR is a competitive sparse-modeling approach, with the added ability of automatic feature grouping.
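To make the objects in the abstract concrete, the sketch below computes the OSCAR penalty and a proximal step on sorted absolute values using a pool-adjacent-violators-style merging pass. This is an illustrative reconstruction, not the authors' exact algorithm: the function names are ours, and the decreasing-weight thresholding plus isotonic merging is one standard way such a group-merging prox can be realized.

```python
import numpy as np

def oscar_penalty(w, lam1, lam2):
    """OSCAR penalty: lam1 * ||w||_1 + lam2 * sum_{i<j} max(|w_i|, |w_j|)."""
    a = np.abs(w)
    # In ascending order, the k-th smallest value is the max in exactly
    # k of the pairs (with each of the k smaller entries), so it is
    # counted k times in the pairwise sum.
    a_sorted = np.sort(a)
    pairwise = np.sum(a_sorted * np.arange(len(a)))
    return lam1 * a.sum() + lam2 * pairwise

def oscar_prox(v, lam1, lam2):
    """Illustrative proximal step for the OSCAR penalty (group-merging sketch).

    Sorts |v| in decreasing order, subtracts decreasing weights
    lam1 + lam2*(d-1), ..., lam1, then restores monotonicity by merging
    adjacent violating blocks (averaging them), which is what ties
    coefficients together into groups.
    """
    d = len(v)
    sign = np.sign(v)
    a = np.abs(v)
    order = np.argsort(-a)          # indices that sort |v| in decreasing order
    z = a[order] - (lam1 + lam2 * np.arange(d - 1, -1, -1))
    # Pool-adjacent-violators pass: maintain a stack of (sum, count) blocks;
    # merge whenever a later block's mean exceeds (or ties) the previous one.
    sums, counts = [], []
    for zi in z:
        s, c = zi, 1
        while sums and sums[-1] / counts[-1] <= s / c:
            s += sums.pop()
            c += counts.pop()
        sums.append(s)
        counts.append(c)
    out = np.empty(d)
    pos = 0
    for s, c in zip(sums, counts):
        out[pos:pos + c] = max(s / c, 0.0)   # clip at zero for sparsity
        pos += c
    res = np.empty(d)
    res[order] = out                # undo the sort
    return sign * res
```

Note how merged blocks receive a common value: for example, `oscar_prox(np.array([2.0, 2.0]), 0.0, 1.0)` maps both coefficients to 1.5, i.e., they are shrunk into a single group. The merging pass touches each entry a constant number of times, consistent with the near-linear behavior the abstract describes (up to the initial sort).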
