论文信息 - Variable Selection in Penalized Model‐Based Clustering Via Regularization on Grouped Parameters

Variable Selection in Penalized Model‐Based Clustering Via Regularization on Grouped Parameters

Summary Penalized model‐based clustering has been proposed for high‐dimensional but small sample‐sized data, such as arising from genomic studies; in particular, it can be used for variable selection. A new regularization scheme is proposed to group together multiple parameters of the same variable across clusters, which is shown both analytically and numerically to be more effective than the conventional L1 penalty for variable selection. In addition, we develop a strategy to combine this grouping scheme with grouping structured variables. Simulation studies and applications to microarray gene expression data for cancer subtype discovery demonstrate the advantage of the new proposal over several existing approaches.

Xiaotong Shen | W. Pan | Benhuai Xie

[1] Ji Zhu,et al. Group variable selection via a hierarchical lasso and its oracle property , 2010, 1006.2871.

[2] Ji Zhu,et al. Variable Selection for Model‐Based High‐Dimensional Clustering and Its Application to Microarray Data , 2008, Biometrics.

[3] Xiaotong Shen,et al. Penalized model-based clustering with cluster-specic diagonal covariances and grouped variables , 2008 .

[4] Mee Young Park,et al. L1‐regularization path algorithm for generalized linear models , 2007 .

[5] Wei Pan,et al. Penalized Model-Based Clustering with Application to Variable Selection , 2007, J. Mach. Learn. Res..

[6] Ji Zhu,et al. Improved centroids estimation for the nearest shrunken centroid classifier , 2007, Bioinform..

[7] P. Zhao,et al. Grouped and Hierarchical Model Selection through Composite Absolute Penalties , 2007 .

[8] Marina Vannucci,et al. Variable selection in clustering via Dirichlet process mixture models , 2006 .

[9] Wei Pan,et al. Semi-supervised learning via penalized mixture model with application to microarray sample classification , 2006, Bioinform..

[10] Peter D. Hoff,et al. Model-based subspace clustering , 2006 .

[11] A. Raftery,et al. Variable Selection for Model-Based Clustering , 2006 .