Model selection via adaptive shrinkage with t priors

We discuss a model selection procedure, the adaptive ridge selector, derived from a hierarchical Bayes argument, which results in a simple and efficient fitting algorithm. The hierarchical model resembles an unreplicated variance components model and leads to weighting of the covariates. We discuss the intuition behind this type of estimator and investigate its behavior as a regularized least squares procedure. While related alternatives were recently exploited to simultaneously fit and select variables/features in regression models (Tipping in J Mach Learn Res 1:211–244, 2001; Figueiredo in IEEE Trans Pattern Anal Mach Intell 25:1150–1159, 2003), the extension presented here shows considerable improvement in model selection accuracy in several important cases. We also compare this estimator’s model selection performance to that offered by the lasso and adaptive lasso solution paths. Under randomized experimentation, we show that a fixed choice of tuning parameter yields model selection accuracy superior to that of the entire solution paths of the lasso and adaptive lasso when the underlying model is sparse. We also provide a robust version of the algorithm, suitable for cases where outliers may exist.
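To make the idea of covariate weighting in a regularized least squares fit concrete, the following is a minimal sketch of a generic iteratively reweighted ridge. It is not the paper's algorithm. It assumes a normal prior on each coefficient with its own variance, an inverse-gamma prior on that variance (so each coefficient has a marginal t prior), and a known noise variance. The function name `adaptive_ridge_sketch`, the hyperparameters `a` and `b`, the argument `sigma2`, the selection threshold, and the simulated data are all assumptions introduced for illustration.

```python
import numpy as np

def adaptive_ridge_sketch(X, y, a=2.0, b=1e-6, sigma2=1.0, n_iter=200, tol=1e-10):
    """EM-style reweighted ridge under an assumed normal / inverse-gamma hierarchy.

    Assumed model (illustrative, not taken from the paper):
      y ~ N(X beta, sigma2 * I), beta_j ~ N(0, tau_j^2), tau_j^2 ~ InvGamma(a, b).
    """
    p = X.shape[1]
    XtX = X.T @ X
    Xty = X.T @ y
    # Start from an ordinary ridge fit with unit penalty.
    beta = np.linalg.solve(XtX + np.eye(p), Xty)
    for _ in range(n_iter):
        # E-step: expected precision E[1 / tau_j^2 | beta_j] for each coefficient.
        w = (2.0 * a + 1.0) / (2.0 * b + beta ** 2)
        # M-step: ridge solve with one penalty weight per covariate.
        beta_new = np.linalg.solve(XtX + sigma2 * np.diag(w), Xty)
        converged = np.max(np.abs(beta_new - beta)) < tol
        beta = beta_new
        if converged:
            break
    return beta

# Simulated sparse regression (hypothetical data, for illustration only).
rng = np.random.default_rng(0)
X = rng.standard_normal((100, 8))
true_beta = np.array([3.0, 0.0, 0.0, 1.5, 0.0, 0.0, 0.0, 2.0])
y = X @ true_beta + 0.5 * rng.standard_normal(100)

beta_hat = adaptive_ridge_sketch(X, y, sigma2=0.25)
# Coefficients driven close to zero are treated as unselected (assumed threshold).
selected = np.abs(beta_hat) > 1e-3
```

In this sketch a small coefficient receives a large penalty weight on the next pass and is pushed further toward zero, while a large coefficient is penalized only lightly. With `b` close to zero the shrinkage of small coefficients becomes very aggressive, which is what produces selection-like behavior from a sequence of ridge solves.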

[1] I. Johnstone et al. Empirical Bayes selection of wavelet thresholds, 2005, math/0508281.

[2] Terence Tao et al. The Dantzig selector: Statistical estimation when P is much larger than n, 2005, math/0506081.

[3] L. Breiman. Better subset regression using the nonnegative garrote, 1995.

[4] D. Harville. Maximum Likelihood Approaches to Variance Component Estimation and to Related Problems, 1977.

[5] M. Yuan et al. Efficient Empirical Bayes Variable Selection and Estimation in Linear Models, 2005.

[6] Joseph G. Ibrahim et al. Monte Carlo Methods in Bayesian Computation, 2000.

[7] R. Tibshirani et al. Least angle regression, 2004, math/0406456.

[8] A. O'Hagan et al. On posterior joint and marginal modes, 1976.

[9] J. Friedman et al. Estimating Optimal Transformations for Multiple Regression and Correlation, 1985.

[10] E. George. The Variable Selection Problem, 2000.

[11] Mário A. T. Figueiredo. Adaptive Sparseness for Supervised Learning, 2003, IEEE Trans. Pattern Anal. Mach. Intell.

[12] H. Zou. The Adaptive Lasso and Its Oracle Properties, 2006.

[13] D. Lindley et al. Bayes Estimates for the Linear Model, 1972.

[14] Edward I. George et al. The Practical Implementation of Bayesian Model Selection, 2001.

[15] Cajo J. F. ter Braak et al. Bayesian sigmoid shrinkage with improper variance priors and an application to wavelet denoising, 2006, Comput. Stat. Data Anal.

[16] A. E. Hoerl et al. Ridge regression: biased estimation for nonorthogonal problems, 2000.

[17] Peng Zhao et al. On Model Selection Consistency of Lasso, 2006, J. Mach. Learn. Res.

[18] H. Kiiveri. A Bayesian approach to variable selection when the number of variables is very large, 2003.

[19] J. Griffin et al. Bayesian adaptive lassos with non-convex penalization, 2007.

[20] J. Geweke et al. Bayesian Treatment of the Independent Student-t Linear Model, 1993.

[21] James O. Berger et al. Robust hierarchical Bayes estimation of exchangeable means, 1991.

[22] Bani K. Mallick et al. Gene selection using a two-level hierarchical Bayesian model, 2004, Bioinform.

[23] G. Casella et al. The Bayesian Lasso, 2008.

[24] Michael E. Tipping. Sparse Bayesian Learning and the Relevance Vector Machine, 2001, J. Mach. Learn. Res.

[25] R. Tibshirani. Regression Shrinkage and Selection via the Lasso, 1996.

[26] T. Fearn et al. Multivariate Bayesian variable selection and prediction, 1998.

[27] R. Kohn et al. Nonparametric regression using Bayesian variable selection, 1996.

[28] Irwin Guttman et al. Bayesian Methods for Variance Component Models, 1996.

[29] Jianqing Fan et al. Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties, 2001.

[30] Daniel Gianola et al. Likelihood, Bayesian, and MCMC Methods in Quantitative Genetics, 2010.

[31] G. Casella et al. Objective Bayesian Variable Selection, 2006.