Skinny Gibbs: A Consistent and Scalable Gibbs Sampler for Model Selection

ABSTRACT We consider the computational and statistical issues for high-dimensional Bayesian model selection under the Gaussian spike and slab priors. To avoid large matrix computations needed in a standard Gibbs sampler, we propose a novel Gibbs sampler called “Skinny Gibbs” which is much more scalable to high-dimensional problems, both in memory and in computational efficiency. In particular, its computational complexity grows only linearly in p, the number of predictors, while retaining the property of strong model selection consistency even when p is much greater than the sample size n. The present article focuses on logistic regression due to its broad applicability as a representative member of the generalized linear models. We compare our proposed method with several leading variable selection methods through a simulation study to show that Skinny Gibbs has a strong performance as indicated by our theoretical work. Supplementary materials for this article are available online.

[1]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .

[2]  F. Liang,et al.  Bayesian Subset Modeling for High-Dimensional Generalized Linear Models , 2013 .

[3]  T. J. Mitchell,et al.  Bayesian variable selection in regression , 1987 .

[4]  James G. Scott,et al.  Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem , 2010, 1011.2333.

[5]  Veronika Rockova,et al.  EMVS: The EM Approach to Bayesian Variable Selection , 2014 .

[6]  J. Tobias,et al.  Analysis of gene expression in pancreatic islets from diet-induced obese mice. , 2008, Physiological genomics.

[7]  N. Narisetty,et al.  Bayesian variable selection with shrinking and diffusing priors , 2014, 1405.6545.

[8]  M. West,et al.  Shotgun Stochastic Search for “Large p” Regression , 2007 .

[9]  M. G. Pittau,et al.  A weakly informative default prior distribution for logistic and other regression models , 2008, 0901.4011.

[10]  Joseph G Ibrahim,et al.  Bayesian Variable Selection and Computation for Generalized Linear Models with Conjugate Priors. , 2008, Bayesian analysis.

[11]  A. V. D. Vaart,et al.  Needles and Straw in a Haystack: Posterior concentration for possibly sparse sequences , 2012, 1211.1197.

[12]  V. Johnson,et al.  Bayesian Model Selection in High-Dimensional Settings , 2012, Journal of the American Statistical Association.

[13]  Xiaotong Shen,et al.  Journal of the American Statistical Association Likelihood-based Selection and Sharp Parameter Estimation Likelihood-based Selection and Sharp Parameter Estimation , 2022 .

[14]  R. Carroll,et al.  Stochastic Approximation in Monte Carlo Computation , 2007 .

[15]  N. Pillai,et al.  Dirichlet–Laplace Priors for Optimal Shrinkage , 2014, Journal of the American Statistical Association.

[16]  S. Geer HIGH-DIMENSIONAL GENERALIZED LINEAR MODELS AND THE LASSO , 2008, 0804.0703.

[17]  Jian Huang,et al.  COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION. , 2011, The annals of applied statistics.

[18]  Jennifer H Barrett,et al.  Association studies. , 2002, Methods in molecular biology.

[19]  Brian J Reich,et al.  Consistent High-Dimensional Bayesian Variable Selection via Penalized Credible Regions , 2012, Journal of the American Statistical Association.

[20]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[21]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[22]  Christina Kendziorski,et al.  Combined Expression Trait Correlations and Expression Quantitative Trait Locus Mapping , 2006, PLoS genetics.

[23]  Gary W. Cline,et al.  Glycerol-3-Phosphate Acyltransferase 1 Deficiency in ob/ob Mice Diminishes Hepatic Steatosis but Does Not Protect Against Insulin Resistance or Obesity , 2010, Diabetes.

[24]  B. Caballero Insulin resistance in obesity. , 1992, Archivos latinoamericanos de nutricion.

[25]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[26]  Jianqing Fan,et al.  Nonconcave penalized likelihood with a diverging number of parameters , 2004, math/0406466.

[27]  Leonard A. Stefanski A normal scale mixture representation of the logistic distribution , 1991 .

[28]  D. Dunson,et al.  Bayesian Multivariate Logistic Regression , 2004, Biometrics.

[29]  Cun-Hui Zhang Nearly unbiased variable selection under minimax concave penalty , 2010, 1002.4734.

[30]  G. Casella,et al.  The Bayesian Lasso , 2008 .

[31]  Sara van de Geer,et al.  Statistics for High-Dimensional Data , 2011 .

[32]  J. Berger,et al.  Optimal predictive model selection , 2004, math/0406464.

[33]  J. Geweke,et al.  Variable selection and model comparison in regression , 1994 .

[34]  M. Stephens,et al.  Bayesian variable selection regression for genome-wide association studies and other large-scale problems , 2011, 1110.6019.

[35]  C. Holmes,et al.  Bayesian auxiliary variable models for binary and multinomial regression , 2006 .

[36]  Stephen J. Wright Coordinate descent algorithms , 2015, Mathematical Programming.

[37]  M. Yuan,et al.  Efficient Empirical Bayes Variable Selection and Estimation in Linear Models , 2005 .

[38]  J. S. Rao,et al.  Spike and slab variable selection: Frequentist and Bayesian strategies , 2005, math/0505633.

[39]  H. Zou,et al.  Regression Shrinkage and Selection via the Elastic Net , with Applications to Microarrays , 2003 .

[40]  James G. Scott,et al.  Bayesian Inference for Logistic Models Using Pólya–Gamma Latent Variables , 2012, 1205.0310.

[41]  M. Stephens,et al.  Scalable Variational Inference for Bayesian Variable Selection in Regression, and Its Accuracy in Genetic Association Studies , 2012 .

[42]  B. Mallick,et al.  Fast sampling with Gaussian scale-mixture priors in high-dimensional regression. , 2015, Biometrika.

[43]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[44]  Jian Huang,et al.  Estimation and Selection via Absolute Penalized Convex Minimization And Its Multistage Adaptive Applications , 2011, J. Mach. Learn. Res..

[45]  Zehua Chen,et al.  EXTENDED BIC FOR SMALL-n-LARGE-P SPARSE GLM , 2012 .