Bayesian Semiparametric Multiple Shrinkage

High-dimensional and highly correlated data leading to non- or weakly identified effects are commonplace. Maximum likelihood will typically fail in such situations and a variety of shrinkage methods have been proposed. Standard techniques, such as ridge regression or the lasso, shrink estimates toward zero, with some approaches allowing coefficients to be selected out of the model by achieving a value of zero. When substantive information is available, estimates can be shrunk to nonnull values; however, such information may not be available. We propose a Bayesian semiparametric approach that allows shrinkage to multiple locations. Coefficients are given a mixture of heavy-tailed double exponential priors, with location and scale parameters assigned Dirichlet process hyperpriors to allow groups of coefficients to be shrunk toward the same, possibly nonzero, mean. Our approach favors sparse, but flexible, structure by shrinking toward a small number of random locations. The methods are illustrated using a study of genetic polymorphisms and Parkinson's disease.

[1]  Tom Britton,et al.  Hierarchical Bayes models for cDNA microarray gene expression. , 2005, Biostatistics.

[2]  K. Takatsuki,et al.  Ectopic production of salivary‐type amylase by a lga‐λ‐type multiple myeloma , 1988, Cancer.

[3]  G. Casella,et al.  The Bayesian Lasso , 2008 .

[4]  M. Newton,et al.  Multiple Hypothesis Testing by Clustering Treatment Effects , 2007 .

[5]  D. Dunson,et al.  Bayesian Selection and Clustering of Polymorphisms in Functionally Related Genes , 2008 .

[6]  Freya Kamel,et al.  Association of Pesticide Exposure with Neurologic Dysfunction and Disease , 2004, Environmental health perspectives.

[7]  D. Duggan,et al.  Recent developments in genomewide association scans: a workshop summary and review. , 2005, American journal of human genetics.

[8]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .

[9]  E. George Minimax Multiple Shrinkage Estimation , 1986 .

[10]  J. Geweke,et al.  Variable selection and model comparison in regression , 1994 .

[11]  D. Dunson,et al.  Bayesian Multivariate Logistic Regression , 2004, Biometrics.

[12]  Sonja W. Scholz,et al.  Genome-wide genotyping in Parkinson's disease and neurologically normal controls: first stage analysis and public release of data , 2006, The Lancet Neurology.

[13]  D. F. Andrews,et al.  Scale Mixtures of Normal Distributions , 1974 .

[14]  M. G. Pittau,et al.  A Default Prior Distribution for Logistic and Other Regression Models , 2007 .

[15]  David B. Dunson,et al.  Bayesian Methods for Highly Correlated Exposure Data , 2007, Epidemiology.

[16]  E. George Combining Minimax Shrinkage Estimators , 1986 .

[17]  M. Escobar,et al.  Bayesian Density Estimation and Inference Using Mixtures , 1995 .

[18]  E. George A formal bayes multiple shrinkage estimator , 1986 .

[19]  T. Ferguson Prior Distributions on Spaces of Probability Measures , 1974 .

[20]  G. Roberts,et al.  Retrospective Markov chain Monte Carlo methods for Dirichlet process hierarchical models , 2007, 0710.4228.

[21]  P. Sonneveld,et al.  Abnormalities of chromosome 1p/q are highly associated with chromosome 13/13q deletions and are an adverse prognostic factor for the outcome of high‐dose chemotherapy in patients with multiple myeloma , 2007, British journal of haematology.

[22]  S Greenland,et al.  Hierarchical regression for epidemiologic analyses of multiple exposures. , 1994, Environmental health perspectives.

[23]  P. Müller,et al.  A Bayesian mixture model for differential gene expression , 2005 .

[24]  S Greenland,et al.  Hierarchical Regression Analysis Applied to a Study of Multiple Dietary Exposures and Breast Cancer , 1994, Epidemiology.

[25]  P. Müller,et al.  Optimal Sample Size for Multiple Testing , 2004 .

[26]  J. Griffin,et al.  Alternative prior distributions for variable selection with very many more variables than observations , 2005 .

[27]  S Greenland,et al.  A semi-Bayes approach to the analysis of correlated multiple associations, with an application to an occupational cancer-mortality study. , 1992, Statistics in medicine.

[28]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[29]  Michael E. Tipping Sparse Bayesian Learning and the Relevance Vector Machine , 2001, J. Mach. Learn. Res..

[30]  J. Leroy Folks,et al.  The Inverse Gaussian Distribution , 1989 .

[31]  David Page,et al.  Predicting cancer susceptibility from single-nucleotide polymorphism data: a case study in multiple myeloma , 2005, BIOKDD.

[32]  A. E. Hoerl,et al.  Ridge Regression: Applications to Nonorthogonal Problems , 1970 .

[33]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[34]  M. West On scale mixtures of normal distributions , 1987 .