Structured priors for sparse probability vectors with application to model selection in Markov chains

We develop two prior distributions for probability vectors which, in contrast to the popular Dirichlet distribution, retain sparsity properties in the presence of data. Our models are appropriate for count data with many categories, most of which are expected to have negligible probability. Both models are tractable, allowing for efficient posterior sampling and marginalization. Consequently, they can replace the Dirichlet prior in hierarchical models without sacrificing convenient Gibbs sampling schemes. We derive both models and demonstrate their properties. We then illustrate their use for model-based selection with a hierarchical model in which we infer the active lag from time-series data. Using a squared-error loss, we demonstrate the utility of the models for data simulated from a nearly deterministic dynamical system. We also apply the prior models to an ecological time series of Chinook salmon abundance, demonstrating their ability to extract insights into the lag dependence.

[1]  E. George,et al.  APPROACHES FOR BAYESIAN VARIABLE SELECTION , 1997 .

[2]  Alan Agresti,et al.  Bayesian inference for categorical data analysis , 2005, Stat. Methods Appl..

[3]  J. Sethuraman A CONSTRUCTIVE DEFINITION OF DIRICHLET PRIORS , 1991 .

[4]  Jim Albert,et al.  Mixtures of Dirichlet Distributions and Estimation in Contingency Tables , 1982 .

[5]  Fadlalla G. Elfadaly,et al.  Eliciting Dirichlet and Gaussian copula prior distributions for multinomial models , 2016, Stat. Comput..

[6]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[7]  A. Raftery,et al.  The Mixture Transition Distribution Model for High-Order Markov Chains and Non-Gaussian Time Series , 2002 .

[8]  Michael A. West,et al.  Time Series: Modeling, Computation, and Inference , 2010 .

[9]  N. Mantua,et al.  An historical narrative on the Pacific Decadal Oscillation, interdecadal climate variability and ecosystem impacts , 2001 .

[10]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[11]  Michael P. Wiper,et al.  Bayesian Analysis of Stochastic Process Models , 2012 .

[12]  J. Atchison,et al.  Logistic-normal distributions:Some properties and uses , 1980 .

[13]  G. Casella,et al.  The Bayesian Lasso , 2008 .

[14]  Robert H. Lochner,et al.  A Generalized Dirichlet Distribution in Bayesian Life Testing , 1975 .

[15]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[16]  Ocean Size and Corresponding Life History Diversity among the Four Run Timings of California Central Valley Chinook Salmon , 2017 .

[17]  Robert J. Connor,et al.  Concepts of Independence for Proportions with a Generalization of the Dirichlet Distribution , 1969 .

[18]  J. Besag,et al.  Exact Goodness‐of‐Fit Tests for Markov Chains , 2013, Biometrics.

[19]  W. Zucchini,et al.  Hidden Markov Models for Time Series: An Introduction Using R , 2009 .

[20]  Michael P. Wiper,et al.  Bayesian Analysis of Stochastic Process Models: Ruggeri/Bayesian Analysis of Stochastic Process Models , 2012 .

[21]  A. Raftery A model for high-order Markov chains , 1985 .

[22]  Nils Lid Hjort,et al.  Bayesian Approaches to Non- and Semiparametric Density Estimation , 1994 .

[23]  Terrance J. Quinn,et al.  Quantitative Fish Dynamics , 1999 .

[24]  Tzu-Tsung Wong,et al.  Generalized Dirichlet distribution in Bayesian analysis , 1998, Appl. Math. Comput..

[25]  Alan Edelman,et al.  Julia: A Fresh Approach to Numerical Computing , 2014, SIAM Rev..

[26]  Nizar Bouguila,et al.  Dirichlet-based probability model applied to human skin detection [image skin detection] , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[27]  I. Good On the Application of Symmetric Dirichlet Distributions and their Mixtures to Contingency Tables , 1976 .

[28]  A. Raftery,et al.  Estimation and Modelling Repeated Patterns in High Order Markov Chains with the Mixture Transition Distribution Model , 1994 .