Hierarchical Dirichlet Processes Author ( s ) :

JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org.. American Statistical Association is collaborating with JSTOR to digitize, preserve and extend access to Journal of the American Statistical Association. We consider problems involving groups of data where each observation within a group is a draw from a mixture model and where it is desirable to share mixture components between groups. We assume that the number of mixture components is unknown a priori and is to be inferred from the data. In this setting it is natural to consider sets of Dirichlet processes, one for each group, where the well-known clustering property of the Dirichlet process provides a nonparametric prior for the number of mixture components within each group. Given our desire to tie the mixture models in the various groups, we consider a hierarchical model, specifically one in which the base measure for the child Dirichlet processes is itself distributed according to a Dirichlet process. Such a base measure being discrete, the child Dirichlet processes necessarily share atoms. Thus, as desired, the mixture models in the different groups necessarily share mixture components. We discuss representations of hierarchical Dirichlet processes in terms of a stick-breaking process, and a generalization of the Chinese restaurant process that we refer to as the "Chinese restaurant franchise." We present Markov chain Monte Carlo algorithms for posterior inference in hierarchical Dirichlet process mixtures and describe applications to problems in information retrieval and text modeling.

[1]  Michael S. Pratte,et al.  Hierarchical Bayesian Models , 2013 .

[2]  S. MacEachern,et al.  Bayesian Density Estimation and Inference Using Mixtures , 2007 .

[3]  D. Hinkley Annals of Statistics , 2006 .

[4]  P. Müller,et al.  A method for combining inference across related nonparametric Bayesian models , 2004 .

[5]  Radford M. Neal,et al.  A Split-Merge Markov chain Monte Carlo Procedure for the Dirichlet Process Mixture Model , 2004 .

[6]  T. Speed,et al.  Biological Sequence Analysis , 1998 .

[7]  Michael I. Jordan,et al.  Hierarchical Bayesian Models for Applications in Information Retrieval , 2003 .

[8]  Matthew J. Beal Variational algorithms for approximate Bayesian inference , 2003 .

[9]  Jim Pitman,et al.  Poisson–Dirichlet and GEM Invariant Distributions for Split-and-Merge Transformations of an Interval Partition , 2002, Combinatorics, Probability and Computing.

[10]  Gary E. Bolton,et al.  Reanalyzing Ultimatum Bargaining—Comparing Nondecreasing Curves Without Shape Constraints , 2002 .

[11]  H. Ishwaran,et al.  Exact and approximate sum representations for the Dirichlet process , 2002 .

[12]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[13]  S. Gabriel,et al.  The Structure of Haplotype Blocks in the Human Genome , 2002, Science.

[14]  D. D. Castro,et al.  Semiparametric regression for count data , 2002 .

[15]  P. Green,et al.  Modelling Heterogeneity With and Without the Dirichlet Process , 2001 .

[16]  Alex Acero,et al.  Spoken Language Processing , 2001 .

[17]  P. Donnelly,et al.  A new statistical method for haplotype reconstruction from population data. , 2001, American journal of human genetics.

[18]  Carl E. Rasmussen,et al.  Factorial Hidden Markov Models , 1997 .

[19]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[20]  M. Escobar,et al.  Markov Chain Sampling Methods for Dirichlet Process Mixture Models , 2000 .

[21]  Carl E. Rasmussen,et al.  The Infinite Gaussian Mixture Model , 1999, NIPS.

[22]  J G Ibrahim,et al.  A semi-parametric Bayesian approach to generalized linear mixed models. , 1998, Statistics in medicine.

[23]  G. Tomlinson Analysis of densities , 1998 .

[24]  B. Mallick,et al.  Combining information from several experiments with nonparametric priors , 1997 .

[25]  J. Pitman Random discrete distributions invariant under size-biased permutation , 1996, Advances in Applied Probability.

[26]  David J. C. MacKay,et al.  A hierarchical Dirichlet language model , 1995, Natural Language Engineering.

[27]  Pietro Muliere,et al.  A bayesian predictive approach to sequential search for an optimal dose: Parametric and nonparametric models , 1993 .

[28]  Andreas Stolcke,et al.  Hidden Markov Model} Induction by Bayesian Model Merging , 1992, NIPS.

[29]  Radford M. Neal Bayesian Mixture Modeling , 1992 .

[30]  J. Sethuraman A CONSTRUCTIVE DEFINITION OF DIRICHLET PRIORS , 1991 .

[31]  Lawrence R. Rabiner,et al.  A tutorial on Hidden Markov Models , 1986 .

[32]  C. Antoniak Mixtures of Dirichlet Processes with Applications to Bayesian Nonparametric Problems , 1974 .

[33]  D. Blackwell,et al.  Ferguson Distributions Via Polya Urn Schemes , 1973 .

[34]  T. Ferguson A Bayesian Analysis of Some Nonparametric Problems , 1973 .

[35]  N. Jimpitma Poisson – Dirichlet and GEM Invariant Distributions for Split-and-Merge Transformations of an Interval Partition , 2022 .