Poisson Latent Feature Calculus for Generalized Indian Buffet Processes

The purpose of this work is to describe a unified, and indeed simple, mechanism for non-parametric Bayesian analysis, construction and generative sampling of a large class of latent feature models which one can describe as generalized notions of Indian Buffet Processes(IBP). This is done via the Poisson Process Calculus as it now relates to latent feature models. The IBP was ingeniously devised by Griffiths and Ghahramani in (2005) and its generative scheme is cast in terms of customers entering sequentially an Indian Buffet restaurant and selecting previously sampled dishes as well as new dishes. In this metaphor dishes corresponds to latent features, attributes, preferences shared by individuals. The IBP, and its generalizations, represent an exciting class of models well suited to handle high dimensional statistical problems now common in this information age. The IBP is based on the usage of conditionally independent Bernoulli random variables, coupled with completely random measures acting as Bayesian priors, that are used to create sparse binary matrices. This Bayesian non-parametric view was a key insight due to Thibaux and Jordan (2007). One way to think of generalizations is to to use more general random variables. Of note in the current literature are models employing Poisson and Negative-Binomial random variables. However, unlike their closely related counterparts, generalized Chinese restaurant processes, the ability to analyze IBP models in a systematic and general manner is not yet available. The limitations are both in terms of knowledge about the effects of different priors and in terms of models based on a wider choice of random variables. This work will not only provide a thorough description of the properties of existing models but also provide a simple template to devise and analyze new models.

[1]  Y. Teh,et al.  Indian Buffet Processes with Power-law Behavior , 2009, NIPS.

[2]  G. Wahba,et al.  Multivariate Bernoulli distribution , 2012, 1206.1874.

[3]  Lancelot F. James,et al.  Poisson Process Partition Calculus with Applications to Exchangeable Models and Bayesian Nonparametrics , 2002 .

[4]  Lancelot F. James,et al.  Posterior Analysis for Normalized Random Measures with Independent Increments , 2009 .

[5]  Carl E. Rasmussen,et al.  A choice model with infinitely many latent features , 2006, ICML.

[6]  Lancelot F. James Bayesian Poisson process partition calculus with an application to Bayesian Lévy moving averages , 2005, math/0508283.

[7]  Michael I. Jordan,et al.  Bayesian Nonparametric Latent Feature Models , 2011 .

[8]  Peter Orbanz,et al.  Nonparametric Priors on Complete Separable Metric Spaces , 2012 .

[9]  Zoubin Ghahramani,et al.  An Infinite Latent Attribute Model for Network Data , 2012, ICML.

[10]  Zoubin Ghahramani,et al.  Nonparametric Bayesian Sparse Factor Models with application to Gene Expression modelling , 2010, The Annals of Applied Statistics.

[11]  T. Griffiths,et al.  Bayesian nonparametric latent feature models , 2007 .

[12]  J. Pitman Poisson-Kingman partitions , 2002, math/0210396.

[13]  O. Barndorff-Nielsen,et al.  Multivariate subordination, self-decomposability and stability , 2001, Advances in Applied Probability.

[14]  Thomas L. Griffiths,et al.  A Non-Parametric Bayesian Method for Inferring Hidden Causes , 2006, UAI.

[15]  Lancelot F. James,et al.  Generalized weighted Chinese restaurant processes for species sampling mixture models , 2003 .

[16]  J. Pitman Combinatorial Stochastic Processes , 2006 .

[17]  David B. Dunson,et al.  Beta-Negative Binomial Process and Poisson Factor Analysis , 2011, AISTATS.

[18]  Daniel M. Roy,et al.  Bayesian Models of Graphs, Arrays and Other Exchangeable Random Structures , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Michael I. Jordan,et al.  Combinatorial Clustering and the Beta Negative Binomial Process , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Lancelot F. James,et al.  Bayesian analysis of multistate event history data: beta-Dirichlet process prior , 2012 .

[21]  Michael I. Jordan,et al.  Feature allocations, probability functions, and paintboxes , 2013, 1301.6647.

[22]  K. Doksum Tailfree and Neutral Random Probabilities and Their Posterior Distributions , 1974 .

[23]  David B. Dunson,et al.  The Kernel Beta Process , 2011, NIPS.

[24]  Jim Pitman,et al.  Partition structures derived from Brownian motion and stable subordinators , 1997 .

[25]  Tamara Broderick,et al.  Clusters and Features from Combinatorial Stochastic Processes , 2014 .

[26]  Chong Wang,et al.  The IBP Compound Dirichlet Process and its Application to Focused Topic Modeling , 2010, ICML.

[27]  Michael I. Jordan,et al.  Hierarchical Beta Processes and the Indian Buffet Process , 2007, AISTATS.

[28]  Alexander V. Gnedin,et al.  Three Sampling Formulas , 2002, Combinatorics, Probability and Computing.

[29]  Michael,et al.  On a Class of Bayesian Nonparametric Estimates : I . Density Estimates , 2008 .

[30]  Peter Orbanz,et al.  Construction of Nonparametric Bayesian Models from Parametric Bayes Equations , 2009, NIPS.

[31]  Lancelot F. James Bayesian calculus for gamma processes with applications to semiparametric intensity models , 2003 .

[32]  James G. Scott,et al.  Priors for Random Count Matrices Derived from a Family of Negative Binomial Processes , 2014, 1404.3331.

[33]  Yongdai Kim NONPARAMETRIC BAYESIAN ESTIMATORS FOR COUNTING PROCESSES , 1999 .

[34]  N. Hjort Nonparametric Bayes Estimators Based on Beta Processes in Models for Life History Data , 1990 .

[35]  Michalis K. Titsias,et al.  The Infinite Gamma-Poisson Feature Model , 2007, NIPS.

[36]  Michael I. Jordan,et al.  Posteriors, conjugacy, and exponential families for completely random measures , 2014, Bernoulli.

[37]  Michael I. Jordan,et al.  Graphical Models, Exponential Families, and Variational Inference , 2008, Found. Trends Mach. Learn..

[38]  Lancelot F. James Poisson calculus for spatial neutral to the right processes , 2003, math/0305053.

[39]  Albert Y. Lo,et al.  On a class of Bayesian nonparametric estimates: II. Hazard rate estimates , 1989 .

[40]  Y. Teh,et al.  MCMC for Normalized Random Measure Mixture Models , 2013, 1310.0595.

[41]  Daniel M. Roy,et al.  The combinatorial structure of beta negative binomial processes , 2013, Bernoulli.

[42]  Jun Zhu,et al.  Discriminative infinite latent feature models , 2013, 2013 IEEE China Summit and International Conference on Signal and Information Processing.

[43]  Hemant Ishwaran,et al.  Computational Methods for Multiplicative Intensity Models Using Weighted Gamma Processes , 2004 .

[44]  Thomas L. Griffiths,et al.  The Indian Buffet Process: An Introduction and Review , 2011, J. Mach. Learn. Res..

[45]  Albert Y. Lo,et al.  Bayesian nonparametric statistical inference for Poisson point processes , 1982 .

[46]  J. Pitman,et al.  Two Recursive Decompositions of Brownian Bridge Related to the Asymptotics of Random Mappings , 2006 .

[47]  Thomas L. Griffiths,et al.  Infinite latent feature models and the Indian buffet process , 2005, NIPS.

[48]  Seungjin Choi,et al.  Bayesian multi-subject common spatial patterns with Indian Buffet process priors , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[49]  J. Pitman,et al.  Regenerative composition structures , 2003, math/0307307.