The Infinite Gamma-Poisson Feature Model

We present a probability distribution over non-negative integer valued matrices with possibly an infinite number of columns. We also derive a stochastic process that reproduces this distribution over equivalence classes. This model can play the role of the prior in nonparametric Bayesian learning scenarios where multiple latent features are associated with the observed data and each feature can have multiple appearances or occurrences within each data point. Such data arise naturally when learning visual object recognition systems from unlabelled images. Together with the nonparametric prior we consider a likelihood model that explains the visual appearance and location of local image patches. Inference with this model is carried out using a Markov chain Monte Carlo algorithm.

[1]  W. Ewens The sampling theory of selectively neutral alleles. , 1972, Theoretical population biology.

[2]  C. Antoniak Mixtures of Dirichlet Processes with Applications to Bayesian Nonparametric Problems , 1974 .

[3]  Radford M. Neal Bayesian Mixture Modeling , 1992 .

[4]  Geoffrey E. Hinton,et al.  Autoencoders, Minimum Description Length and Helmholtz Free Energy , 1993, NIPS.

[5]  A. W. Kemp,et al.  Univariate Discrete Distributions , 1993 .

[6]  M. Newton Approximate Bayesian-inference With the Weighted Likelihood Bootstrap , 1994 .

[7]  S. MacEachern Estimating normal means with a conjugate style dirichlet process prior , 1994 .

[8]  Eric Saund,et al.  A Multiple Cause Mixture Model for Unsupervised Learning , 1995, Neural Computation.

[9]  Brendan J. Frey,et al.  Learning flexible sprites in video layers , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[10]  P. Green,et al.  Modelling Heterogeneity With and Without the Dirichlet Process , 2001 .

[11]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[12]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[13]  John F. Canny,et al.  GaP: a factor model for discrete data , 2004, SIGIR '04.

[14]  Aleks Jakulin,et al.  Applying Discrete PCA in Data Analysis , 2004, UAI.

[15]  Thomas L. Griffiths,et al.  Infinite latent feature models and the Indian buffet process , 2005, NIPS.

[16]  Antonio Torralba,et al.  Describing Visual Scenes using Transformed Dirichlet Processes , 2005, NIPS.