Improved Generative Models for Continuous Image Features through Tree-structured Non-parametric Distributions

Density estimation arises in a wide range of vision problems and methods which can deal with high dimensional image features are of great importance. While in principle a nonparametric distribution can be estimated for the full feature distribution using Parzen windows technique, the amount of data to make these estimates accurate is usually either unattainable or unmanageable. Consequently, most modelers resort to parametric models such as mixtures of Gaussians (or other more complicated parametric forms) or make independence assumptions about the features. Such assumptions could be detrimental to the performance of vision systems since realistically, image features have neither a simple parametric form, nor are they independent. In this paper, we revive non-parametric models for image feature distributions by finding the best tree-structured graphical model (using the Chow-Liu algorithm) for our data, and estimating non-parametric distributions over the oneand two-node marginals necessary to define the graph. This procedure has the appealing property that, if the tree-structured model represents the true conditional independence relations for the features, then our estimated joint distribution converges rapidly to the true distribution of the data. Even when this is not true, it converges to the best possible tree-structured model for the original distribution. We illustrate the effectiveness of this technique on simulated data and a real-world plankton classification problem.

[1]  Matthew B. Blaschko,et al.  Combining Local and Global Image Features for Object Class Recognition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[2]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[3]  Pierre Soille,et al.  Morphological Image Analysis: Principles and Applications , 2003 .

[4]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[5]  Sebastiano B. Serpico,et al.  Multisource data classification with dependence trees , 2002, IEEE Trans. Geosci. Remote. Sens..

[6]  David G. Stork,et al.  Pattern Classification , 1973 .

[7]  Franz Pernkopf,et al.  Discriminative versus generative parameter and structure learning of Bayesian network classifiers , 2005, ICML.

[8]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[9]  Padhraic Smyth,et al.  Conditional Chow-Liu Tree Structures for Modeling Discrete-Valued Vector Time Series , 2004, UAI.

[10]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[11]  L. Györfi,et al.  Nonparametric entropy estimation. An overview , 1997 .

[12]  Robert M. Haralick,et al.  Approximating high dimensional probability distributions , 2004, ICPR 2004.

[13]  David Heckerman,et al.  Probabilistic similarity networks , 1991, Networks.

[14]  Allen R. Hanson,et al.  Automatic In Situ Identification of Plankton , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[15]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[16]  Jeff A. Bilmes,et al.  Factored sparse inverse covariance matrices , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[17]  Michael I. Jordan Graphical Models , 2003 .

[18]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[19]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[20]  Ivar Weibull,et al.  Image Analysis — Principles and Applications in Materials Technology , 1995 .

[21]  Michael I. Jordan,et al.  Beyond Independent Components: Trees and Clusters , 2003, J. Mach. Learn. Res..

[22]  B. Silverman Density estimation for statistics and data analysis , 1986 .

[23]  Allen R. Hanson,et al.  On multi-scale differential features and their representations for image retrieval and recognition , 2003 .