Compositional Generative Mapping for Tree-Structured Data—Part I: Bottom-Up Probabilistic Modeling of Trees

We introduce a novel compositional (recursive) probabilistic model for trees that defines an approximated bottom-up generative process from the leaves to the root of a tree. The proposed model defines contextual state transitions from the joint configuration of the children to the parent nodes. We argue that the bottom-up context postulates different probabilistic assumptions with respect to a top-down approach, leading to different representational capabilities. We discuss classes of applications that are best suited to a bottom-up approach. In particular, the bottom-up context is shown to better correlate and model the co-occurrence of substructures among the child subtrees of internal nodes. A mixed memory approximation is introduced to factorize the joint children-to-parent state transition matrix as a mixture of pairwise transitions. The proposed approach is the first practical bottom-up generative model for tree-structured data that maintains the same computational class of its top-down counterpart. Comparative experimental analyses exploiting synthetic and real-world datasets show that the proposed model can deal with deep structures better than a top-down generative model. The model is also shown to better capture structural information from real-world data comprising trees with a large out-degree. The proposed bottom-up model can be used as a fundamental building block for the development of other new powerful models.

[1]  Joost Engelfriet,et al.  Bottom-up and top-down tree transformations— a comparison , 1975, Mathematical systems theory.

[2]  Alessandro Sperduti,et al.  Supervised neural networks for the classification of structures , 1997, IEEE Trans. Neural Networks.

[3]  Robert D. Nowak,et al.  Wavelet-based statistical signal processing using hidden Markov models , 1998, IEEE Trans. Signal Process..

[4]  Antonio Criminisi,et al.  Object categorization by learned universal visual dictionary , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[5]  Jeff A. Bilmes,et al.  Dynamic Bayesian Multinets , 2000, UAI.

[6]  A. Raftery A model for high-order Markov chains , 1985 .

[7]  Carl E. Rasmussen,et al.  Factorial Hidden Markov Models , 1997 .

[8]  Michael I. Jordan,et al.  Factorial Hidden Markov Models , 1995, Machine Learning.

[9]  Peter Tiño,et al.  Visualization of Tree-Structured Data Through Generative Topographic Mapping , 2008, IEEE Transactions on Neural Networks.

[10]  Hubert Comon,et al.  Tree automata techniques and applications , 1997 .

[11]  A. Raftery,et al.  The Mixture Transition Distribution Model for High-Order Markov Chains and Non-Gaussian Time Series , 2002 .

[12]  Alessio Micheli,et al.  Recursive self-organizing network models , 2004, Neural Networks.

[13]  Alessio Micheli,et al.  A general framework for unsupervised processing of structured data , 2004, Neurocomputing.

[14]  Alex Pentland,et al.  Coupled hidden Markov models for complex action recognition , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  Michael I. Jordan,et al.  Mixed Memory Markov Models: Decomposing Complex Stochastic Processes as Mixtures of Simpler Ones , 1999, Machine Learning.

[16]  Davide Bacciu,et al.  Bottom-Up Generative Modeling of Tree-Structured Data , 2010, ICONIP.

[17]  Alessandro Sperduti,et al.  A general framework for adaptive processing of data structures , 1998, IEEE Trans. Neural Networks.

[18]  Paulo Gonçalves,et al.  Computational methods for hidden Markov tree models-an application to wavelet trees , 2004, IEEE Transactions on Signal Processing.

[19]  Ludovic Denoyer,et al.  Report on the XML mining track at INEX 2005 and INEX 2006: categorization and clustering of XML documents , 2007, SIGF.

[20]  Davide Bacciu,et al.  Compositional Generative Mapping for Tree-Structured Data—Part II: Topographic Projection Model , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[21]  Robert A. Jacobs,et al.  Hierarchical Mixtures of Experts and the EM Algorithm , 1993, Neural Computation.

[22]  Paolo Frasconi,et al.  Hidden Tree Markov Models for Document Image Classification , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Alessandro Sperduti,et al.  Route kernels for trees , 2009, ICML '09.

[24]  Ludovic Denoyer,et al.  Bayesian network model for semi-structured document classification , 2004, Inf. Process. Manag..

[25]  Joseph N. Wilson,et al.  Twenty Years of Mixture of Experts , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[26]  Ioannis Patras,et al.  Tree-Structured Feature Extraction Using Mutual Information , 2012, IEEE Transactions on Neural Networks and Learning Systems.