论文信息 - Compositional Generative Mapping for Tree-Structured Data—Part I: Bottom-Up Probabilistic Modeling of Trees

Compositional Generative Mapping for Tree-Structured Data—Part I: Bottom-Up Probabilistic Modeling of Trees

We introduce a novel compositional (recursive) probabilistic model for trees that defines an approximated bottom-up generative process from the leaves to the root of a tree. The proposed model defines contextual state transitions from the joint configuration of the children to the parent nodes. We argue that the bottom-up context postulates different probabilistic assumptions with respect to a top-down approach, leading to different representational capabilities. We discuss classes of applications that are best suited to a bottom-up approach. In particular, the bottom-up context is shown to better correlate and model the co-occurrence of substructures among the child subtrees of internal nodes. A mixed memory approximation is introduced to factorize the joint children-to-parent state transition matrix as a mixture of pairwise transitions. The proposed approach is the first practical bottom-up generative model for tree-structured data that maintains the same computational class of its top-down counterpart. Comparative experimental analyses exploiting synthetic and real-world datasets show that the proposed model can deal with deep structures better than a top-down generative model. The model is also shown to better capture structural information from real-world data comprising trees with a large out-degree. The proposed bottom-up model can be used as a fundamental building block for the development of other new powerful models.

Davide Bacciu | Alessio Micheli | Alessandro Sperduti

[1] Joost Engelfriet,et al. Bottom-up and top-down tree transformations— a comparison , 1975, Mathematical systems theory.

[2] Alessandro Sperduti,et al. Supervised neural networks for the classification of structures , 1997, IEEE Trans. Neural Networks.

[3] Robert D. Nowak,et al. Wavelet-based statistical signal processing using hidden Markov models , 1998, IEEE Trans. Signal Process..

[4] Antonio Criminisi,et al. Object categorization by learned universal visual dictionary , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[5] Jeff A. Bilmes,et al. Dynamic Bayesian Multinets , 2000, UAI.

[6] A. Raftery. A model for high-order Markov chains , 1985 .

[7] Carl E. Rasmussen,et al. Factorial Hidden Markov Models , 1997 .

[8] Michael I. Jordan,et al. Factorial Hidden Markov Models , 1995, Machine Learning.

[9] Peter Tiño,et al. Visualization of Tree-Structured Data Through Generative Topographic Mapping , 2008, IEEE Transactions on Neural Networks.

[10] Hubert Comon,et al. Tree automata techniques and applications , 1997 .

[11] A. Raftery,et al. The Mixture Transition Distribution Model for High-Order Markov Chains and Non-Gaussian Time Series , 2002 .

[12] Alessio Micheli,et al. Recursive self-organizing network models , 2004, Neural Networks.

[13] Alessio Micheli,et al. A general framework for unsupervised processing of structured data , 2004, Neurocomputing.

[14] Alex Pentland,et al. Coupled hidden Markov models for complex action recognition , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15] Michael I. Jordan,et al. Mixed Memory Markov Models: Decomposing Complex Stochastic Processes as Mixtures of Simpler Ones , 1999, Machine Learning.

[16] Davide Bacciu,et al. Bottom-Up Generative Modeling of Tree-Structured Data , 2010, ICONIP.

[17] Alessandro Sperduti,et al. A general framework for adaptive processing of data structures , 1998, IEEE Trans. Neural Networks.

[18] Paulo Gonçalves,et al. Computational methods for hidden Markov tree models-an application to wavelet trees , 2004, IEEE Transactions on Signal Processing.

[19] Ludovic Denoyer,et al. Report on the XML mining track at INEX 2005 and INEX 2006: categorization and clustering of XML documents , 2007, SIGF.

[20] Davide Bacciu,et al. Compositional Generative Mapping for Tree-Structured Data—Part II: Topographic Projection Model , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[21] Robert A. Jacobs,et al. Hierarchical Mixtures of Experts and the EM Algorithm , 1993, Neural Computation.

[22] Paolo Frasconi,et al. Hidden Tree Markov Models for Document Image Classification , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[23] Alessandro Sperduti,et al. Route kernels for trees , 2009, ICML '09.

[24] Ludovic Denoyer,et al. Bayesian network model for semi-structured document classification , 2004, Inf. Process. Manag..

[25] Joseph N. Wilson,et al. Twenty Years of Mixture of Experts , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[26] Ioannis Patras,et al. Tree-Structured Feature Extraction Using Mutual Information , 2012, IEEE Transactions on Neural Networks and Learning Systems.