Poisson Sum-Product Networks: A Deep Architecture for Tractable Multivariate Poisson Distributions

Multivariate count data are pervasive in science, appearing as histograms, contingency tables, and in other forms. Previous work on modeling this type of distribution does not allow for fast and tractable inference. In this paper we present a novel Poisson graphical model, the first based on sum-product networks, called PSPN, which allows for positive as well as negative dependencies. We present algorithms for learning tree PSPNs from data as well as for tractable inference via symbolic evaluation. With these, information-theoretic measures such as entropy, mutual information, and distances among count variables can be computed without resorting to approximations. Additionally, we show a connection between PSPNs and LDA, linking the structure of tree PSPNs to a hierarchy of topics. Experimental results on several synthetic and real-world datasets demonstrate that PSPNs often outperform the state of the art while remaining tractable.
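To make the idea of tractable inference concrete, the following is a minimal sketch of a toy network in the spirit of a PSPN: one sum node mixing two product nodes, each a product of independent Poisson leaves. The structure, weights, rates, and the names `COMPONENTS` and `pspn_prob` are illustrative assumptions, not the paper's learned models or its learning algorithm. The sketch only shows how joint, marginal, and conditional probabilities all come from a single bottom-up pass, with unobserved variables marginalized by setting their leaves to 1.

```python
import math


def poisson_pmf(k, lam):
    """Poisson probability mass P(X = k) for a rate lam > 0."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


# Hypothetical toy network over two count variables X0 and X1.
# Each entry is (mixture weight, [Poisson rate for X0, Poisson rate for X1]).
# The rates are chosen (as an assumption) so that X0 and X1 are negatively
# dependent: a component with a high rate for X0 has a low rate for X1.
COMPONENTS = [
    (0.6, [1.0, 5.0]),
    (0.4, [4.0, 0.5]),
]


def pspn_prob(evidence):
    """Evaluate the toy network bottom-up.

    evidence maps a variable index to its observed count. Variables that are
    absent from evidence are marginalized out: their leaf contributes 1.
    """
    total = 0.0
    for weight, rates in COMPONENTS:          # sum node
        prod = 1.0
        for var, lam in enumerate(rates):     # product node over leaves
            if var in evidence:
                prod *= poisson_pmf(evidence[var], lam)
        total += weight * prod
    return total


joint = pspn_prob({0: 2, 1: 3})        # P(X0 = 2, X1 = 3)
marginal_x0 = pspn_prob({0: 2})        # P(X0 = 2), X1 summed out exactly
conditional = joint / marginal_x0      # P(X1 = 3 | X0 = 2)
print(joint, marginal_x0, conditional)
```

Each query costs one pass over the network, which is the sense in which such models stay tractable; no sampling or truncated summation over the unbounded count range is needed for marginals.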
