Lifted generative learning of Markov logic networks

Markov logic networks (MLNs) are a well-known statistical relational learning formalism that combines Markov networks with first-order logic. MLNs attach weights to formulas in first-order logic. Learning MLNs from data is a challenging task as it requires searching through the huge space of possible theories. Additionally, evaluating a theory’s likelihood requires learning the weight of all formulas in the theory. This in turn requires performing probabilistic inference, which, in general, is intractable in MLNs. Lifted inference speeds up probabilistic inference by exploiting symmetries in a model. We explore how to use lifted inference when learning MLNs. Specifically, we investigate generative learning where the goal is to maximize the likelihood of the model given the data. First, we provide a generic algorithm for learning maximum likelihood weights that works with any exact lifted inference approach. In contrast, most existing approaches optimize approximate measures such as the pseudo-likelihood. Second, we provide a concrete parameter learning algorithm based on first-order knowledge compilation. Third, we propose a structure learning algorithm that learns liftable MLNs, which is the first MLN structure learning algorithm that exactly optimizes the likelihood of the model. Finally, we perform an empirical evaluation on three real-world datasets. Our parameter learning algorithm results in more accurate models than several competing approximate approaches. It learns more accurate models in terms of test-set log-likelihood as well as prediction tasks. Furthermore, our tractable learner outperforms intractable models on prediction tasks suggesting that liftable models are a powerful hypothesis space, which may be sufficient for many standard learning problems.

[1]  Tuyen N. Huynh,et al.  Exact Lifted Inference with Distinct Soft Evidence on Every Object , 2012, AAAI.

[2]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[3]  Pierre Marquis,et al.  A Knowledge Compilation Map , 2002, J. Artif. Intell. Res..

[4]  Guy Van den Broeck,et al.  Skolemization for Weighted First-Order Model Counting , 2013, KR.

[5]  David Poole,et al.  First-order probabilistic inference , 2003, IJCAI.

[6]  Pedro M. Domingos,et al.  Probabilistic theorem proving , 2011, UAI.

[7]  John D. Lafferty,et al.  Inducing Features of Random Fields , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Daniel Lowd,et al.  Learning Markov Networks With Arithmetic Circuits , 2013, AISTATS.

[9]  G. Rota The Number of Partitions of a Set , 1964 .

[10]  Fahiem Bacchus,et al.  Towards Completely Lifted Search-based Probabilistic Inference , 2011, ArXiv.

[11]  Guy Van den Broeck,et al.  Lifted Relax, Compensate and then Recover: From Approximate to Exact Lifted Probabilistic Inference , 2012, UAI.

[12]  Luc De Raedt,et al.  Probabilistic inductive logic programming , 2004 .

[13]  Adnan Darwiche,et al.  Modeling and Reasoning with Bayesian Networks , 2009 .

[14]  Jeff A. Bilmes,et al.  PAC-learning Bounded Tree-width Graphical Models , 2004, UAI.

[15]  Guy Van den Broeck,et al.  Lifted Generative Parameter Learning , 2013, StarAI@AAAI.

[16]  Guy Van den Broeck,et al.  Completeness Results for Lifted Variable Elimination , 2013, AISTATS.

[17]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[18]  Guy Van den Broeck On the Completeness of First-Order Knowledge Compilation for Lifted Probabilistic Inference , 2011, NIPS.

[19]  Luc De Raedt,et al.  Probabilistic Inductive Logic Programming - Theory and Applications , 2008, Probabilistic Inductive Logic Programming.

[20]  Leslie Pack Kaelbling,et al.  Lifted Probabilistic Inference with Counting Formulas , 2008, AAAI.

[21]  Adnan Darwiche,et al.  On probabilistic inference by weighted model counting , 2008, Artif. Intell..

[22]  Dan Roth,et al.  Lifted First-Order Probabilistic Inference , 2005, IJCAI.

[23]  Pedro M. Domingos,et al.  Sound and Efficient Inference with Probabilistic and Deterministic Dependencies , 2006, AAAI.

[24]  Pedro M. Domingos,et al.  Learning the structure of Markov logic networks , 2005, ICML.

[25]  Kristian Kersting,et al.  Lifted Online Training of Relational Models with Stochastic Gradient Methods , 2012, ECML/PKDD.

[26]  J. Besag Statistical Analysis of Non-Lattice Data , 1975 .

[27]  Guy Van den Broeck Lifted Inference and Learning in Statistical Relational Models , 2013 .

[28]  Raymond J. Mooney,et al.  Bottom-up learning of Markov logic network structure , 2007, ICML '07.

[29]  Pedro M. Domingos,et al.  Lifted First-Order Belief Propagation , 2008, AAAI.

[30]  Ian McGraw,et al.  FastInf: An Efficient Approximate Inference Library , 2010, J. Mach. Learn. Res..

[31]  Pedro M. Domingos,et al.  Discriminative Training of Markov Logic Networks , 2005, AAAI.

[32]  Ofer Meshi,et al.  Template Based Inference in Symmetric Relational Markov Random Fields , 2007, UAI.

[33]  Pedro M. Domingos,et al.  Learning Markov Logic Networks Using Structural Motifs , 2010, ICML.

[34]  Guy Van den Broeck,et al.  Tractability through Exchangeability: A New Perspective on Efficient Probabilistic Inference , 2014, AAAI.

[35]  Pedro M. Domingos,et al.  Efficient Weight Learning for Markov Logic Networks , 2007, PKDD.

[36]  Andrew McCallum,et al.  Introduction to Statistical Relational Learning , 2007 .

[37]  Pedro M. Domingos,et al.  A Tractable First-Order Probabilistic Logic , 2012, AAAI.

[38]  Kristian Kersting,et al.  Counting Belief Propagation , 2009, UAI.

[39]  Kristian Kersting,et al.  Lifted Probabilistic Inference , 2012, ECAI.

[40]  Luc De Raedt,et al.  Lifted Probabilistic Inference by First-Order Knowledge Compilation , 2011, IJCAI.

[41]  Carlos Guestrin,et al.  Efficient Principled Learning of Thin Junction Trees , 2007, NIPS.

[42]  Jesse Davis,et al.  Unachievable Region in Precision-Recall Space and Its Effect on Empirical Evaluation , 2012, ICML.

[43]  Ben Taskar,et al.  Introduction to statistical relational learning , 2007 .

[44]  Guy Van den Broeck,et al.  Liftability of Probabilistic Inference: Upper and Lower Bounds , 2012 .

[45]  James D. Park,et al.  MAP Complexity Results and Approximation Methods , 2002, UAI.

[46]  Guy Van den Broeck On the Complexity and Approximation of Binary Evidence in Lifted Inference , 2013, StarAI@AAAI.

[47]  Raymond J. Mooney,et al.  Max-Margin Weight Learning for Markov Logic Networks , 2009, ECML/PKDD.

[48]  Guy Van den Broeck,et al.  Conditioning in First-Order Knowledge Compilation and Lifted Probabilistic Inference , 2012, AAAI.

[49]  Guy Van den Broeck,et al.  Symmetric Weighted First-Order Model Counting , 2014, PODS.

[50]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[51]  Pedro Domingos,et al.  Tractable Markov Logic , 2012 .

[52]  Jesse Davis,et al.  Generalized Counting for Lifted Variable Elimination , 2012, ILP.

[53]  Pedro M. Domingos,et al.  Learning Arithmetic Circuits , 2008, UAI.

[54]  Matthew Richardson,et al.  The Alchemy System for Statistical Relational AI: User Manual , 2007 .

[55]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.