Differentiable Logic Machines

The integration of reasoning, learning, and decisionmaking is key to build more general AI systems. As a step in this direction, we propose a novel neural-logic architecture that can solve both inductive logic programming (ILP) and deep reinforcement learning (RL) problems. Our architecture defines a restricted but expressive continuous space of first-order logic programs by assigning weights to predicates instead of rules. Therefore, it is fully differentiable and can be efficiently trained with gradient descent. Besides, in order to solve more efficiently RL problems, we propose a novel critic architecture that enables actor-critic algorithms. Compared to state-of-the-art methods on both ILP and RL problems, our proposition achieves excellent performance, while being able to provide a fully interpretable solution and scaling much better, especially during the testing phase.

[1]  Ali Payani,et al.  Incorporating Relational Background Knowledge into Reinforcement Learning via Differentiable Inductive Logic Programming , 2020, ArXiv.

[2]  Martijn van Otterlo,et al.  Solving Relational and First-Order Logical Markov Decision Processes: A Survey , 2012, Reinforcement Learning.

[3]  K. Kersting,et al.  Statistical Relational AI : Logic , Probability and Computation , 2011 .

[4]  Robert Givan,et al.  Relational Reinforcement Learning: An Overview , 2004, ICML 2004.

[5]  Marcin Andrychowicz,et al.  Hindsight Experience Replay , 2017, NIPS.

[6]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[7]  Chong Wang,et al.  Neural Logic Machines , 2019, ICLR.

[8]  R. Bellman A Markovian Decision Process , 1957 .

[9]  Learn to Explain Efficiently via Neural Logic Inductive Learning , 2020, ICLR.

[10]  Luc De Raedt,et al.  DeepProbLog: Neural Probabilistic Logic Programming , 2018, BNAIC/BENELEARN.

[11]  Ali Payani,et al.  Learning Algorithms via Neural Logic Networks , 2019, ArXiv.

[12]  Fan Yang,et al.  Differentiable Learning of Logical Rules for Knowledge Base Reasoning , 2017, NIPS.

[13]  Ben Poole,et al.  Categorical Reparameterization with Gumbel-Softmax , 2016, ICLR.

[14]  Andrew Cropper,et al.  Turning 30: New Ideas in Inductive Logic Programming , 2020, ArXiv.

[15]  Luciano Serafini,et al.  Neural-Symbolic Computing: An Effective Methodology for Principled Integration of Machine Learning and Reasoning , 2019, FLAP.

[16]  Sebastijan Dumancic,et al.  From Statistical Relational to Neuro-Symbolic Artificial Intelligence , 2020, IJCAI.

[17]  Vítor Santos Costa,et al.  Inductive Logic Programming , 2013, Lecture Notes in Computer Science.

[18]  Kai-Uwe Kühnberger,et al.  Neural-Symbolic Learning and Reasoning: A Survey and Interpretation , 2017, Neuro-Symbolic Artificial Intelligence.

[19]  Jason Weston,et al.  End-To-End Memory Networks , 2015, NIPS.

[20]  Chuang Gan,et al.  Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding , 2018, NeurIPS.

[21]  Fangkai Yang,et al.  SDRL: Interpretable and Data-efficient Deep Reinforcement Learning Leveraging Symbolic Planning , 2018, AAAI.

[22]  De,et al.  Relational Reinforcement Learning , 2001, Encyclopedia of Machine Learning and Data Mining.

[23]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[24]  Ali Payani,et al.  Inductive Logic Programming via Differentiable Deep Neural Logic Networks , 2019, ArXiv.

[25]  Sergio Gomez Colmenarejo,et al.  Hybrid computing using a neural network with dynamic external memory , 2016, Nature.

[26]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[27]  Francisco Herrera,et al.  Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI , 2020, Inf. Fusion.

[28]  Ronald J. Williams,et al.  Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[29]  Joshua B. Tenenbaum,et al.  Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.

[30]  Gary Marcus,et al.  Deep Learning: A Critical Appraisal , 2018, ArXiv.

[31]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[32]  Shan Luo,et al.  Neural Logic Reinforcement Learning , 2019, ICML.

[33]  Richard Evans,et al.  Learning Explanatory Rules from Noisy Data , 2017, J. Artif. Intell. Res..