Synthesis of Differentiable Functional Programs for Lifelong Learning

We present a neurosymbolic approach to the lifelong learning of algorithmic tasks that mix perception and procedural reasoning. Reusing high-level concepts across domains and learning complex procedures are two key challenges in lifelong learning. We show that a combination of gradient-based learning and symbolic program synthesis can be a more effective response to these challenges than purely neural methods. Concretely, our approach, called HOUDINI, represents neural networks as strongly typed, end-to-end differentiable functional programs that use symbolic higher-order combinators to compose a library of neural functions. Our learning algorithm consists of: (1) a program synthesizer that performs a type-directed search over programs in this language and decides which library functions to reuse and which architectures to combine them with; and (2) a neural module that trains synthesized programs using stochastic gradient descent. We evaluate our approach on three algorithmic tasks. Our experiments show that our type-directed search technique significantly prunes the search space of programs, and that the overall approach transfers high-level concepts more effectively than monolithic neural networks and traditional transfer learning.
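
To make the two ingredients of the abstract concrete, here is a minimal PyTorch sketch, not the HOUDINI implementation: typed neural functions composed by a symbolic higher-order combinator, plus a type-directed enumerator that discards ill-typed candidates before any training. All names (PerceiveDigit, Compose, enumerate_programs, the string-based types) are illustrative assumptions of this sketch, not HOUDINI's actual API; HOUDINI uses a richer type system with tensor types and algebraic data types such as lists.

```python
# Minimal sketch (assumed names, not HOUDINI's API): typed neural library
# functions, a differentiable composition combinator, and a depth-1
# type-directed search whose survivors train end-to-end with SGD.

import torch
import torch.nn as nn

# --- A library of typed neural functions ------------------------------------

class PerceiveDigit(nn.Module):
    """Neural function of type Image -> R10 (digit logits)."""
    in_type, out_type = "Image", "R10"

    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))

    def forward(self, x):
        return self.net(x)

class Regress(nn.Module):
    """Neural function of type R10 -> R (a scalar read-out)."""
    in_type, out_type = "R10", "R"

    def __init__(self):
        super().__init__()
        self.net = nn.Linear(10, 1)

    def forward(self, x):
        return self.net(x)

# --- Symbolic higher-order combinators ---------------------------------------

class Compose(nn.Module):
    """compose : (a -> b) -> (b -> c) -> (a -> c); differentiable end to end."""
    def __init__(self, f, g):
        super().__init__()
        assert f.out_type == g.in_type  # ill-typed programs are never built
        self.f, self.g = f, g
        self.in_type, self.out_type = f.in_type, g.out_type

    def forward(self, x):
        return self.g(self.f(x))

class MapList(nn.Module):
    """map : (a -> b) -> (List a -> List b); shown for illustration only."""
    def __init__(self, f):
        super().__init__()
        self.f = f
        self.in_type, self.out_type = f"List {f.in_type}", f"List {f.out_type}"

    def forward(self, xs):  # xs is a Python list of tensors
        return [self.f(x) for x in xs]

# --- Type-directed search -----------------------------------------------------

def enumerate_programs(library, goal_in, goal_out):
    """Yield only well-typed candidates of type goal_in -> goal_out."""
    for f in library:
        if (f.in_type, f.out_type) == (goal_in, goal_out):
            yield f
    for f in library:
        for g in library:
            if (f.in_type == goal_in and f.out_type == g.in_type
                    and g.out_type == goal_out):
                yield Compose(f, g)

# --- Neural training of each synthesized program ------------------------------

library = [PerceiveDigit(), Regress()]
for prog in enumerate_programs(library, "Image", "R"):
    opt = torch.optim.SGD(prog.parameters(), lr=1e-2)
    x = torch.randn(32, 1, 28, 28)   # dummy image batch
    y = torch.randn(32, 1)           # dummy regression targets
    for _ in range(3):               # a few illustrative SGD steps
        loss = nn.functional.mse_loss(prog(x), y)
        opt.zero_grad()
        loss.backward()
        opt.step()
```

The sketch shows the division of labor the abstract describes: the symbolic layer uses types to prune the program space before any gradient is computed, while every surviving candidate remains an ordinary differentiable module, so library functions such as PerceiveDigit can be trained once and reused across tasks.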
