Paper information: Relational inductive bias for physical construction in humans and machines

While current deep learning systems excel at tasks such as object classification, language processing, and gameplay, few can construct or modify a complex system such as a tower of blocks. We hypothesize that what these systems lack is a "relational inductive bias": a capacity for reasoning about inter-object relations and making choices over a structured description of a scene. To test this hypothesis, we focus on a task that involves gluing pairs of blocks together to stabilize a tower, and quantify how well humans perform. We then introduce a deep reinforcement learning agent which uses object- and relation-centric scene and policy representations and apply it to the task. Our results show that these structured representations allow the agent to outperform both humans and more naive approaches, suggesting that relational inductive bias is an important component in solving structured reasoning problems and for building more intelligent, flexible machines.
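To make the idea of an object- and relation-centric scene description concrete, here is a minimal sketch. It is an illustration only and not the paper's implementation. It assumes hypothetical axis-aligned 2D blocks, treats each block as a graph node, and treats each pair of blocks in vertical contact as an edge. Under that assumption, the candidate "glue" actions are simply the edges of the graph. The names `Block`, `in_contact`, and `scene_graph` are invented for this example.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Block:
    """Hypothetical axis-aligned 2D block given by its centre and size."""
    x: float
    y: float
    width: float
    height: float


def in_contact(a: Block, b: Block, tol: float = 1e-6) -> bool:
    """Return True if one block rests directly on top of the other.

    Assumption: only vertical (stacking) contact counts; side-by-side
    contact is ignored to keep the sketch short.
    """
    lower, upper = (a, b) if a.y <= b.y else (b, a)
    top_of_lower = lower.y + lower.height / 2
    bottom_of_upper = upper.y - upper.height / 2
    touching = abs(top_of_lower - bottom_of_upper) <= tol
    # Length of the horizontal interval shared by the two blocks.
    overlap = (min(lower.x + lower.width / 2, upper.x + upper.width / 2)
               - max(lower.x - lower.width / 2, upper.x - upper.width / 2))
    return touching and overlap > tol


def scene_graph(blocks):
    """Build (nodes, edges): nodes are block indices, edges are contacting pairs."""
    nodes = list(range(len(blocks)))
    edges = [(i, j) for i, j in combinations(nodes, 2)
             if in_contact(blocks[i], blocks[j])]
    return nodes, edges


# A small made-up tower: one base, two blocks on it, one block spanning both.
tower = [
    Block(0.0, 0.5, 2.0, 1.0),
    Block(0.5, 1.5, 2.0, 1.0),
    Block(-0.8, 1.5, 0.5, 1.0),
    Block(0.0, 2.5, 3.0, 1.0),
]
nodes, edges = scene_graph(tower)
print(edges)  # each edge is one candidate pair of blocks to glue
```

In this representation a policy chooses among edges rather than among pixels or a fixed-size action vector, which is one way to read "making choices over a structured description of a scene".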
