Paper information - Hierarchical Rule Induction Network for Abstract Visual Reasoning

Abstract reasoning refers to the ability to analyze information, discover rules at an intangible level, and solve problems in innovative ways. The Raven's Progressive Matrices (RPM) test is typically used to examine this capability. In the test, the subject is asked to identify the correct choice from the answer set to fill the missing panel at the bottom right of an RPM (e.g., a $3\times 3$ matrix), following the underlying rules inside the matrix. Recent studies, taking advantage of Convolutional Neural Networks (CNNs), have achieved encouraging progress on RPM test problems. Unfortunately, because they rely solely on relation extraction at the matrix level, they fail to recognize the complex attribute patterns inside or across the rows and columns of an RPM. To address this problem, in this paper we propose a Hierarchical Rule Induction Network (HriNet) that imitates human induction strategies. HriNet extracts rule embeddings at multiple levels of granularity and integrates them through a gated embedding fusion module. We further introduce a rule similarity metric based on the embeddings, so that HriNet can not only be trained using a tuplet loss but also infer the best answer according to the similarity score. To comprehensively evaluate HriNet, we first fix the defects contained in the very recent RAVEN dataset and generate a new one named Balanced-RAVEN. Then extensive experiments are conducted on the large-scale dataset PGM and our Balanced-RAVEN, the results of which show that HriNet outperforms the state-of-the-art models by a large margin.
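The abstract states that a rule similarity score is used both for training with a tuplet loss and for choosing the answer, but it does not give the formulas. The following is only a minimal sketch of that general idea, under loudly stated assumptions: cosine similarity stands in for the paper's rule similarity metric, the loss follows the common N-pair (tuplet) form $\log\bigl(1 + \sum_{j \neq +} e^{s_j - s_+}\bigr)$, and all function names and toy vectors are hypothetical rather than taken from HriNet.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def score_candidates(context_rule, candidate_rules):
    """Score each candidate's rule embedding against the context rule embedding."""
    return [cosine_similarity(context_rule, c) for c in candidate_rules]


def infer_answer(context_rule, candidate_rules):
    """Return the index of the candidate with the highest similarity score."""
    scores = score_candidates(context_rule, candidate_rules)
    return max(range(len(scores)), key=scores.__getitem__)


def tuplet_loss(scores, positive_index):
    """N-pair style tuplet loss: log(1 + sum over negatives of exp(s_neg - s_pos))."""
    pos = scores[positive_index]
    negatives = [s for i, s in enumerate(scores) if i != positive_index]
    return math.log1p(sum(math.exp(s - pos) for s in negatives))


# Toy data (hypothetical): one context rule embedding and three candidates.
context_rule = [1.0, 0.0, 1.0]
candidate_rules = [
    [0.0, 1.0, 0.0],     # orthogonal to the context rule
    [1.0, 0.1, 0.9],     # close to the context rule
    [-1.0, 0.0, -1.0],   # opposite to the context rule
]

scores = score_candidates(context_rule, candidate_rules)
best = infer_answer(context_rule, candidate_rules)
loss_if_correct = tuplet_loss(scores, positive_index=1)
loss_if_wrong = tuplet_loss(scores, positive_index=2)
```

In this toy setup the loss is smaller when the labeled positive is the candidate whose embedding is most similar to the context rule, so minimizing it pushes the correct answer's score above the others, and inference then reduces to taking the highest score.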
