论文信息 - Geometric Generalization Based Zero-Shot Learning Dataset Infinite World: Simple Yet Powerful

Geometric Generalization Based Zero-Shot Learning Dataset Infinite World: Simple Yet Powerful

Raven's Progressive Matrices are one of the widely used tests in evaluating the human test taker's fluid intelligence. Analogously, this paper introduces geometric generalization based zero-shot learning tests to measure the rapid learning ability and the internal consistency of deep generative models. Our empirical research analysis on state-of-the-art generative models discern their ability to generalize concepts across classes. In the process, we introduce Infinite World, an evaluable, scalable, multi-modal, light-weight dataset and Zero-Shot Intelligence Metric ZSI. The proposed tests condenses human-level spatial and numerical reasoning tasks to its simplistic geometric forms. The dataset is scalable to a theoretical limit of infinity, in numerical features of the generated geometric figures, image size and in quantity. We systematically analyze state-of-the-art model's internal consistency, identify their bottlenecks and propose a pro-active optimization method for few-shot and zero-shot learning.

[1] John F. Canny,et al. A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[3] Zhe Gan,et al. AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4] Hugo Larochelle,et al. Optimization as a Model for Few-Shot Learning , 2016, ICLR.

[5] Michael D. Howard,et al. Complementary Learning Systems , 2014, Cogn. Sci..

[6] Daan Wierstra,et al. One-shot Learning with Memory-Augmented Neural Networks , 2016, ArXiv.

[7] Peter Johnston,et al. ON READING COMPREHENSION , 2002 .

[8] Li Fei-Fei,et al. CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Konrad P. Körding,et al. Toward an Integration of Deep Learning and Neuroscience , 2016, bioRxiv.

[10] Jürgen Schmidhuber,et al. Deep learning in neural networks: An overview , 2014, Neural Networks.

[11] Yoshua Bengio,et al. FigureQA: An Annotated Figure Dataset for Visual Reasoning , 2017, ICLR.

[12] Dimitris N. Metaxas,et al. StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[13] Bernt Schiele,et al. Generative Adversarial Text to Image Synthesis , 2016, ICML.

[14] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[15] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[16] R. Cattell. Theory of fluid and crystallized intelligence: A critical experiment. , 1963 .

[17] J. Tenenbaum. A Bayesian framework for concept learning , 1999 .

[18] Michael I. Jordan,et al. Estimation, Optimization, and Parallelism when Data is Sparse , 2013, NIPS.

[19] Ruslan Salakhutdinov,et al. Gated-Attention Architectures for Task-Oriented Language Grounding , 2017, AAAI.

[20] Philip H. S. Torr,et al. An embarrassingly simple approach to zero-shot learning , 2015, ICML.

[21] Alun D. Preece,et al. Interpretability of deep learning models: A survey of results , 2017, 2017 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI).

[22] Razvan Pascanu,et al. A simple neural network module for relational reasoning , 2017, NIPS.

[23] J. Duncan,et al. Fluid intelligence after frontal lobe lesions , 1995, Neuropsychologia.

[24] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.

[25] David H. Douglas,et al. ALGORITHMS FOR THE REDUCTION OF THE NUMBER OF POINTS REQUIRED TO REPRESENT A DIGITIZED LINE OR ITS CARICATURE , 1973 .

[26] Razvan Pascanu,et al. Memory-based Parameter Adaptation , 2018, ICLR.

[27] Herbert A. Simon,et al. Why a Diagram is (Sometimes) Worth Ten Thousand Words , 1987, Cogn. Sci..

[28] Anima Anandkumar,et al. Efficient approaches for escaping higher order saddle points in non-convex optimization , 2016, COLT.

[29] Maurice K. Wong,et al. Algorithm AS136: A k-means clustering algorithm. , 1979 .

[30] A. Einstein,et al. On gravitational waves , 1937 .

[31] Jorge Nocedal,et al. Optimization Methods for Large-Scale Machine Learning , 2016, SIAM Rev..

[32] Rishi Sharma,et al. A Note on the Inception Score , 2018, ArXiv.

[33] Furong Huang,et al. Escaping From Saddle Points - Online Stochastic Gradient for Tensor Decomposition , 2015, COLT.