Paper information - Connecting Context-specific Adaptation in Humans to Meta-learning.

Cognitive control, the ability of a system to adapt to the demands of a task, is an integral part of cognition. A widely accepted fact about cognitive control is that it is context-sensitive: Adults and children alike infer information about a task's demands from contextual cues and use these inferences to learn from ambiguous cues. However, the precise way in which people use contextual cues to guide adaptation to a new task remains poorly understood. This work connects the context-sensitive nature of cognitive control to a method for meta-learning with context-conditioned adaptation. We begin by identifying an essential difference between human learning and current approaches to meta-learning: In contrast to humans, existing meta-learning algorithms do not make use of task-specific contextual cues but instead rely exclusively on online feedback in the form of task-specific labels or rewards. To remedy this, we introduce a framework for using contextual information about a task to guide the initialization of task-specific models before adaptation to online feedback. We show how context-conditioned meta-learning can capture human behavior in a cognitive task and how it can be scaled to improve the speed of learning in various settings, including few-shot classification and low-sample reinforcement learning. Our work demonstrates that guiding meta-learning with task information can capture complex, human-like behavior, thereby deepening our understanding of cognitive control.
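The core idea in the abstract, using a contextual cue to set a task-specific initialization before adapting to labeled feedback, can be illustrated with a toy sketch. This is not the paper's implementation: the 1-D regression task, the linear context-to-weight map, and the names `init_from_context`, `adapt`, and `meta_params` are all hypothetical, and the meta-parameters are fixed by hand rather than meta-learned.

```python
# Hypothetical sketch: context-conditioned initialization followed by
# gradient-based adaptation on a 1-D linear regression task y = w * x.
# All names and numbers here are illustrative assumptions.

def init_from_context(context, meta_params):
    """Map a task's context cue to an initial task-specific weight."""
    scale, offset = meta_params
    return scale * context + offset

def mse_loss(w, data):
    """Mean squared error of the model y = w * x on (x, y) pairs."""
    return sum((w * x - y) ** 2 for x, y in data) / len(data)

def adapt(w, data, lr=0.1, steps=1):
    """Inner loop: plain gradient descent on the task's labeled feedback."""
    for _ in range(steps):
        grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
        w = w - lr * grad
    return w

# Assumed toy task: the true slope is 2.0, and the context cue 1.9
# is an informative but imperfect hint about that slope.
task_data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
meta_params = (1.0, 0.0)  # would be meta-learned across tasks; fixed here

w_context = init_from_context(1.9, meta_params)  # context-conditioned start
w_generic = 0.0                                  # context-free shared start

# Same single adaptation step from both starting points.
loss_context = mse_loss(adapt(w_context, task_data), task_data)
loss_generic = mse_loss(adapt(w_generic, task_data), task_data)

print("loss after one step, context-conditioned init:", loss_context)
print("loss after one step, context-free init:", loss_generic)
```

In this toy setting the context-conditioned start lies closer to the task's solution, so the same single gradient step leaves a smaller loss than the context-free start. In a full meta-learning setup, the map from context to initialization would itself be trained across many tasks.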
