暂无分享,去创建一个
Yoshua Bengio | Sergey Levine | Anirudh Goyal | Matthew Botvinick | Yoshua Bengio | S. Levine | M. Botvinick | Anirudh Goyal
[1] Sergey Levine,et al. InfoBot: Transfer and Exploration via the Information Bottleneck , 2019, ICLR.
[2] Shakir Mohamed,et al. Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning , 2015, NIPS.
[3] Shimon Whiteson,et al. Learning to Communicate with Deep Multi-Agent Reinforcement Learning , 2016, NIPS.
[4] Naftali Tishby,et al. The information bottleneck method , 2000, ArXiv.
[5] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[6] Doina Precup,et al. An information-theoretic approach to curiosity-driven reinforcement learning , 2012, Theory in Biosciences.
[7] Pieter Abbeel,et al. Emergence of Grounded Compositional Language in Multi-Agent Populations , 2017, AAAI.
[8] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[9] Thomas M. Cover,et al. Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing) , 2006 .
[10] Yoshua Bengio,et al. BabyAI: First Steps Towards Grounded Language Learning With a Human In the Loop , 2018, ArXiv.
[11] Daan Wierstra,et al. Variational Intrinsic Control , 2016, ICLR.
[12] Rob Fergus,et al. Learning Multiagent Communication with Backpropagation , 2016, NIPS.
[13] Filip De Turck,et al. VIME: Variational Information Maximizing Exploration , 2016, NIPS.
[14] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[15] Alexander A. Alemi,et al. Deep Variational Information Bottleneck , 2017, ICLR.
[16] Regina Barzilay,et al. Representation Learning for Grounded Spatial Reasoning , 2017, TACL.
[17] Tom Schaul,et al. Universal Value Function Approximators , 2015, ICML.
[18] D. Kahneman. Maps of Bounded Rationality: Psychology for Behavioral Economics , 2003 .
[19] Sergey Levine,et al. Recurrent Independent Mechanisms , 2019, ICLR.
[20] Jason Weston,et al. End-To-End Memory Networks , 2015, NIPS.
[21] Daniel Polani,et al. Grounding subgoals in information transitions , 2011, 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).
[22] Jonathan D. Cohen,et al. Toward a Rational and Mechanistic Account of Mental Effort. , 2017, Annual review of neuroscience.
[23] A. Dickinson. Actions and habits: the development of behavioural autonomy , 1985 .
[24] Alex Graves,et al. Neural Turing Machines , 2014, ArXiv.
[25] Christopher Joseph Pal,et al. Sparse Attentive Backtracking: Temporal CreditAssignment Through Reminding , 2018, NeurIPS.
[26] S. Sloman. The empirical case for two systems of reasoning. , 1996 .
[27] Yoshua Bengio,et al. Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation , 2013, ArXiv.
[28] M. Botvinick,et al. Motivation and cognitive control: from behavior to neural mechanism. , 2015, Annual review of psychology.
[29] Alex Graves,et al. Recurrent Models of Visual Attention , 2014, NIPS.
[30] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.
[31] Stefano Soatto,et al. Information Dropout: learning optimal representations through noise , 2017, ArXiv.