暂无分享,去创建一个
[1] Joshua B. Tenenbaum,et al. Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.
[2] Hang Li,et al. Meta-SGD: Learning to Learn Quickly for Few Shot Learning , 2017, ArXiv.
[3] B. Welford. Note on a Method for Calculating Corrected Sums of Squares and Products , 1962 .
[4] Ashique Mahmood. Incremental Off-policy Reinforcement Learning Algorithms , 2017 .
[5] Daan Wierstra,et al. Meta-Learning with Memory-Augmented Neural Networks , 2016, ICML.
[6] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[7] Christopher Joseph Pal,et al. A Meta-Transfer Objective for Learning to Disentangle Causal Mechanisms , 2019, ICLR.
[8] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[9] Ameet Talwalkar,et al. Random Search and Reproducibility for Neural Architecture Search , 2019, UAI.
[10] David Lopez-Paz,et al. Invariant Risk Minimization , 2019, ArXiv.
[11] Amit Dhurandhar,et al. Invariant Risk Minimization Games , 2020, ICML.
[12] Jing Peng,et al. An Efficient Gradient-Based Algorithm for On-Line Training of Recurrent Network Trajectories , 1990, Neural Computation.
[13] W. D. Wightman. Scientific Method , 1932, Nature.
[14] Martha White,et al. Meta-Learning Representations for Continual Learning , 2019, NeurIPS.
[15] Paul J. Werbos,et al. Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.
[16] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[17] Patrick M. Pilarski,et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction , 2011, AAMAS.
[18] Benjamin Recht,et al. Simple random search of static linear policies is competitive for reinforcement learning , 2018, NeurIPS.
[19] David Lopez-Paz,et al. From Dependence to Causation , 2016, 1607.03300.
[20] Ole Tange,et al. GNU Parallel: The Command-Line Power Tool , 2011, login Usenix Mag..
[21] Geoffrey E. Hinton,et al. Training Recurrent Neural Networks , 2013 .
[22] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[23] Richard S. Sutton,et al. On the role of tracking in stationary environments , 2007, ICML '07.
[24] Mohamed Chtourou,et al. On the training of recurrent neural networks , 2011, Eighth International Multi-Conference on Systems, Signals & Devices.
[25] Richard S. Sutton,et al. Sample-based learning and search with permanent and transient memories , 2008, ICML '08.
[26] Illtyd Trethowan. Causality , 1938 .
[27] Ethan Caballero,et al. Out-of-Distribution Generalization via Risk Extrapolation (REx) , 2020, ArXiv.
[28] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[29] Richard S. Sutton,et al. Representation Search through Generate and Test , 2013, AAAI Workshop: Learning Rich Representations from Low-Level Sensors.
[30] Christopher Joseph Pal,et al. Sparse Attentive Backtracking: Temporal CreditAssignment Through Reminding , 2018, NeurIPS.
[31] Yoshua Bengio,et al. Learning Neural Causal Models from Unknown Interventions , 2019, ArXiv.
[32] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .