暂无分享,去创建一个
[1] John Langford,et al. Efficient Optimal Learning for Contextual Bandits , 2011, UAI.
[2] Mark B. Ring. Continual learning in reinforcement environments , 1995, GMD-Bericht.
[3] Li Zhou,et al. Latent Contextual Bandits and their Application to Personalized Recommendations for New Users , 2016, IJCAI.
[4] Sebastian Thrun,et al. Learning to Learn , 1998, Springer US.
[5] David H. Wolpert,et al. No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..
[6] Chris Tar,et al. A Growing Long-term Episodic & Semantic Memory , 2016, ArXiv.
[7] Eric Eaton,et al. ELLA: An Efficient Lifelong Learning Algorithm , 2013, ICML.
[8] Quoc V. Le,et al. Multi-task Sequence to Sequence Learning , 2015, ICLR.
[9] Eric Eaton,et al. Autonomous Cross-Domain Knowledge Transfer in Lifelong Policy Gradient Reinforcement Learning , 2015, IJCAI.
[10] Eric Eaton,et al. Lifelong Transfer Learning for Heterogeneous Teams of Agents in Sequential Decision Processes , 2016 .
[11] Marcin Andrychowicz,et al. Learning to learn by gradient descent by gradient descent , 2016, NIPS.
[12] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.
[13] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..
[14] Razvan Pascanu,et al. Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.