论文信息 - A Brief Study of In-Domain Transfer and Learning from Fewer Samples using A Few Simple Priors

A Brief Study of In-Domain Transfer and Learning from Fewer Samples using A Few Simple Priors

Domain knowledge can often be encoded in the structure of a network, such as convolutional layers for vision, which has been shown to increase generalization and decrease sample complexity, or the number of samples required for successful learning. In this study, we ask whether sample complexity can be reduced for systems where the structure of the domain is unknown beforehand, and the structure and parameters must both be learned from the data. We show that sample complexity reduction through learning structure is possible for at least two simple cases. In studying these cases, we also gain insight into how this might be done for more complex domains.

[1] John Langford,et al. Efficient Optimal Learning for Contextual Bandits , 2011, UAI.

[2] Mark B. Ring. Continual learning in reinforcement environments , 1995, GMD-Bericht.

[3] Li Zhou,et al. Latent Contextual Bandits and their Application to Personalized Recommendations for New Users , 2016, IJCAI.

[4] Sebastian Thrun,et al. Learning to Learn , 1998, Springer US.

[5] David H. Wolpert,et al. No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[6] Chris Tar,et al. A Growing Long-term Episodic & Semantic Memory , 2016, ArXiv.

[7] Eric Eaton,et al. ELLA: An Efficient Lifelong Learning Algorithm , 2013, ICML.

[8] Quoc V. Le,et al. Multi-task Sequence to Sequence Learning , 2015, ICLR.

[9] Eric Eaton,et al. Autonomous Cross-Domain Knowledge Transfer in Lifelong Policy Gradient Reinforcement Learning , 2015, IJCAI.

[10] Eric Eaton,et al. Lifelong Transfer Learning for Heterogeneous Teams of Agents in Sequential Decision Processes , 2016 .

[11] Marcin Andrychowicz,et al. Learning to learn by gradient descent by gradient descent , 2016, NIPS.

[12] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[13] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[14] Razvan Pascanu,et al. Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.