Between Imitation and Intention Learning