Learning for control from multiple demonstrations
暂无分享,去创建一个
Pieter Abbeel | Andrew Y. Ng | Adam Coates | A. Ng | P. Abbeel | A. Coates | Adam Coates
[1] S. B. Needleman,et al. A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.
[2] David Q. Mayne,et al. Differential dynamic programming , 1972, The Mathematical Gazette.
[3] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[4] S. Chiba,et al. Dynamic programming algorithm optimization for spoken word recognition , 1978 .
[5] R. Fildes. Journal of the Royal Statistical Society (B): Gary K. Grunwald, Adrian E. Raftery and Peter Guttorp, 1993, “Time series of continuous proportions”, 55, 103–116.☆ , 1993 .
[6] Craig Boutilier,et al. Context-Specific Independence in Bayesian Networks , 1996, UAI.
[7] Stefan Schaal,et al. Robot Learning From Demonstration , 1997, ICML.
[8] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[9] Eric Feron,et al. Control Logic for Automated Aerobatic Flight of a Miniature Helicopter , 2002 .
[10] Radford M. Neal,et al. Multiple Alignment of Continuous Time Series , 2004, NIPS.
[11] Andrew W. Moore,et al. Locally Weighted Learning for Control , 1997, Artificial Intelligence Review.
[12] Ben Tse,et al. Autonomous Inverted Helicopter Flight via Reinforcement Learning , 2004, ISER.
[13] Andrew W. Moore,et al. Locally Weighted Learning , 1997, Artificial Intelligence Review.
[14] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[15] Pieter Abbeel,et al. Learning vehicular dynamics, with application to modeling helicopters , 2005, NIPS.
[16] Pieter Abbeel,et al. An Application of Reinforcement Learning to Aerobatic Helicopter Flight , 2006, NIPS.
[17] J. Andrew Bagnell,et al. Maximum margin planning , 2006, ICML.
[18] Pieter Abbeel,et al. Using inaccurate models in reinforcement learning , 2006, ICML.
[19] Robert E. Schapire,et al. A Game-Theoretic Approach to Apprenticeship Learning , 2007, NIPS.
[20] Aude Billard,et al. On Learning, Representing, and Generalizing a Task in a Humanoid Robot , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).
[21] J. Listgarten. Analysis of sibling time series data: Alignment and difference detection , 2007 .
[22] Csaba Szepesvári,et al. Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods , 2007, UAI.
[23] Eyal Amir,et al. Bayesian Inverse Reinforcement Learning , 2007, IJCAI.
[24] M. Grassi,et al. AIAA Guidance, Navigation, and Control Conference , 2008 .