Jonathan Lee | Ajay Kumar Tanwani | Michael Laskey | Kenneth Y. Goldberg | Anil Aswani
[1] Anca D. Dragan, et al. Comparing human-centric and robot-centric sampling for robot deep learning from demonstrations, 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[2] J. A. Bagnell, et al. An Invitation to Imitation, 2015.
[3] Byron Boots, et al. Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction, 2017, ICML.
[4] Jinfeng Yi, et al. Improved Dynamic Regret for Non-degenerate Functions, 2016, NIPS.
[5] Rebecca Willett, et al. Online Convex Optimization in Dynamic Environments, 2015, IEEE Journal of Selected Topics in Signal Processing.
[6] Karthik Sridharan, et al. Optimization, Learning, and Games with Predictable Sequences, 2013, NIPS.
[7] Pieter Abbeel, et al. An Algorithmic Perspective on Imitation Learning, 2018, Found. Trends Robotics.
[8] Seshadhri Comandur, et al. Adaptive Algorithms for Online Decision Problems, 2007, Electronic Colloquium on Computational Complexity, Report No. 88.
[9] Byron Boots, et al. Convergence of Value Aggregation for Imitation Learning, 2018, AISTATS.
[10] Jinfeng Yi, et al. Tracking Slowly Moving Clairvoyant: Optimal Dynamic Regret of Online Learning with True and Noisy Gradient, 2016, ICML.
[11] Elad Hazan, et al. Introduction to Online Convex Optimization, 2016, Found. Trends Optim.
[12] Byron Boots, et al. Accelerating Imitation Learning with Predictive Models, 2018, AISTATS.
[13] Byron Boots, et al. Model-Based Imitation Learning with Accelerated Convergence, 2018, ArXiv.
[14] Nolan Wagener, et al. Fast Policy Learning through Imitation and Reinforcement, 2018, UAI.
[15] Aryan Mokhtari, et al. Optimization in Dynamic Environments: Improved Regret Rates for Strongly Convex Problems, 2016.
[16] Martin Zinkevich, et al. Online Convex Programming and Generalized Infinitesimal Gradient Ascent, 2003, ICML.
[17] Elad Hazan, et al. An optimal algorithm for stochastic strongly-convex optimization, 2010, arXiv:1006.2425.
[18] Geoffrey J. Gordon, et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning, 2010, AISTATS.