Imitation Learning by Coaching
暂无分享,去创建一个
He He | Hal Daumé | Jason Eisner | Jason Eisner | Hal Daumé | He He
[1] Tamir Hazan,et al. Direct Loss Minimization for Structured Prediction , 2010, NIPS.
[2] H. Brendan McMahan,et al. Follow-the-Regularized-Leader and Mirror Descent: Equivalence Theorems and L1 Regularization , 2011, AISTATS.
[3] Ludovic Denoyer,et al. Datum-Wise Classification: A Sequential Approach to Sparsity , 2011, ECML/PKDD.
[4] Elad Hazan,et al. Logarithmic regret algorithms for online convex optimization , 2006, Machine Learning.
[5] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..
[6] Philip Resnik,et al. Online Large-Margin Training of Syntactic and Structural Translation Features , 2008, EMNLP.
[7] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[8] Ben Taskar,et al. An End-to-End Discriminative Approach to Machine Translation , 2006, ACL.
[9] J. Andrew Bagnell,et al. Efficient Reductions for Imitation Learning , 2010, AISTATS.
[10] Quoc V. Le,et al. Proximal regularization for online and batch learning , 2009, ICML '09.
[11] John Langford,et al. Search-based structured prediction , 2009, Machine Learning.
[12] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.
[13] Sham M. Kakade,et al. Mind the Duality Gap: Logarithmic regret algorithms for online optimization , 2008, NIPS.
[14] Balázs Kégl,et al. Fast classification using sparse decision DAGs , 2012, ICML.