Maximum margin planning
暂无分享,去创建一个
J. Andrew Bagnell | Martin Zinkevich | Nathan D. Ratliff | Martin A. Zinkevich | J. Bagnell | Drew Bagnell | Ashesh Jain | Michael Hu | Nathan D. Ratliff | Ashesh Jain | Michael Hu | Drew Bagnell | Martin A Zinkevich | Martin Zinkevich
[1] Naum Zuselevich Shor,et al. Minimization Methods for Non-Differentiable Functions , 1985, Springer Series in Computational Mathematics.
[2] Dean Pomerleau,et al. ALVINN, an autonomous land vehicle in a neural network , 2015 .
[3] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[4] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[5] Martial Hebert,et al. Mobility planning for autonomous navigation of multiple robots in unstructured environments , 1998, Proceedings of the 1998 IEEE International Symposium on Intelligent Control (ISIC) held jointly with IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA) Intell.
[6] Jason Weston,et al. Multi-Class Support Vector Machines , 1998 .
[7] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.
[8] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[9] D. Bertsekas,et al. Convergen e Rate of In remental Subgradient Algorithms , 2000 .
[10] Ben Taskar,et al. Max-Margin Markov Networks , 2003, NIPS.
[11] Sebastian Thrun,et al. Perspectives on standardization in mobile robot programming: the Carnegie Mellon Navigation (CARMEN) Toolkit , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).
[12] Martin Zinkevich,et al. Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.
[13] Johan A. K. Suykens,et al. Advances in learning theory : methods, models and applications , 2003 .
[14] Tong Zhang,et al. Solving large scale linear prediction problems using stochastic gradient descent algorithms , 2004, ICML.
[15] Martial Hebert,et al. Natural terrain classification using 3-d ladar data , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.
[16] Jing Peng,et al. SVM vs regularized least squares classification , 2004, ICPR 2004.
[17] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[18] Ben Taskar,et al. Learning structured prediction models: a large margin approach , 2005, ICML.
[19] Yann LeCun,et al. Off-Road Obstacle Avoidance through End-to-End Learning , 2005, NIPS.
[20] Thomas Hofmann,et al. Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..
[21] Ben Taskar,et al. Structured Prediction via the Extragradient Method , 2005, NIPS.
[22] David M. Bradley,et al. Boosting Structured Prediction for Imitation Learning , 2006, NIPS.
[23] Elad Hazan,et al. Logarithmic regret algorithms for online convex optimization , 2006, Machine Learning.
[24] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.