Large-scale cost function learning for path planning using deep inverse reinforcement learning
暂无分享,去创建一个
Dushyant Rao | Markus Wulfmeier | Ingmar Posner | Dominic Zeng Wang | Peter Ondruska | Markus Wulfmeier | I. Posner | Peter Ondruska | Dushyant Rao | Dominic Zeng Wang
[1] Kee-Eung Kim,et al. Bayesian Nonparametric Feature Construction for Inverse Reinforcement Learning , 2013, IJCAI.
[2] Markus Wulfmeier,et al. Maximum Entropy Deep Inverse Reinforcement Learning , 2015, 1507.04888.
[3] Hannes Sommer,et al. Predicting actions to act predictably: Cooperative partial motion planning with maximum entropy models , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[4] Nitish Srivastava,et al. Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.
[5] Jianxiong Xiao,et al. DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[6] Wolfram Burgard,et al. Socially compliant mobile robot navigation via inverse reinforcement learning , 2016, Int. J. Robotics Res..
[7] Sergey Levine,et al. Feature Construction for Inverse Reinforcement Learning , 2010, NIPS.
[8] Sebastian Thrun,et al. Apprenticeship learning for motion planning with application to parking lot navigation , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[9] Sebastian Thrun,et al. Stanley: The robot that won the DARPA Grand Challenge , 2006, J. Field Robotics.
[10] Sergey Levine,et al. Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization , 2016, ICML.
[11] Ian D. Reid,et al. Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[12] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[13] William Whittaker,et al. Autonomous driving in urban environments: Boss and the Urban Challenge , 2008, J. Field Robotics.
[14] David Silver,et al. Learning to search: Functional gradient techniques for imitation learning , 2009, Auton. Robots.
[15] J. Andrew Bagnell,et al. Maximum margin planning , 2006, ICML.
[16] Paul Newman,et al. Continually improving large scale long term visual navigation of a vehicle in dynamic urban environments , 2012, 2012 15th International IEEE Conference on Intelligent Transportation Systems.
[17] Andrea Vedaldi,et al. MatConvNet: Convolutional Neural Networks for MATLAB , 2014, ACM Multimedia.
[18] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[19] Wolfram Burgard,et al. Principles of Robot Motion: Theory, Algorithms, and Implementation ERRATA!!!! 1 , 2007 .
[20] Kian Hsiang Low,et al. Inverse Reinforcement Learning with Locally Consistent Reward Functions , 2015, NIPS.
[21] Sergey Levine,et al. Nonlinear Inverse Reinforcement Learning with Gaussian Processes , 2011, NIPS.
[22] Xiang Zhang,et al. OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.
[23] Martial Hebert,et al. Activity Forecasting , 2012, ECCV.
[24] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..
[25] Markus Wulfmeier,et al. Watch this: Scalable cost-function learning for path planning in urban environments , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[26] Howie Choset,et al. Principles of Robot Motion: Theory, Algorithms, and Implementation ERRATA!!!! 1 , 2007 .
[27] Manuel Lopes,et al. Active Learning for Reward Estimation in Inverse Reinforcement Learning , 2009, ECML/PKDD.
[28] Steven M. LaValle,et al. Planning algorithms , 2006 .
[29] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..
[30] Sebastian Thrun,et al. Junior: The Stanford entry in the Urban Challenge , 2008, J. Field Robotics.
[31] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[32] Dushyant Rao,et al. Incorporating Human Domain Knowledge into Large Scale Cost Function Learning , 2016, ArXiv.
[33] Dean A. Pomerleau,et al. Neural Network Based Autonomous Navigation , 1990 .
[34] Eyal Amir,et al. Bayesian Inverse Reinforcement Learning , 2007, IJCAI.
[35] Er Meng Joo,et al. A review of inverse reinforcement learning theory and recent advances , 2012, IEEE Congress on Evolutionary Computation.
[36] Wolfram Burgard,et al. Learning driving styles for autonomous vehicles from demonstration , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).
[37] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.
[38] Yann LeCun,et al. A multirange architecture for collision‐free off‐road robot navigation , 2009, J. Field Robotics.