Maximum Entropy Deep Inverse Reinforcement Learning
暂无分享,去创建一个
[1] Kurt Hornik,et al. Multilayer feedforward networks are universal approximators , 1989, Neural Networks.
[2] M.H. Hassoun,et al. Fundamentals of Artificial Neural Networks , 1996, Proceedings of the IEEE.
[3] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[4] Jan A Snyman,et al. Practical Mathematical Optimization: An Introduction to Basic Optimization Theory and Classical and New Gradient-Based Algorithms , 2005 .
[5] J. Andrew Bagnell,et al. Maximum margin planning , 2006, ICML.
[6] Robert E. Schapire,et al. A Game-Theoretic Approach to Apprenticeship Learning , 2007, NIPS.
[7] Yoshua Bengio,et al. Scaling learning algorithms towards AI , 2007 .
[8] Csaba Szepesvári,et al. Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods , 2007, UAI.
[9] Yoshua. Bengio,et al. Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..
[10] Eyal Amir,et al. Bayesian Inverse Reinforcement Learning , 2007, IJCAI.
[11] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[12] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..
[13] Chris L. Baker,et al. Action understanding as inverse planning , 2009, Cognition.
[14] Manuel Lopes,et al. Active Learning for Reward Estimation in Inverse Reinforcement Learning , 2009, ECML/PKDD.
[15] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..
[16] Sergey Levine,et al. Feature Construction for Inverse Reinforcement Learning , 2010, NIPS.
[17] Sergey Levine,et al. Nonlinear Inverse Reinforcement Learning with Gaussian Processes , 2011, NIPS.
[18] Michael L. Littman,et al. Apprenticeship Learning About Multiple Intentions , 2011, ICML.
[19] Martial Hebert,et al. Activity Forecasting , 2012, ECCV.
[20] Nitish Srivastava,et al. Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.
[21] Pascal Vincent,et al. Unsupervised Feature Learning and Deep Learning: A Review and New Perspectives , 2012, ArXiv.
[22] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[23] Shotaro Akaho,et al. An Application of Inverse Reinforcement Learning to Medical Records of Diabetes Treatment , 2013 .
[24] Kee-Eung Kim,et al. Bayesian Nonparametric Feature Construction for Inverse Reinforcement Learning , 2013, IJCAI.
[25] Xiang Zhang,et al. OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.
[26] Jonathan Tompson,et al. Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation , 2014, NIPS.
[27] Andrea Vedaldi,et al. MatConvNet: Convolutional Neural Networks for MATLAB , 2014, ACM Multimedia.
[28] Ian D. Reid,et al. Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[29] Trevor Darrell,et al. Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.