暂无分享,去创建一个
[1] Matus Telgarsky,et al. Spectrally-normalized margin bounds for neural networks , 2017, NIPS.
[2] Yingyu Liang,et al. Generalization and Equilibrium in Generative Adversarial Nets (GANs) , 2017, ICML.
[3] J. Andrew Bagnell,et al. Efficient Reductions for Imitation Learning , 2010, AISTATS.
[4] Jieping Ye,et al. Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation , 2019, KDD.
[5] Sergey Levine,et al. Learning Robust Rewards with Adversarial Inverse Reinforcement Learning , 2017, ICLR 2017.
[6] Tuo Zhao,et al. On Computation and Generalization of Generative Adversarial Imitation Learning , 2020, ICLR.
[7] Alexey Dosovitskiy,et al. End-to-End Driving Via Conditional Imitation Learning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[8] Dean Pomerleau,et al. Efficient Training of Artificial Neural Networks for Autonomous Navigation , 1991, Neural Computation.
[9] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.
[10] Peter Stone,et al. Behavioral Cloning from Observation , 2018, IJCAI.
[11] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[12] Léon Bottou,et al. Towards Principled Methods for Training Generative Adversarial Networks , 2017, ICLR.
[13] Michael H. Bowling,et al. Apprenticeship learning using linear programming , 2008, ICML '08.
[14] A. Müller. Integral Probability Metrics and Their Generating Classes of Functions , 1997, Advances in Applied Probability.
[15] Léon Bottou,et al. Wasserstein Generative Adversarial Networks , 2017, ICML.
[16] Shai Ben-David,et al. Understanding Machine Learning: From Theory to Algorithms , 2014 .
[17] Ilya Kostrikov,et al. Imitation Learning via Off-Policy Distribution Matching , 2019, ICLR.
[18] Rémi Munos,et al. Minimax Regret Bounds for Reinforcement Learning , 2017, ICML.
[19] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[20] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[21] Hilbert J. Kappen,et al. On the Sample Complexity of Reinforcement Learning with a Generative Model , 2012, ICML.
[22] Yang Yu,et al. Learning Environmental Calibration Actions for Policy Self-Evolution , 2018, IJCAI.
[23] Kavosh Asadi,et al. Lipschitz Continuity in Model-based Reinforcement Learning , 2018, ICML.
[24] Sergey Levine,et al. Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow , 2018, ICLR.
[25] Aaron C. Courville,et al. Improved Training of Wasserstein GANs , 2017, NIPS.
[26] Yang Yu,et al. Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning , 2018, AAAI.
[27] Igor Vajda,et al. On Divergences and Informations in Statistics and Information Theory , 2006, IEEE Transactions on Information Theory.
[28] Alexander A. Alemi,et al. Deep Variational Information Bottleneck , 2017, ICLR.
[29] Yang Yu,et al. Towards Sample Efficient Reinforcement Learning , 2018, IJCAI.
[30] Ameet Talwalkar,et al. Foundations of Machine Learning , 2012, Adaptive computation and machine learning.
[31] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[32] Imre Csiszár,et al. Information Theory and Statistics: A Tutorial , 2004, Found. Trends Commun. Inf. Theory.
[33] Sergey Levine,et al. When to Trust Your Model: Model-Based Policy Optimization , 2019, NeurIPS.
[34] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[35] Zhi-Hua Zhou,et al. Imitation Learning from Pixel-Level Demonstrations by HashReward , 2019 .
[36] Feng Liu,et al. On Computation and Generalization of Generative Adversarial Networks under Spectrum Control , 2019, ICLR.
[37] Jürgen Schmidhuber,et al. A Machine Learning Approach to Visual Perception of Forest Trails for Mobile Robots , 2016, IEEE Robotics and Automation Letters.
[38] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.
[39] Imre Csiszár,et al. Information Theory - Coding Theorems for Discrete Memoryless Systems, Second Edition , 2011 .
[40] John Langford,et al. Learning to Search Better than Your Teacher , 2015, ICML.
[41] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[42] Richard Zemel,et al. A Divergence Minimization Perspective on Imitation Learning Methods , 2019, CoRL.
[43] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[44] Michael Kearns,et al. Near-Optimal Reinforcement Learning in Polynomial Time , 2002, Machine Learning.
[45] Tao Xu,et al. On the Discrimination-Generalization Tradeoff in GANs , 2017, ICLR.
[46] Yuandong Tian,et al. Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees , 2018, ICLR.
[47] Martial Hebert,et al. Improving Multi-Step Prediction of Learned Time Series Models , 2015, AAAI.
[48] Stefano Ermon,et al. Model-Free Imitation Learning with Policy Optimization , 2016, ICML.