暂无分享,去创建一个
Joelle Pineau | Doina Precup | Yarin Gal | Clare Lyle | Amy Zhang | Angelos Filos | Shagun Sodhani | Marta Kwiatkowska | Doina Precup | Clare Lyle | Joelle Pineau | Angelos Filos | Y. Gal | Amy Zhang | Shagun Sodhani | M. Kwiatkowska
[1] Vladimir Vapnik,et al. Chervonenkis: On the uniform convergence of relative frequencies of events to their probabilities , 1971 .
[2] Kim G. Larsen,et al. Bisimulation through probabilistic testing (preliminary report) , 1989, POPL '89.
[3] David A. McAllester. PAC-Bayesian model averaging , 1999, COLT '99.
[4] J. Pearl. Causality: Models, Reasoning and Inference , 2000 .
[5] Robert Givan,et al. Equivalence notions and model minimization in Markov decision processes , 2003, Artif. Intell..
[6] Lihong Li,et al. PAC model-free reinforcement learning , 2006, ICML.
[7] Thomas J. Walsh,et al. Towards a Unified Theory of State Abstraction for MDPs , 2006, AI&M.
[8] Benjamin Van Roy. Performance Loss Bounds for Approximate Value Iteration with State Aggregation , 2006, Math. Oper. Res..
[9] R. Scheines,et al. Interventions and Causal Inference , 2007, Philosophy of Science.
[10] Peter Auer,et al. Near-optimal Regret Bounds for Reinforcement Learning , 2008, J. Mach. Learn. Res..
[11] Doina Precup,et al. Using Bisimulation for Policy Transfer in MDPs , 2010, AAAI.
[12] Tor Lattimore,et al. PAC Bounds for Discounted MDPs , 2012, ALT.
[13] Bernhard Schölkopf,et al. On causal and anticausal learning , 2012, ICML.
[14] Lihong Li,et al. Sample Complexity of Multi-task Reinforcement Learning , 2013, UAI.
[15] Jonas Peters,et al. Causal inference by using invariant prediction: identification and confidence intervals , 2015, 1501.01332.
[16] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.
[17] John Shawe-Taylor,et al. Learning Shared Representations in Multi-task Reinforcement Learning , 2016, ArXiv.
[18] Trevor Darrell,et al. Adversarial Discriminative Domain Adaptation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[19] Gintare Karolina Dziugaite,et al. Computing Nonvacuous Generalization Bounds for Deep (Stochastic) Neural Networks with Many More Parameters than Training Data , 2017, UAI.
[20] Matus Telgarsky,et al. Spectrally-normalized margin bounds for neural networks , 2017, NIPS.
[21] Yee Whye Teh,et al. Distral: Robust multitask reinforcement learning , 2017, NIPS.
[22] Tom Schaul,et al. Successor Features for Transfer in Reinforcement Learning , 2016, NIPS.
[23] Samy Bengio,et al. A Study on Overfitting in Deep Reinforcement Learning , 2018, ArXiv.
[24] Joelle Pineau,et al. Natural Environment Benchmarks for Reinforcement Learning , 2018, ArXiv.
[25] Ron Meir,et al. Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory , 2017, ICML.
[26] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[27] Yi Zhang,et al. Stronger generalization bounds for deep nets via a compression approach , 2018, ICML.
[28] Yuval Tassa,et al. DeepMind Control Suite , 2018, ArXiv.
[29] Christina Heinze-Deml,et al. Invariant Causal Prediction for Nonlinear Models , 2017, Journal of Causal Inference.
[30] Yann LeCun,et al. Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks , 2018, ArXiv.
[31] Marc G. Bellemare,et al. DeepMDP: Learning Continuous Latent Space Models for Representation Learning , 2019, ICML.
[32] David Lopez-Paz,et al. Invariant Risk Minimization , 2019, ArXiv.
[33] Nan Jiang,et al. Provably efficient RL with Rich Observations via Latent State Decoding , 2019, ICML.
[34] Yuandong Tian,et al. Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees , 2018, ICLR.
[35] Raia Hadsell,et al. Attention Privileged Reinforcement Learning For Domain Transfer , 2019, ArXiv.
[36] Taehoon Kim,et al. Quantifying Generalization in Reinforcement Learning , 2018, ICML.
[37] S. Levine,et al. Meta-Learning without Memorization , 2019, ICLR.
[38] Xingyou Song,et al. Observational Overfitting in Reinforcement Learning , 2019, ICLR.
[39] Hossein Mobahi,et al. Fantastic Generalization Measures and Where to Find Them , 2019, ICLR.
[40] Andrea Bonarini,et al. Sharing Knowledge in Multi-Task Deep Reinforcement Learning , 2020, ICLR.