Representation Balancing MDPs for Off-Policy Policy Evaluation
暂无分享,去创建一个
Yao Liu | Finale Doshi-Velez | Emma Brunskill | Aldo Faisal | Omer Gottesman | Aniruddh Raghu | Matthieu Komorowski | Finale Doshi-Velez | Omer Gottesman | M. Komorowski | A. Faisal | Aniruddh Raghu | Yao Liu | E. Brunskill | F. Doshi-Velez
[1] Michael Kearns,et al. Near-Optimal Reinforcement Learning in Polynomial Time , 2002, Machine Learning.
[2] Mihaela van der Schaar,et al. Learning Optimal Policies from Observational Data , 2018, ArXiv.
[3] Yishay Mansour,et al. Learning Bounds for Importance Weighting , 2010, NIPS.
[4] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[5] Nan Jiang,et al. Doubly Robust Off-policy Value Evaluation for Reinforcement Learning , 2015, ICML.
[6] Philip S. Thomas,et al. Using Options for Long-Horizon Off-Policy Evaluation , 2017, ArXiv.
[7] Philip S. Thomas,et al. Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning , 2016, ICML.
[8] Mehrdad Farajtabar,et al. More Robust Doubly Robust Off-policy Evaluation , 2018, ICML.
[9] Mihaela van der Schaar,et al. GANITE: Estimation of Individualized Treatment Effects using Generative Adversarial Nets , 2018, ICLR.
[10] Fredrik D. Johansson,et al. Learning Weighted Representations for Generalization Across Designs , 2018, 1802.08598.
[11] John Langford,et al. Doubly Robust Policy Evaluation and Learning , 2011, ICML.
[12] Philip S. Thomas,et al. High-Confidence Off-Policy Evaluation , 2015, AAAI.
[13] Leo A. Celi,et al. The MIMIC Code Repository: enabling reproducibility in critical care research , 2017, J. Am. Medical Informatics Assoc..
[14] Mihaela van der Schaar,et al. Bayesian Inference of Individualized Treatment Effects using Multi-task Gaussian Processes , 2017, NIPS.
[15] Sören R. Künzel,et al. Meta-learners for Estimating Heterogeneous Treatment Effects using Machine Learning , 2017 .
[16] Kenji Fukumizu,et al. On integral probability metrics, φ-divergences and binary classification , 2009, 0901.2698.
[17] Stefan Wager,et al. Estimation and Inference of Heterogeneous Treatment Effects using Random Forests , 2015, Journal of the American Statistical Association.
[18] Suchi Saria,et al. Reliable Decision Support using Counterfactual Models , 2017, NIPS.
[19] Philip S. Thomas,et al. Using Options and Covariance Testing for Long Horizon Off-Policy Policy Evaluation , 2017, NIPS.
[20] Peter Stone,et al. Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation , 2016, AAAI.
[21] Uri Shalit,et al. Learning Representations for Counterfactual Inference , 2016, ICML.
[22] Louis Wehenkel,et al. Clinical data based optimal STI strategies for HIV: a reinforcement learning approach , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.
[23] Sergey Levine,et al. Offline policy evaluation across representations with applications to educational games , 2014, AAMAS.
[24] Sören R. Künzel,et al. Metalearners for estimating heterogeneous treatment effects using machine learning , 2017, Proceedings of the National Academy of Sciences.
[25] J. Robins,et al. Estimation of Regression Coefficients When Some Regressors are not Always Observed , 1994 .
[26] Gert R. G. Lanckriet,et al. On the empirical estimation of integral probability metrics , 2012 .
[27] Doina Precup,et al. Eligibility Traces for Off-Policy Policy Evaluation , 2000, ICML.
[28] Uri Shalit,et al. Estimating individual treatment effect: generalization bounds and algorithms , 2016, ICML.
[29] P. Cochat,et al. Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.