Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation
暂无分享,去创建一个
Finale Doshi-Velez | Emma Brunskill | Leo Anthony Celi | Ramtin Keramati | Omer Gottesman | L. Celi | Omer Gottesman | Ramtin Keramati | E. Brunskill | F. Doshi-Velez
[1] John Langford,et al. Doubly Robust Policy Evaluation and Learning , 2011, ICML.
[2] Z. Obermeyer,et al. Predicting the Future - Big Data, Machine Learning, and Clinical Medicine. , 2016, The New England journal of medicine.
[3] Aldo A. Faisal,et al. The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care , 2018, Nature Medicine.
[4] J. Robins,et al. Double/Debiased Machine Learning for Treatment and Structural Parameters , 2017 .
[5] Masatoshi Uehara,et al. Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes , 2019, J. Mach. Learn. Res..
[6] Fredrik D. Johansson,et al. Guidelines for reinforcement learning in healthcare , 2019, Nature Medicine.
[7] Uri Shalit,et al. Estimating individual treatment effect: generalization bounds and algorithms , 2016, ICML.
[8] Philip S. Thomas,et al. Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning , 2016, ICML.
[9] Srivatsan Srinivasan,et al. Evaluating Reinforcement Learning Algorithms in Observational Health Settings , 2018, ArXiv.
[10] Yao Liu,et al. Representation Balancing MDPs for Off-Policy Policy Evaluation , 2018, NeurIPS.
[11] Peter Szolovits,et al. MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.
[12] Susan Athey,et al. Recursive partitioning for heterogeneous causal effects , 2015, Proceedings of the National Academy of Sciences.
[13] K. Imai,et al. Covariate balancing propensity score , 2014 .
[14] S. Athey,et al. Generalized random forests , 2016, The Annals of Statistics.
[15] Marcello Restelli,et al. Policy Optimization via Importance Sampling , 2018, NeurIPS.
[16] Yishay Mansour,et al. Learning Bounds for Importance Weighting , 2010, NIPS.
[17] Xinkun Nie,et al. Quasi-oracle estimation of heterogeneous treatment effects , 2017, Biometrika.
[18] Nan Jiang,et al. Doubly Robust Off-policy Value Evaluation for Reinforcement Learning , 2015, ICML.
[19] Doina Precup,et al. Eligibility Traces for Off-Policy Policy Evaluation , 2000, ICML.
[20] Wei-Yin Loh,et al. Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..
[21] Jeremy C. Weiss,et al. Derivation, Validation, and Potential Treatment Implications of Novel Clinical Phenotypes for Sepsis. , 2019, JAMA.
[22] Yao Zhang,et al. Robust Recursive Partitioning for Heterogeneous Treatment Effects with Uncertainty Quantification , 2020, NeurIPS.
[23] David Sontag,et al. Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models , 2019, ICML.
[24] R. Tibshirani,et al. Finding and assessing treatment effect sweet spots in clinical trial data. , 2020, 2011.10157.