Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions
暂无分享,去创建一个
Yao Liu | Finale Doshi-Velez | Emma Brunskill | Joseph Futoma | Omer Gottesman | Soanli Parbhoo | leo Anthony celi | L. Celi | Omer Gottesman | Yao Liu | Joseph D. Futoma | E. Brunskill | Soanli Parbhoo | F. Doshi-Velez
[1] Yuriy Brun,et al. Preventing undesirable behavior of intelligent machines , 2019, Science.
[2] Andrew W. Moore,et al. Variable Resolution Discretization in Optimal Control , 2002, Machine Learning.
[3] A. Girbes,et al. Time to stop randomized and large pragmatic trials for intensive care medicine syndromes: the case of sepsis and acute respiratory distress syndrome. , 2020, Journal of thoracic disease.
[4] Yisong Yue,et al. Batch Policy Learning under Constraints , 2019, ICML.
[5] Peter Stone,et al. Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation , 2016, AAAI.
[6] Adam Tauman Kalai,et al. Adaptively Learning the Crowd Kernel , 2011, ICML.
[7] H. O. Oudemans-van Straaten,et al. Unexplained mortality differences between septic shock trials: a systematic analysis of population characteristics and control-group mortality rates , 2018, Intensive Care Medicine.
[8] Peter Szolovits,et al. MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.
[9] Philip S. Thomas,et al. Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning , 2016, ICML.
[10] Nan Jiang,et al. Doubly Robust Off-policy Value Evaluation for Reinforcement Learning , 2015, ICML.
[11] Johan Pallud,et al. A Tumor Growth Inhibition Model for Low-Grade Glioma Treated with Chemotherapy or Radiotherapy , 2012, Clinical Cancer Research.
[12] Lihong Li,et al. Policy Certificates: Towards Accountable Reinforcement Learning , 2018, ICML.
[13] Percy Liang,et al. Understanding Black-box Predictions via Influence Functions , 2017, ICML.
[14] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 2005, IEEE Transactions on Neural Networks.
[15] David Sontag,et al. Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models , 2019, ICML.
[16] Fredrik D. Johansson,et al. Guidelines for reinforcement learning in healthcare , 2019, Nature Medicine.
[17] Yao Liu,et al. Combining Parametric and Nonparametric Models for Off-Policy Evaluation , 2019, ICML.
[18] Aldo A. Faisal,et al. The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care , 2018, Nature Medicine.
[19] Srivatsan Srinivasan,et al. Evaluating Reinforcement Learning Algorithms in Observational Health Settings , 2018, ArXiv.
[20] Samuel J. Gershman,et al. Human-in-the-Loop Interpretability Prior , 2018, NeurIPS.
[21] I. Kohane,et al. Biases in electronic health record data due to processes within the healthcare system: retrospective observational study , 2018, British Medical Journal.
[22] Philip S. Thomas,et al. High-Confidence Off-Policy Evaluation , 2015, AAAI.
[23] Pierre Geurts,et al. Tree-Based Batch Mode Reinforcement Learning , 2005, J. Mach. Learn. Res..
[24] Alan E Jones,et al. Emergency department hypotension predicts sudden unexpected in-hospital mortality: a prospective cohort study. , 2006, Chest.
[25] Claude Guerin,et al. High versus low blood-pressure target in patients with septic shock. , 2014, The New England journal of medicine.
[26] Doina Precup,et al. Eligibility Traces for Off-Policy Policy Evaluation , 2000, ICML.
[27] S. Weisberg,et al. Characterizations of an Empirical Influence Function for Detecting Influential Cases in Regression , 1980 .