Preventing undesirable behavior of intelligent machines
暂无分享,去创建一个
Yuriy Brun | Philip S Thomas | Bruno Castro da Silva | Andrew G Barto | Stephen Giguere | Emma Brunskill | A. Barto | P. Thomas | S. Giguere | Emma Brunskill | Yuriy Brun | Bruno Castro da Silva | E. Brunskill
[1] Sean P. Meyn,et al. An analysis of reinforcement learning with function approximation , 2008, ICML '08.
[2] Michael Carl Tschantz,et al. Automated Experiments on Ad Privacy Settings , 2014, Proc. Priv. Enhancing Technol..
[3] Howard C. Zisser,et al. Prandial Insulin Dosing Using Run-to-Run Control , 2007, Diabetes Care.
[4] Datta N. Godbole,et al. Addressing Multiobjective Control: Safety and Performance through Constrained Optimization , 2001, HSCC.
[5] Yair Zick,et al. Algorithmic Transparency via Quantitative Input Influence , 2017 .
[6] Scott Niekum,et al. Toward Probabilistic Safety Bounds for Robot Learning from Demonstration , 2017, AAAI Fall Symposia.
[7] Jun Sakuma,et al. Fairness-aware Learning through Regularization Approach , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.
[8] Javier García,et al. A comprehensive survey on safe reinforcement learning , 2015, J. Mach. Learn. Res..
[9] Darinka Dentcheva,et al. Concavity and efficient points of discrete distributions in probabilistic programming , 2000, Math. Program..
[10] N. Shah,et al. Increased Mortality of Patients With Diabetes Reporting Severe Hypoglycemia , 2012, Diabetes Care.
[11] Sean A. Munson,et al. Unequal Representation and Gender Stereotypes in Image Search Results for Occupations , 2015, CHI.
[12] Franco Turini,et al. k-NN as an implementation of situation testing for discrimination discovery and prevention , 2011, KDD.
[13] Dean Pomerleau,et al. ALVINN, an autonomous land vehicle in a neural network , 2015 .
[14] Alexandre M. Bayen,et al. A time-dependent Hamilton-Jacobi formulation of reachable sets for continuous dynamic games , 2005, IEEE Transactions on Automatic Control.
[15] Jan Peters,et al. Learning motor primitives for robotics , 2009, 2009 IEEE International Conference on Robotics and Automation.
[16] Daniel Kuhn,et al. Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations , 2015, Mathematical Programming.
[17] Eyal Dassau,et al. Safety Constraints in an Artificial Pancreatic β Cell: An Implementation of Model Predictive Control with Insulin on Board , 2009, Journal of diabetes science and technology.
[18] Leo Breiman,et al. Random Forests , 2001, Machine Learning.
[19] Vishal Gupta,et al. Data-driven robust optimization , 2013, Math. Program..
[20] George Konidaris,et al. Value Function Approximation in Reinforcement Learning Using the Fourier Basis , 2011, AAAI.
[21] Mohammad Ghavamzadeh,et al. Algorithms for CVaR Optimization in MDPs , 2014, NIPS.
[22] Suchi Saria,et al. A $3 Trillion Challenge to Computational Scientists: Transforming Healthcare Delivery , 2014, IEEE Intelligent Systems.
[23] Adam Tauman Kalai,et al. Decoupled Classifiers for Group-Fair and Efficient Machine Learning , 2017, FAT.
[24] Toon Calders,et al. Three naive Bayes approaches for discrimination-free classification , 2010, Data Mining and Knowledge Discovery.
[25] R. Hovorka. Continuous glucose monitoring and closed‐loop systems , 2006, Diabetic medicine : a journal of the British Diabetic Association.
[26] G. Steil,et al. Closed-loop insulin delivery-the path to physiological glucose control. , 2004, Advanced drug delivery reviews.
[27] Tjerk P. Straatsma,et al. NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations , 2010, Comput. Phys. Commun..
[28] Thomas G. Dietterich,et al. Allowing a wildfire to burn: estimating the effect on future fire suppression costs , 2013 .
[29] C. Toumazou,et al. Clinical Safety and Feasibility of the Advanced Bolus Calculator for Type 1 Diabetes Based on Case-Based Reasoning: A 6-Week Nonrandomized Single-Arm Pilot Study. , 2016, Diabetes technology & therapeutics.
[30] Arkadi Nemirovski,et al. On safe tractable approximations of chance constraints , 2012, Eur. J. Oper. Res..
[31] Marek Petrik,et al. Safe Policy Improvement by Minimizing Robust Baseline Regret , 2016, NIPS.
[32] Yinyu Ye,et al. Distributionally Robust Optimization Under Moment Uncertainty with Application to Data-Driven Problems , 2010, Oper. Res..
[33] Eyal Dassau,et al. Bolus calculator: a review of four "smart" insulin pumps. , 2008, Diabetes technology & therapeutics.
[34] Itay Gurvich,et al. Staffing Call Centers with Uncertain Demand Forecasts: A Chance-Constrained Optimization Approach , 2010, Manag. Sci..
[35] John Langford,et al. A Reductions Approach to Fair Classification , 2018, ICML.
[36] G. Steil,et al. Evaluation of the Effect of Gain on the Meal Response of an Automated Closed-Loop Insulin Delivery System , 2006, Diabetes.
[37] Nan Jiang,et al. Doubly Robust Off-policy Evaluation for Reinforcement Learning , 2015, ArXiv.
[38] Doina Precup,et al. Eligibility Traces for Off-Policy Policy Evaluation , 2000, ICML.
[39] M. Houck. A Chance Constrained Optimization Model for reservoir design and operation , 1979 .
[40] Aaron Roth,et al. Fairness in Learning: Classic and Contextual Bandits , 2016, NIPS.
[41] Tom Fawcett,et al. Robust Classification for Imprecise Environments , 2000, Machine Learning.
[42] Peter W. Glynn,et al. Likelihood robust optimization for data-driven problems , 2013, Computational Management Science.
[43] David S. Wishart,et al. Applications of Machine Learning in Cancer Prediction and Prognosis , 2006, Cancer informatics.
[44] M. Kearns,et al. Fairness in Criminal Justice Risk Assessments: The State of the Art , 2017, Sociological Methods & Research.
[45] Suresh Venkatasubramanian,et al. On the (im)possibility of fairness , 2016, ArXiv.
[46] Sang Won Suh,et al. Hypoglycemic neuronal death is triggered by glucose reperfusion and activation of neuronal NADPH oxidase. , 2007, The Journal of clinical investigation.
[47] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[48] Scott Kuindersma,et al. Variational Bayesian Optimization for Runtime Risk-Sensitive Control , 2012, Robotics: Science and Systems.
[49] Toon Calders,et al. Building Classifiers with Independency Constraints , 2009, 2009 IEEE International Conference on Data Mining Workshops.
[50] Sridhar Mahadevan,et al. Projected Natural Actor-Critic , 2013, NIPS.
[51] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..
[52] Toon Calders,et al. Discrimination Aware Decision Tree Learning , 2010, 2010 IEEE International Conference on Data Mining.
[53] Giuseppe Carlo Calafiore,et al. Uncertain convex programs: randomized solutions and confidence levels , 2005, Math. Program..
[54] Eyal Dassau,et al. Periodic zone-MPC with asymmetric costs for outpatient-ready safety of an artificial pancreas to treat type 1 diabetes , 2016, Autom..
[55] Wei Chu,et al. A contextual-bandit approach to personalized news article recommendation , 2010, WWW '10.
[56] Jeffrey M. Bradshaw,et al. Myths of Automation, Part 2: Some Very Human Consequences , 2014, IEEE Intelligent Systems.
[57] Marcello Restelli,et al. Multi-Objective Reinforcement Learning with Continuous Pareto Frontier Approximation , 2014, AAAI.
[58] Dale E. Seborg,et al. An Improved PID Switching Control Strategy for Type 1 Diabetes , 2008, IEEE Transactions on Biomedical Engineering.
[59] Larry D. Pyeatt,et al. Reinforcement Learning for Closed-Loop Propofol Anesthesia: A Human Volunteer Study , 2010, IAAI.
[60] Guy N. Rothblum,et al. Probably Approximately Metric-Fair Learning , 2018, ICML.
[61] Mark Kotanchek,et al. Pareto-Front Exploitation in Symbolic Regression , 2005 .
[62] Yuriy Brun,et al. Fairness testing: testing software for discrimination , 2017, ESEC/SIGSOFT FSE.
[63] J. Hammersley. MONTE CARLO METHODS FOR SOLVING MULTIVARIABLE PROBLEMS , 1960 .
[64] Selim Soylu,et al. Closed-loop control of blood glucose level in type-1 diabetics: A simulation study , 2013, 2013 8th International Conference on Electrical and Electronics Engineering (ELECO).
[65] Alexander Shapiro,et al. Convex Approximations of Chance Constrained Programs , 2006, SIAM J. Optim..
[66] Klaus M. Schmidt,et al. A Theory of Fairness, Competition, and Cooperation , 1999 .
[67] W. Hoeffding. Probability Inequalities for sums of Bounded Random Variables , 1963 .
[68] Peter Stone,et al. Importance Sampling Policy Evaluation with an Estimated Behavior Policy , 2018, ICML.
[69] Janet L. Kolodner,et al. Case-Based Reasoning , 1989, IJCAI 1989.
[70] L. Schaupp,et al. On‐line adaptive algorithm with glucose prediction capacity for subcutaneous closed loop control of glucose: evaluation under fasting conditions in patients with Type 1 diabetes , 2006, Diabetic medicine : a journal of the British Diabetic Association.
[71] Shlomo Zilberstein. Building Strong Semi-Autonomous Systems , 2015, AAAI.
[72] Steffen Udluft,et al. Safe exploration for reinforcement learning , 2008, ESANN.
[73] C. C. Palerm,et al. A Run-to-Run Control Strategy to Adjust Basal Insulin Infusion Rates in Type 1 Diabetes. , 2008, Journal of process control.
[74] Francis J. Doyle,et al. Run-to-run control of blood glucose concentrations for people with type 1 diabetes mellitus , 2006, IEEE Transactions on Biomedical Engineering.
[75] M. Shichiri,et al. Closed-loop subcutaneous insulin infusion algorithm with a short-acting insulin analog for long-term clinical application of a wearable artificial endocrine pancreas. , 1997, Frontiers of medical and biological engineering : the international journal of the Japan Society of Medical Electronics and Biological Engineering.
[76] G. Tang,et al. Indian Hedgehog: A Mechanotransduction Mediator in Condylar Cartilage , 2004, Journal of dental research.
[77] Stavroula G. Mougiakakou,et al. An Actor-Critic based controller for glucose regulation in type 1 diabetes , 2013, Comput. Methods Programs Biomed..
[78] Shie Mannor,et al. Probabilistic Goal Markov Decision Processes , 2011, IJCAI.
[79] B. Efron. Better Bootstrap Confidence Intervals , 1987 .
[80] Toon Calders,et al. Classifying without discriminating , 2009, 2009 2nd International Conference on Computer, Control and Communication.
[81] Qiang Liu,et al. Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation , 2018, NeurIPS.
[82] Percy Liang,et al. Fairness Without Demographics in Repeated Loss Minimization , 2018, ICML.
[83] R. Hovorka,et al. Nonlinear model predictive control of glucose concentration in subjects with type 1 diabetes. , 2004, Physiological measurement.
[84] Avi Feller,et al. Algorithmic Decision Making and the Cost of Fairness , 2017, KDD.
[85] Roman Hovorka,et al. Efficacy and safety of glucose control with Space GlucoseControl in the medical intensive care unit--an open clinical investigation. , 2012, Diabetes technology & therapeutics.
[86] Philip S. Thomas,et al. Using Options and Covariance Testing for Long Horizon Off-Policy Policy Evaluation , 2017, NIPS.
[87] Howard C. Zisser,et al. Clinical Update on Optimal Prandial Insulin Dosing Using a Refined Run-to-Run Control Algorithm , 2009, Journal of diabetes science and technology.
[88] Armin Falk,et al. A Theory of Reciprocity , 2001, Games Econ. Behav..
[89] E. Sachs,et al. On-line process optimization and control using the sequential design of experiments , 1990, Digest of Technical Papers.1990 Symposium on VLSI Technology.
[90] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[91] Wlodzislaw Duch,et al. Heterogeneous Forests of Decision Trees , 2002, ICANN.
[92] Jaime F. Fisac,et al. Reachability-based safe learning with Gaussian processes , 2014, 53rd IEEE Conference on Decision and Control.
[93] William Nick Street,et al. Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..
[94] Gerald Tesauro,et al. Temporal difference learning and TD-Gammon , 1995, CACM.
[95] Stefan Schaal,et al. A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..
[96] Vera Rich,et al. Ecological warning: Threat of floods from saline lake , 1986, Nature.
[97] M. Rabin. Published by: American , 2022 .
[98] A. Charnes,et al. Chance-Constrained Programming , 1959 .
[99] P. Kim. Data-Driven Discrimination at Work , 2017 .
[100] Jie Wang,et al. How Biased Is Your Model? Concentration Inequalities, Information and Model Bias , 2017, IEEE Transactions on Information Theory.
[101] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[102] Yongpei Guan,et al. A Chance-Constrained Two-Stage Stochastic Program for Unit Commitment With Uncertain Wind Power Output , 2012 .
[103] R. Jibson. Regression models for estimating coseismic landslide displacement , 2007 .
[104] Esther Rolf,et al. Delayed Impact of Fair Machine Learning , 2018, ICML.
[105] Roman Hovorka,et al. Closed-loop insulin delivery: from bench to clinical practice , 2011, Nature Reviews Endocrinology.
[106] Eyal Dassau,et al. A Run-to-Run Approach to Enhance Continuous Glucose Monitor Accuracy Based on Continuous Wear , 2015 .
[107] Dong Shen,et al. Optimization of insulin pump therapy based on high order run-to-run control scheme , 2015, Comput. Methods Programs Biomed..
[108] Yisong Yue,et al. Batch Policy Learning under Constraints , 2019, ICML.
[109] Nan Jiang,et al. Doubly Robust Off-policy Value Evaluation for Reinforcement Learning , 2015, ICML.
[110] A. Høstmark,et al. Postprandial light physical activity blunts the blood glucose increase. , 2006, Preventive medicine.
[111] Toniann Pitassi,et al. The reusable holdout: Preserving validity in adaptive data analysis , 2015, Science.
[112] Toon Calders,et al. Handling Conditional Discrimination , 2011, 2011 IEEE 11th International Conference on Data Mining.
[113] Benjamin Fish,et al. A Confidence-Based Approach for Balancing Fairness and Accuracy , 2016, SDM.
[114] Seiya Shimoda,et al. Comparison between closed-loop portal and peripheral venous insulin delivery systems for an artificial endocrine pancreas , 2004, Journal of Artificial Organs.
[115] C. Cobelli,et al. The UVA/PADOVA Type 1 Diabetes Simulator , 2014, Journal of diabetes science and technology.
[116] Massimiliano Pontil,et al. Empirical Bernstein Bounds and Sample-Variance Penalization , 2009, COLT.
[117] Sanjoy Dasgupta,et al. Off-Policy Temporal Difference Learning with Function Approximation , 2001, ICML.
[118] Alexandra Chouldechova,et al. Fair prediction with disparate impact: A study of bias in recidivism prediction instruments , 2016, Big Data.
[119] Philip S. Thomas,et al. Importance Sampling with Unequal Support , 2016, AAAI.
[120] Philip S. Thomas,et al. Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning , 2016, ICML.
[121] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[122] Francis J. Doyle,et al. Survey on iterative learning control, repetitive control, and run-to-run control , 2009 .
[123] Nathan Srebro,et al. Equality of Opportunity in Supervised Learning , 2016, NIPS.
[124] Carlos Eduardo Scheidegger,et al. Certifying and Removing Disparate Impact , 2014, KDD.
[125] Shie Mannor,et al. Optimizing the CVaR via Sampling , 2014, AAAI.
[126] Sham M. Kakade,et al. Optimizing Average Reward Using Discounted Rewards , 2001, COLT/EuroCOLT.
[127] Romain Laroche,et al. Safe Policy Improvement with Baseline Bootstrapping , 2017, ICML.
[128] S. Shankar Sastry,et al. Autonomous Helicopter Flight via Reinforcement Learning , 2003, NIPS.
[129] R.S. Parker,et al. A model-based algorithm for blood glucose control in Type I diabetic patients , 1999, IEEE Transactions on Biomedical Engineering.
[130] Latanya Sweeney,et al. Discrimination in online ad delivery , 2013, CACM.
[131] Todd,et al. Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning , 2002, Nature Medicine.
[132] Claudio Cobelli,et al. Toward a Run-to-Run Adaptive Artificial Pancreas: In Silico Results , 2018, IEEE Transactions on Biomedical Engineering.
[133] Simon J Fisher,et al. Diabetes increases brain damage caused by severe hypoglycemia. , 2009, American journal of physiology. Endocrinology and metabolism.
[134] Philip S. Thomas,et al. High-Confidence Off-Policy Evaluation , 2015, AAAI.
[135] H. Zisser,et al. Run-to-run control of meal-related insulin dosing. , 2005, Diabetes technology & therapeutics.
[136] Garud Iyengar,et al. Ambiguous chance constrained problems and robust optimization , 2006, Math. Program..
[137] Francis J. Doyle,et al. FLEXIBLE RUN–TO–RUN STRATEGY FOR INSULIN DOSING IN TYPE 1 DIABETIC SUBJECTS , 2006 .
[138] Andreas Krause,et al. Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics , 2016, Machine Learning.
[139] B.W. Bequette,et al. Model predictive control of blood glucose in type I diabetics using subcutaneous glucose measurements , 2002, Proceedings of the 2002 American Control Conference (IEEE Cat. No.CH37301).
[140] A. Folsom,et al. Coronary heart disease risk prediction in the Atherosclerosis Risk in Communities (ARIC) study. , 2003, Journal of clinical epidemiology.
[141] Toniann Pitassi,et al. Fairness through awareness , 2011, ITCS '12.
[142] Andy Liaw,et al. Classification and Regression by randomForest , 2007 .
[143] Marc D. Breton,et al. In silico testing and in vivo experiments with closed-loop control of blood glucose in diabetes , 2008 .
[144] Nuno C. Martins,et al. Control Design for Markov Chains under Safety Constraints: A Convex Approach , 2012, ArXiv.
[145] J. Pankow,et al. Prediction of coronary heart disease in middle-aged adults with diabetes. , 2003, Diabetes care.
[146] H. Kahn,et al. Methods of Reducing Sample Size in Monte Carlo Computations , 1953, Oper. Res..
[147] B. L. Miller,et al. Chance Constrained Programming with Joint Constraints , 1965 .
[148] S. Garg,et al. Improved glycemic control in intensively treated adult subjects with type 1 diabetes using insulin guidance software. , 2008, Diabetes technology & therapeutics.
[149] Andrew G. Barto,et al. Lyapunov Design for Safe Reinforcement Learning , 2003, J. Mach. Learn. Res..
[150] András Prékopa,et al. ON PROBABILISTIC CONSTRAINED PROGRAMMING , 2015 .
[151] Joaquin Quiñonero Candela,et al. Counterfactual reasoning and learning systems: the example of computational advertising , 2013, J. Mach. Learn. Res..
[152] Benjamin Van Roy,et al. On Constraint Sampling in the Linear Programming Approach to Approximate Dynamic Programming , 2004, Math. Oper. Res..
[153] Bert Huang,et al. New Fairness Metrics for Recommendation that Embrace Differences , 2017, ArXiv.
[154] Krishna P. Gummadi,et al. Fairness Constraints: Mechanisms for Fair Classification , 2015, AISTATS.
[155] Dimitrios I. Fotiadis,et al. Machine learning applications in cancer prognosis and prediction , 2014, Computational and structural biotechnology journal.
[156] E. Araki,et al. Strict glycemic control in diabetic dogs with closed-loop intraperitoneal insulin infusion algorithm designed for an artificial endocrine pancreas , 2003, Journal of Artificial Organs.
[157] Stuart Russell. Should We Fear Supersmart Robots? , 2016, Scientific American.
[158] Jon M. Kleinberg,et al. Inherent Trade-Offs in the Fair Determination of Risk Scores , 2016, ITCS.
[159] G. Dantzig,et al. Notes on Linear Programming: Part 1. The Generalized Simplex Method for Minimizing a Linear Form under Linear Inequality Restraints , 1954 .
[160] Claudio Cobelli,et al. Artificial Pancreas: Model Predictive Control Design from Clinical Experience , 2013, Journal of diabetes science and technology.
[161] A. Messac,et al. The normalized normal constraint method for generating the Pareto frontier , 2003 .
[162] Boris Kovatchev,et al. Future of Automated Insulin Delivery Systems. , 2017, Diabetes technology & therapeutics.
[163] Claudio Cobelli,et al. Automatic adaptation of basal therapy for Type 1 diabetic patients: A Run-to-Run approach , 2017, Biomed. Signal Process. Control..
[164] Jeffrey M. Perkel,et al. Programming: Pick up Python , 2015, Nature.
[165] Eyal Dassau,et al. Dynamic Insulin on Board: Incorporation of Circadian Insulin Sensitivity Variation , 2013, Journal of diabetes science and technology.
[166] Bruce A Buckingham,et al. Home use of a bihormonal bionic pancreas versus insulin pump therapy in adults with type 1 diabetes: a multicentre randomised crossover trial , 2017, The Lancet.
[167] Nikolaus Hansen,et al. The CMA Evolution Strategy: A Comparing Review , 2006, Towards a New Evolutionary Computation.
[168] Claudio Cobelli,et al. Postprandial Glucose Regulation via KNN Meal Classification in Type 1 Diabetes , 2019, IEEE Control Systems Letters.
[169] Philip S. Thomas,et al. High Confidence Policy Improvement , 2015, ICML.
[170] Gajendra P.S. Raghava,et al. Prediction of CTL epitopes using QM, SVM and ANN techniques. , 2004, Vaccine.
[171] D. Bertsimas,et al. Shortfall as a risk measure: properties, optimization and applications , 2004 .
[172] Howard C. Zisser,et al. Outcome Measures for Artificial Pancreas Clinical Trials: A Consensus Report , 2016, Diabetes Care.
[173] Jon D. McAuliffe,et al. Uniform, nonparametric, non-asymptotic confidence sequences , 2018 .
[174] Victoria E. Cotero,et al. Mini-review: Impact of recurrent hypoglycemia on cognitive and brain function , 2010, Physiology & Behavior.
[175] Matt J. Kusner,et al. Counterfactual Fairness , 2017, NIPS.
[176] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.
[177] Suresh Venkatasubramanian,et al. Auditing Black-box Models by Obscuring Features , 2016, ArXiv.
[178] Philip S. Thomas,et al. Personalized Ad Recommendation Systems for Life-Time Value Optimization with Guarantees , 2015, IJCAI.
[179] Francis J. Doyle,et al. A run‐to‐run framework for prandial insulin dosing: handling real‐life uncertainty , 2007 .
[180] Bernhard E. Boser,et al. A training algorithm for optimal margin classifiers , 1992, COLT '92.