Preventing undesirable behavior of intelligent machines

Making well-behaved algorithms Machine learning algorithms are being used in an ever-increasing number of applications, and many of these applications affect quality of life. Yet such algorithms often exhibit undesirable behavior, from various types of bias to causing financial loss or delaying medical diagnoses. In standard machine learning approaches, the burden of avoiding this harmful behavior is placed on the user of the algorithm, who most often is not a computer scientist. Thomas et al. introduce a general framework for algorithm design in which this burden is shifted from the user to the designer of the algorithm. The researchers illustrate the benefits of their approach using examples in gender fairness and diabetes management. Science, this issue p. 999 A machine learning algorithm design framework shifts the burden of avoiding undesirable behavior from user to designer. Intelligent machines using machine learning algorithms are ubiquitous, ranging from simple data analysis and pattern recognition tools to complex systems that achieve superhuman performance on various tasks. Ensuring that they do not exhibit undesirable behavior—that they do not, for example, cause harm to humans—is therefore a pressing problem. We propose a general and flexible framework for designing machine learning algorithms. This framework simplifies the problem of specifying and regulating undesirable behavior. To show the viability of this framework, we used it to create machine learning algorithms that precluded the dangerous behavior caused by standard machine learning algorithms in our experiments. Our framework for designing machine learning algorithms simplifies the safe and responsible application of machine learning.

[1]  Sean P. Meyn,et al.  An analysis of reinforcement learning with function approximation , 2008, ICML '08.

[2]  Michael Carl Tschantz,et al.  Automated Experiments on Ad Privacy Settings , 2014, Proc. Priv. Enhancing Technol..

[3]  Howard C. Zisser,et al.  Prandial Insulin Dosing Using Run-to-Run Control , 2007, Diabetes Care.

[4]  Datta N. Godbole,et al.  Addressing Multiobjective Control: Safety and Performance through Constrained Optimization , 2001, HSCC.

[5]  Yair Zick,et al.  Algorithmic Transparency via Quantitative Input Influence , 2017 .

[6]  Scott Niekum,et al.  Toward Probabilistic Safety Bounds for Robot Learning from Demonstration , 2017, AAAI Fall Symposia.

[7]  Jun Sakuma,et al.  Fairness-aware Learning through Regularization Approach , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[8]  Javier García,et al.  A comprehensive survey on safe reinforcement learning , 2015, J. Mach. Learn. Res..

[9]  Darinka Dentcheva,et al.  Concavity and efficient points of discrete distributions in probabilistic programming , 2000, Math. Program..

[10]  N. Shah,et al.  Increased Mortality of Patients With Diabetes Reporting Severe Hypoglycemia , 2012, Diabetes Care.

[11]  Sean A. Munson,et al.  Unequal Representation and Gender Stereotypes in Image Search Results for Occupations , 2015, CHI.

[12]  Franco Turini,et al.  k-NN as an implementation of situation testing for discrimination discovery and prevention , 2011, KDD.

[13]  Dean Pomerleau,et al.  ALVINN, an autonomous land vehicle in a neural network , 2015 .

[14]  Alexandre M. Bayen,et al.  A time-dependent Hamilton-Jacobi formulation of reachable sets for continuous dynamic games , 2005, IEEE Transactions on Automatic Control.

[15]  Jan Peters,et al.  Learning motor primitives for robotics , 2009, 2009 IEEE International Conference on Robotics and Automation.

[16]  Daniel Kuhn,et al.  Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations , 2015, Mathematical Programming.

[17]  Eyal Dassau,et al.  Safety Constraints in an Artificial Pancreatic β Cell: An Implementation of Model Predictive Control with Insulin on Board , 2009, Journal of diabetes science and technology.

[18]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[19]  Vishal Gupta,et al.  Data-driven robust optimization , 2013, Math. Program..

[20]  George Konidaris,et al.  Value Function Approximation in Reinforcement Learning Using the Fourier Basis , 2011, AAAI.

[21]  Mohammad Ghavamzadeh,et al.  Algorithms for CVaR Optimization in MDPs , 2014, NIPS.

[22]  Suchi Saria,et al.  A $3 Trillion Challenge to Computational Scientists: Transforming Healthcare Delivery , 2014, IEEE Intelligent Systems.

[23]  Adam Tauman Kalai,et al.  Decoupled Classifiers for Group-Fair and Efficient Machine Learning , 2017, FAT.

[24]  Toon Calders,et al.  Three naive Bayes approaches for discrimination-free classification , 2010, Data Mining and Knowledge Discovery.

[25]  R. Hovorka Continuous glucose monitoring and closed‐loop systems , 2006, Diabetic medicine : a journal of the British Diabetic Association.

[26]  G. Steil,et al.  Closed-loop insulin delivery-the path to physiological glucose control. , 2004, Advanced drug delivery reviews.

[27]  Tjerk P. Straatsma,et al.  NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations , 2010, Comput. Phys. Commun..

[28]  Thomas G. Dietterich,et al.  Allowing a wildfire to burn: estimating the effect on future fire suppression costs , 2013 .

[29]  C. Toumazou,et al.  Clinical Safety and Feasibility of the Advanced Bolus Calculator for Type 1 Diabetes Based on Case-Based Reasoning: A 6-Week Nonrandomized Single-Arm Pilot Study. , 2016, Diabetes technology & therapeutics.

[30]  Arkadi Nemirovski,et al.  On safe tractable approximations of chance constraints , 2012, Eur. J. Oper. Res..

[31]  Marek Petrik,et al.  Safe Policy Improvement by Minimizing Robust Baseline Regret , 2016, NIPS.

[32]  Yinyu Ye,et al.  Distributionally Robust Optimization Under Moment Uncertainty with Application to Data-Driven Problems , 2010, Oper. Res..

[33]  Eyal Dassau,et al.  Bolus calculator: a review of four "smart" insulin pumps. , 2008, Diabetes technology & therapeutics.

[34]  Itay Gurvich,et al.  Staffing Call Centers with Uncertain Demand Forecasts: A Chance-Constrained Optimization Approach , 2010, Manag. Sci..

[35]  John Langford,et al.  A Reductions Approach to Fair Classification , 2018, ICML.

[36]  G. Steil,et al.  Evaluation of the Effect of Gain on the Meal Response of an Automated Closed-Loop Insulin Delivery System , 2006, Diabetes.

[37]  Nan Jiang,et al.  Doubly Robust Off-policy Evaluation for Reinforcement Learning , 2015, ArXiv.

[38]  Doina Precup,et al.  Eligibility Traces for Off-Policy Policy Evaluation , 2000, ICML.

[39]  M. Houck A Chance Constrained Optimization Model for reservoir design and operation , 1979 .

[40]  Aaron Roth,et al.  Fairness in Learning: Classic and Contextual Bandits , 2016, NIPS.

[41]  Tom Fawcett,et al.  Robust Classification for Imprecise Environments , 2000, Machine Learning.

[42]  Peter W. Glynn,et al.  Likelihood robust optimization for data-driven problems , 2013, Computational Management Science.

[43]  David S. Wishart,et al.  Applications of Machine Learning in Cancer Prediction and Prognosis , 2006, Cancer informatics.

[44]  M. Kearns,et al.  Fairness in Criminal Justice Risk Assessments: The State of the Art , 2017, Sociological Methods & Research.

[45]  Suresh Venkatasubramanian,et al.  On the (im)possibility of fairness , 2016, ArXiv.

[46]  Sang Won Suh,et al.  Hypoglycemic neuronal death is triggered by glucose reperfusion and activation of neuronal NADPH oxidase. , 2007, The Journal of clinical investigation.

[47]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[48]  Scott Kuindersma,et al.  Variational Bayesian Optimization for Runtime Risk-Sensitive Control , 2012, Robotics: Science and Systems.

[49]  Toon Calders,et al.  Building Classifiers with Independency Constraints , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[50]  Sridhar Mahadevan,et al.  Projected Natural Actor-Critic , 2013, NIPS.

[51]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[52]  Toon Calders,et al.  Discrimination Aware Decision Tree Learning , 2010, 2010 IEEE International Conference on Data Mining.

[53]  Giuseppe Carlo Calafiore,et al.  Uncertain convex programs: randomized solutions and confidence levels , 2005, Math. Program..

[54]  Eyal Dassau,et al.  Periodic zone-MPC with asymmetric costs for outpatient-ready safety of an artificial pancreas to treat type 1 diabetes , 2016, Autom..

[55]  Wei Chu,et al.  A contextual-bandit approach to personalized news article recommendation , 2010, WWW '10.

[56]  Jeffrey M. Bradshaw,et al.  Myths of Automation, Part 2: Some Very Human Consequences , 2014, IEEE Intelligent Systems.

[57]  Marcello Restelli,et al.  Multi-Objective Reinforcement Learning with Continuous Pareto Frontier Approximation , 2014, AAAI.

[58]  Dale E. Seborg,et al.  An Improved PID Switching Control Strategy for Type 1 Diabetes , 2008, IEEE Transactions on Biomedical Engineering.

[59]  Larry D. Pyeatt,et al.  Reinforcement Learning for Closed-Loop Propofol Anesthesia: A Human Volunteer Study , 2010, IAAI.

[60]  Guy N. Rothblum,et al.  Probably Approximately Metric-Fair Learning , 2018, ICML.

[61]  Mark Kotanchek,et al.  Pareto-Front Exploitation in Symbolic Regression , 2005 .

[62]  Yuriy Brun,et al.  Fairness testing: testing software for discrimination , 2017, ESEC/SIGSOFT FSE.

[63]  J. Hammersley MONTE CARLO METHODS FOR SOLVING MULTIVARIABLE PROBLEMS , 1960 .

[64]  Selim Soylu,et al.  Closed-loop control of blood glucose level in type-1 diabetics: A simulation study , 2013, 2013 8th International Conference on Electrical and Electronics Engineering (ELECO).

[65]  Alexander Shapiro,et al.  Convex Approximations of Chance Constrained Programs , 2006, SIAM J. Optim..

[66]  Klaus M. Schmidt,et al.  A Theory of Fairness, Competition, and Cooperation , 1999 .

[67]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[68]  Peter Stone,et al.  Importance Sampling Policy Evaluation with an Estimated Behavior Policy , 2018, ICML.

[69]  Janet L. Kolodner,et al.  Case-Based Reasoning , 1989, IJCAI 1989.

[70]  L. Schaupp,et al.  On‐line adaptive algorithm with glucose prediction capacity for subcutaneous closed loop control of glucose: evaluation under fasting conditions in patients with Type 1 diabetes , 2006, Diabetic medicine : a journal of the British Diabetic Association.

[71]  Shlomo Zilberstein Building Strong Semi-Autonomous Systems , 2015, AAAI.

[72]  Steffen Udluft,et al.  Safe exploration for reinforcement learning , 2008, ESANN.

[73]  C. C. Palerm,et al.  A Run-to-Run Control Strategy to Adjust Basal Insulin Infusion Rates in Type 1 Diabetes. , 2008, Journal of process control.

[74]  Francis J. Doyle,et al.  Run-to-run control of blood glucose concentrations for people with type 1 diabetes mellitus , 2006, IEEE Transactions on Biomedical Engineering.

[75]  M. Shichiri,et al.  Closed-loop subcutaneous insulin infusion algorithm with a short-acting insulin analog for long-term clinical application of a wearable artificial endocrine pancreas. , 1997, Frontiers of medical and biological engineering : the international journal of the Japan Society of Medical Electronics and Biological Engineering.

[76]  G. Tang,et al.  Indian Hedgehog: A Mechanotransduction Mediator in Condylar Cartilage , 2004, Journal of dental research.

[77]  Stavroula G. Mougiakakou,et al.  An Actor-Critic based controller for glucose regulation in type 1 diabetes , 2013, Comput. Methods Programs Biomed..

[78]  Shie Mannor,et al.  Probabilistic Goal Markov Decision Processes , 2011, IJCAI.

[79]  B. Efron Better Bootstrap Confidence Intervals , 1987 .

[80]  Toon Calders,et al.  Classifying without discriminating , 2009, 2009 2nd International Conference on Computer, Control and Communication.

[81]  Qiang Liu,et al.  Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation , 2018, NeurIPS.

[82]  Percy Liang,et al.  Fairness Without Demographics in Repeated Loss Minimization , 2018, ICML.

[83]  R. Hovorka,et al.  Nonlinear model predictive control of glucose concentration in subjects with type 1 diabetes. , 2004, Physiological measurement.

[84]  Avi Feller,et al.  Algorithmic Decision Making and the Cost of Fairness , 2017, KDD.

[85]  Roman Hovorka,et al.  Efficacy and safety of glucose control with Space GlucoseControl in the medical intensive care unit--an open clinical investigation. , 2012, Diabetes technology & therapeutics.

[86]  Philip S. Thomas,et al.  Using Options and Covariance Testing for Long Horizon Off-Policy Policy Evaluation , 2017, NIPS.

[87]  Howard C. Zisser,et al.  Clinical Update on Optimal Prandial Insulin Dosing Using a Refined Run-to-Run Control Algorithm , 2009, Journal of diabetes science and technology.

[88]  Armin Falk,et al.  A Theory of Reciprocity , 2001, Games Econ. Behav..

[89]  E. Sachs,et al.  On-line process optimization and control using the sequential design of experiments , 1990, Digest of Technical Papers.1990 Symposium on VLSI Technology.

[90]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[91]  Wlodzislaw Duch,et al.  Heterogeneous Forests of Decision Trees , 2002, ICANN.

[92]  Jaime F. Fisac,et al.  Reachability-based safe learning with Gaussian processes , 2014, 53rd IEEE Conference on Decision and Control.

[93]  William Nick Street,et al.  Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..

[94]  Gerald Tesauro,et al.  Temporal difference learning and TD-Gammon , 1995, CACM.

[95]  Stefan Schaal,et al.  A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[96]  Vera Rich,et al.  Ecological warning: Threat of floods from saline lake , 1986, Nature.

[97]  M. Rabin Published by: American , 2022 .

[98]  A. Charnes,et al.  Chance-Constrained Programming , 1959 .

[99]  P. Kim Data-Driven Discrimination at Work , 2017 .

[100]  Jie Wang,et al.  How Biased Is Your Model? Concentration Inequalities, Information and Model Bias , 2017, IEEE Transactions on Information Theory.

[101]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[102]  Yongpei Guan,et al.  A Chance-Constrained Two-Stage Stochastic Program for Unit Commitment With Uncertain Wind Power Output , 2012 .

[103]  R. Jibson Regression models for estimating coseismic landslide displacement , 2007 .

[104]  Esther Rolf,et al.  Delayed Impact of Fair Machine Learning , 2018, ICML.

[105]  Roman Hovorka,et al.  Closed-loop insulin delivery: from bench to clinical practice , 2011, Nature Reviews Endocrinology.

[106]  Eyal Dassau,et al.  A Run-to-Run Approach to Enhance Continuous Glucose Monitor Accuracy Based on Continuous Wear , 2015 .

[107]  Dong Shen,et al.  Optimization of insulin pump therapy based on high order run-to-run control scheme , 2015, Comput. Methods Programs Biomed..

[108]  Yisong Yue,et al.  Batch Policy Learning under Constraints , 2019, ICML.

[109]  Nan Jiang,et al.  Doubly Robust Off-policy Value Evaluation for Reinforcement Learning , 2015, ICML.

[110]  A. Høstmark,et al.  Postprandial light physical activity blunts the blood glucose increase. , 2006, Preventive medicine.

[111]  Toniann Pitassi,et al.  The reusable holdout: Preserving validity in adaptive data analysis , 2015, Science.

[112]  Toon Calders,et al.  Handling Conditional Discrimination , 2011, 2011 IEEE 11th International Conference on Data Mining.

[113]  Benjamin Fish,et al.  A Confidence-Based Approach for Balancing Fairness and Accuracy , 2016, SDM.

[114]  Seiya Shimoda,et al.  Comparison between closed-loop portal and peripheral venous insulin delivery systems for an artificial endocrine pancreas , 2004, Journal of Artificial Organs.

[115]  C. Cobelli,et al.  The UVA/PADOVA Type 1 Diabetes Simulator , 2014, Journal of diabetes science and technology.

[116]  Massimiliano Pontil,et al.  Empirical Bernstein Bounds and Sample-Variance Penalization , 2009, COLT.

[117]  Sanjoy Dasgupta,et al.  Off-Policy Temporal Difference Learning with Function Approximation , 2001, ICML.

[118]  Alexandra Chouldechova,et al.  Fair prediction with disparate impact: A study of bias in recidivism prediction instruments , 2016, Big Data.

[119]  Philip S. Thomas,et al.  Importance Sampling with Unequal Support , 2016, AAAI.

[120]  Philip S. Thomas,et al.  Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning , 2016, ICML.

[121]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[122]  Francis J. Doyle,et al.  Survey on iterative learning control, repetitive control, and run-to-run control , 2009 .

[123]  Nathan Srebro,et al.  Equality of Opportunity in Supervised Learning , 2016, NIPS.

[124]  Carlos Eduardo Scheidegger,et al.  Certifying and Removing Disparate Impact , 2014, KDD.

[125]  Shie Mannor,et al.  Optimizing the CVaR via Sampling , 2014, AAAI.

[126]  Sham M. Kakade,et al.  Optimizing Average Reward Using Discounted Rewards , 2001, COLT/EuroCOLT.

[127]  Romain Laroche,et al.  Safe Policy Improvement with Baseline Bootstrapping , 2017, ICML.

[128]  S. Shankar Sastry,et al.  Autonomous Helicopter Flight via Reinforcement Learning , 2003, NIPS.

[129]  R.S. Parker,et al.  A model-based algorithm for blood glucose control in Type I diabetic patients , 1999, IEEE Transactions on Biomedical Engineering.

[130]  Latanya Sweeney,et al.  Discrimination in online ad delivery , 2013, CACM.

[131]  Todd,et al.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning , 2002, Nature Medicine.

[132]  Claudio Cobelli,et al.  Toward a Run-to-Run Adaptive Artificial Pancreas: In Silico Results , 2018, IEEE Transactions on Biomedical Engineering.

[133]  Simon J Fisher,et al.  Diabetes increases brain damage caused by severe hypoglycemia. , 2009, American journal of physiology. Endocrinology and metabolism.

[134]  Philip S. Thomas,et al.  High-Confidence Off-Policy Evaluation , 2015, AAAI.

[135]  H. Zisser,et al.  Run-to-run control of meal-related insulin dosing. , 2005, Diabetes technology & therapeutics.

[136]  Garud Iyengar,et al.  Ambiguous chance constrained problems and robust optimization , 2006, Math. Program..

[137]  Francis J. Doyle,et al.  FLEXIBLE RUN–TO–RUN STRATEGY FOR INSULIN DOSING IN TYPE 1 DIABETIC SUBJECTS , 2006 .

[138]  Andreas Krause,et al.  Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics , 2016, Machine Learning.

[139]  B.W. Bequette,et al.  Model predictive control of blood glucose in type I diabetics using subcutaneous glucose measurements , 2002, Proceedings of the 2002 American Control Conference (IEEE Cat. No.CH37301).

[140]  A. Folsom,et al.  Coronary heart disease risk prediction in the Atherosclerosis Risk in Communities (ARIC) study. , 2003, Journal of clinical epidemiology.

[141]  Toniann Pitassi,et al.  Fairness through awareness , 2011, ITCS '12.

[142]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[143]  Marc D. Breton,et al.  In silico testing and in vivo experiments with closed-loop control of blood glucose in diabetes , 2008 .

[144]  Nuno C. Martins,et al.  Control Design for Markov Chains under Safety Constraints: A Convex Approach , 2012, ArXiv.

[145]  J. Pankow,et al.  Prediction of coronary heart disease in middle-aged adults with diabetes. , 2003, Diabetes care.

[146]  H. Kahn,et al.  Methods of Reducing Sample Size in Monte Carlo Computations , 1953, Oper. Res..

[147]  B. L. Miller,et al.  Chance Constrained Programming with Joint Constraints , 1965 .

[148]  S. Garg,et al.  Improved glycemic control in intensively treated adult subjects with type 1 diabetes using insulin guidance software. , 2008, Diabetes technology & therapeutics.

[149]  Andrew G. Barto,et al.  Lyapunov Design for Safe Reinforcement Learning , 2003, J. Mach. Learn. Res..

[150]  András Prékopa,et al.  ON PROBABILISTIC CONSTRAINED PROGRAMMING , 2015 .

[151]  Joaquin Quiñonero Candela,et al.  Counterfactual reasoning and learning systems: the example of computational advertising , 2013, J. Mach. Learn. Res..

[152]  Benjamin Van Roy,et al.  On Constraint Sampling in the Linear Programming Approach to Approximate Dynamic Programming , 2004, Math. Oper. Res..

[153]  Bert Huang,et al.  New Fairness Metrics for Recommendation that Embrace Differences , 2017, ArXiv.

[154]  Krishna P. Gummadi,et al.  Fairness Constraints: Mechanisms for Fair Classification , 2015, AISTATS.

[155]  Dimitrios I. Fotiadis,et al.  Machine learning applications in cancer prognosis and prediction , 2014, Computational and structural biotechnology journal.

[156]  E. Araki,et al.  Strict glycemic control in diabetic dogs with closed-loop intraperitoneal insulin infusion algorithm designed for an artificial endocrine pancreas , 2003, Journal of Artificial Organs.

[157]  Stuart Russell Should We Fear Supersmart Robots? , 2016, Scientific American.

[158]  Jon M. Kleinberg,et al.  Inherent Trade-Offs in the Fair Determination of Risk Scores , 2016, ITCS.

[159]  G. Dantzig,et al.  Notes on Linear Programming: Part 1. The Generalized Simplex Method for Minimizing a Linear Form under Linear Inequality Restraints , 1954 .

[160]  Claudio Cobelli,et al.  Artificial Pancreas: Model Predictive Control Design from Clinical Experience , 2013, Journal of diabetes science and technology.

[161]  A. Messac,et al.  The normalized normal constraint method for generating the Pareto frontier , 2003 .

[162]  Boris Kovatchev,et al.  Future of Automated Insulin Delivery Systems. , 2017, Diabetes technology & therapeutics.

[163]  Claudio Cobelli,et al.  Automatic adaptation of basal therapy for Type 1 diabetic patients: A Run-to-Run approach , 2017, Biomed. Signal Process. Control..

[164]  Jeffrey M. Perkel,et al.  Programming: Pick up Python , 2015, Nature.

[165]  Eyal Dassau,et al.  Dynamic Insulin on Board: Incorporation of Circadian Insulin Sensitivity Variation , 2013, Journal of diabetes science and technology.

[166]  Bruce A Buckingham,et al.  Home use of a bihormonal bionic pancreas versus insulin pump therapy in adults with type 1 diabetes: a multicentre randomised crossover trial , 2017, The Lancet.

[167]  Nikolaus Hansen,et al.  The CMA Evolution Strategy: A Comparing Review , 2006, Towards a New Evolutionary Computation.

[168]  Claudio Cobelli,et al.  Postprandial Glucose Regulation via KNN Meal Classification in Type 1 Diabetes , 2019, IEEE Control Systems Letters.

[169]  Philip S. Thomas,et al.  High Confidence Policy Improvement , 2015, ICML.

[170]  Gajendra P.S. Raghava,et al.  Prediction of CTL epitopes using QM, SVM and ANN techniques. , 2004, Vaccine.

[171]  D. Bertsimas,et al.  Shortfall as a risk measure: properties, optimization and applications , 2004 .

[172]  Howard C. Zisser,et al.  Outcome Measures for Artificial Pancreas Clinical Trials: A Consensus Report , 2016, Diabetes Care.

[173]  Jon D. McAuliffe,et al.  Uniform, nonparametric, non-asymptotic confidence sequences , 2018 .

[174]  Victoria E. Cotero,et al.  Mini-review: Impact of recurrent hypoglycemia on cognitive and brain function , 2010, Physiology & Behavior.

[175]  Matt J. Kusner,et al.  Counterfactual Fairness , 2017, NIPS.

[176]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[177]  Suresh Venkatasubramanian,et al.  Auditing Black-box Models by Obscuring Features , 2016, ArXiv.

[178]  Philip S. Thomas,et al.  Personalized Ad Recommendation Systems for Life-Time Value Optimization with Guarantees , 2015, IJCAI.

[179]  Francis J. Doyle,et al.  A run‐to‐run framework for prandial insulin dosing: handling real‐life uncertainty , 2007 .

[180]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.