Pitfalls of machine learning for tail events in high risk environments

Most of today’s Machine Learning (ML) methods and implementations are based on correlations, in the sense of a statistical relationship between a set of inputs and the output(s) under investigation. The relationship might be obscure to the human mind, but through the use of ML, mathematics and statistics makes it seemingly apparent. However, to base safety critical decisions on such methods suffer from the same pitfalls as decisions based on any other correlation metric that disregards causality. Causality is key to ensure that applied mitigation tactics will actually affect the outcome in the desired way. This paper reviews the current situation and challenges of applying ML in high risk environments. It further outlines how phenomenological knowledge, together with an uncertainty-based risk perspective can be incorporated to alleviate the missing causality considerations in current practice. do what they do might become a serious problem as computers become more responsible for making important decisions”. The problem of accidents in ML systems is discussed in detail in a paper led by Google Brain researchers Amodei, Olah, Steinhardt, Christiano, Schulman, & Mané (2016). Here the authors motivate the increasing need to address these safety problems by some general trends, one of which relates to the increasing autonomy in AI systems. “Systems that simply output a recommendation to human users, such as speech systems, typically have relatively limited potential to cause harm. By contrast, systems that exert direct control over the world, such as machines controlling industrial processes, can cause harms in a way that humans cannot necessarily correct or oversee.” In this paper we turn our attention towards the introduction of ML in design and operation of complex engineering systems in high risk environments. These are systems where today’s methods for assessing risk relies heavily on understanding the underlying physical processes and our ability to model these. This is in contrast to e.g. mass production of components where more data-driven, statistically founded methods can be applied to estimate rates of failure. We acknowledge the need for ML technologies to address increasing complexity of engineering systems, and the challenges that follow in quantifying uncertainty and risk based on detailed numerical simulation. However, this type of application reduces the tolerance for erroneous model behavior. Most of the methods applied in ML are based

[1]  LIII , 2018, Out of the Shadow.

[2]  K. E. Knutsen,et al.  Towards a framework for assurance of autonomous navigation systems in the maritime industry , 2018 .

[3]  Yang Lu,et al.  Industry 4.0: A survey on technologies, applications and open research issues , 2017, J. Ind. Inf. Integr..

[4]  Hassan Maatouk,et al.  Gaussian Process Emulators for Computer Experiments with Inequality Constraints , 2016, Mathematical Geosciences.

[5]  John Schulman,et al.  Concrete Problems in AI Safety , 2016, ArXiv.

[6]  Gómez,et al.  A vision of industry 4 . 0 from an artificial intelligence point of view , 2016 .

[7]  T. Sullivan Introduction to Uncertainty Quantification , 2015 .

[8]  Carolo Friederico Gauss Theoria Motus Corporum Coelestium in Sectionibus Conicis Solem Ambientium , 2014 .

[9]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.

[10]  J. Pearl The Mathematics of Causal Relations , 2010 .

[11]  Ting Yu Incorporating prior domain knowledge into inductive machine learning : its implementation in contemporary capital markets , 2007 .

[12]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[13]  Vladimir Estivill-Castro,et al.  Why so many clustering algorithms: a position paper , 2002, SKDD.

[14]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001 .

[15]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[16]  Hid N. Grouni,et al.  Methods of structural safety , 1986 .

[17]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[18]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[19]  Robin Sibson,et al.  SLINK: An Optimally Efficient Algorithm for the Single-Link Cluster Method , 1973, Comput. J..

[20]  Adrien-Marie Legendre,et al.  Nouvelles méthodes pour la détermination des orbites des comètes , 1970 .

[21]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[22]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[23]  J. Morgan,et al.  Problems in the Analysis of Survey Data, and a Proposal , 1963 .

[24]  J. Knott The organization of behavior: A neuropsychological theory , 1951 .

[25]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[26]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[27]  P. Verhulst Notice sur la loi que la population pursuit dans son accroissement , 1838 .