Predicting future AI failures from historic examples

Purpose
The purpose of this paper is to explain to readers how intelligent systems can fail and how artificial intelligence (AI) safety differs from cybersecurity. The goal of cybersecurity is to reduce the number of successful attacks on a system; the goal of AI safety is to ensure that no attack succeeds in bypassing the safety mechanisms. Unfortunately, such a level of performance is unachievable: every security system will eventually fail, and there is no such thing as a 100 per cent secure system.

Design/methodology/approach
AI safety can be improved by building on ideas developed by cybersecurity experts. For narrow AI, safety failures are at the same, moderate level of criticality as failures in cybersecurity; for general AI, however, failures have a fundamentally different impact. A single failure of a superintelligent system may cause a catastrophic event with no chance of recovery.

Findings
The authors present and analyze reported failures of artificially intelligent systems and extrapolate their analysis to future AIs. They suggest that both the frequency and the seriousness of future AI failures will steadily increase.

Originality/value
This is a first attempt to assemble a public data set of AI failures, which is extremely valuable to AI safety researchers.
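To illustrate the claim that every security system will eventually fail, the following minimal Python sketch (not from the paper; the per-attempt breach probability and the independence assumption are hypothetical) shows how even a tiny per-attempt chance of a successful attack compounds toward near-certainty over enough attempts, which is why "zero successful attacks" is an unachievable standard.

```python
# Illustrative sketch only: assumes independent attack attempts with a fixed,
# hypothetical per-attempt breach probability.

def cumulative_failure_probability(p_per_attempt: float, attempts: int) -> float:
    """Probability of at least one successful attack after `attempts` tries."""
    return 1.0 - (1.0 - p_per_attempt) ** attempts

if __name__ == "__main__":
    p = 1e-6  # hypothetical one-in-a-million chance per attempt
    for n in (1_000, 1_000_000, 10_000_000):
        print(f"{n:>10,} attempts -> P(at least one breach) = "
              f"{cumulative_failure_probability(p, n):.4f}")
```

Run as written, the probability of at least one breach climbs from roughly 0.001 at a thousand attempts to about 0.63 at a million and over 0.9999 at ten million, even though each individual attempt is extremely unlikely to succeed.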
