The First Law of Robotics (A Call to Arms)

Even before the advent of Artificial Intelligence, science fiction writer Isaac Asimov recognized that an agent must place the protection of humans from harm at a higher priority than obeying human orders. Inspired by Asimov, we pose the following fundamental questions: (1) How should one formalize the rich, but informal, notion of "harm"? (2) How can an agent avoid performing harmful actions, and do so in a computationally tractable manner? (3) How should an agent resolve conflicts between its goals and the need to avoid harm? (4) When should an agent prevent a human from harming herself? While we address some of these questions in technical detail, the primary goal of this paper is to focus attention on Asimov's concern: society will reject autonomous agents unless we have some credible means of making them safe!
