The Singularity and Machine Ethics

Many researchers have argued that a self-improving artificial intelligence (AI) could become so vastly more powerful than humans that we would not be able to stop it from achieving its goals. If so, and if the AI’s goals differ from ours, then this could be disastrous for humans. One proposed solution is to program the AI’s goal system to want what we want before the AI self-improves beyond our capacity to control it. Unfortunately, it is difficult to specify what we want. After clarifying what we mean by “intelligence”, we offer a series of “intuition pumps” from the field of moral philosophy for our conclusion that human values are complex and difficult to specify. We then survey the evidence from the psychology of motivation, moral psychology, and neuroeconomics that supports our position. We conclude by recommending ideal preference theories of value as a promising approach for developing a machine ethics suitable for navigating an intelligence explosion or “technological singularity”.
