Mental time-travel, semantic flexibility, and A.I. ethics

This article argues that existing approaches to programming ethical AI fail to resolve a serious moral-semantic trilemma: they generate interpretations of ethical requirements that are too semantically strict, too semantically flexible, or overly unpredictable. It then illustrates the trilemma using a recently proposed ‘general ethical dilemma analyzer,’ GenEth. Finally, it uses empirical evidence to argue that human beings resolve the semantic trilemma through general cognitive and motivational processes involving ‘mental time-travel,’ in which we simulate different possible pasts and futures. I show how the psychology of mental time-travel leads us to resolve the trilemma through a six-step process of interpersonal negotiation and renegotiation. I conclude by showing how comparative advantages in processing power would plausibly lead AI to use similar processes to resolve the trilemma more reliably than we do, and so to make better moral-semantic choices than humans do by our very own lights.
