From social interaction to ethical AI: a developmental roadmap

AI and robot ethics have recently gained a lot of attention because adaptive machines are increasingly involved in ethically sensitive scenarios and cause incidents of public outcry. Much of the debate has been focused on achieving highest moral standards in handling ethical dilemmas on which not even humans can agree, which indicates that the wrong questions are being asked. We suggest to address this ethics debate strictly through the lens of what behavior seems socially acceptable, rather than idealistically ethical. Learning such behavior puts the debate into the very heart of developmental robotics. This paper poses a roadmap of computational and experimental questions to address the development of socially acceptable machines. We emphasize the need for social reward mechanisms and learning architectures that integrate these while reaching beyond limitations of plain reinforcement-learning agents. We suggest to use the metaphor of “needs” to bridge rewards and higher level abstractions such as goals for both communication and action generation in a social context. We then suggest a series of experimental questions and possible platforms and paradigms to guide future research in the area.

[1]  David Matsumoto,et al.  Cultural Similarities and Differences in Emblematic Gestures , 2012, Journal of Nonverbal Behavior.

[2]  Luc Steels,et al.  The synthetic modeling of language origins , 1997 .

[3]  N. Emery,et al.  The eyes have it: the neuroethology, function and evolution of social gaze , 2000, Neuroscience & Biobehavioral Reviews.

[4]  Yukie Nagai,et al.  Does Disturbance Discourage People from Communicating with a Robot? , 2007, RO-MAN 2007 - The 16th IEEE International Symposium on Robot and Human Interactive Communication.

[5]  Andrea Lockerd Thomaz,et al.  Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance , 2006, AAAI.

[6]  Katrin S. Lohan A model of contingency detection to spot tutoring behavior and respond to ostensive cues in human-robot-interaction , 2011 .

[7]  John Deigh,et al.  An Introduction to Ethics , 2010 .

[8]  Jochen J. Steil,et al.  Learning the rules of a game: Neural conditioning in human-robot interaction with delayed rewards , 2013, 2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[9]  Noah J. Goldstein,et al.  The Constructive, Destructive, and Reconstructive Power of Social Norms , 2007, Psychological science.

[10]  Ortwin Renn,et al.  The Social Amplification of Risk: A Conceptual Framework , 1988 .

[11]  D. Winterfeldt,et al.  Beyond acceptable risk: On the social acceptability of technologies , 1982 .

[12]  Katharina J. Rohlfing,et al.  Attention via Synchrony: Making Use of Multimodal Cues in Social Learning , 2009, IEEE Transactions on Autonomous Mental Development.

[13]  Sotaro Kita,et al.  Nodding, aizuchi, and final particles in Japanese conversation: How conversation reflects the ideology of communication and social relationships , 2007 .

[14]  R. G. Fontaine,et al.  Peer Rejection and Social Information-Processing Factors in the Development of Aggressive Behavior Problems in Children , 2003, Child development.

[15]  Matthias Rolf,et al.  What If: Robots Create Novel Goals? Ethics Based on Social Value Systems , 2016, EDIA@ECAI.

[16]  Sandip Sen,et al.  Emergence of Norms through Social Learning , 2007, IJCAI.

[17]  John Yearwood,et al.  On the Limitations of Scalarisation for Multi-objective Reinforcement Learning of Pareto Fronts , 2008, Australasian Conference on Artificial Intelligence.

[18]  F. Fagnani,et al.  Risk Perception and Social Acceptability of Technologies: The French Case , 1989 .

[19]  G. Fernández,et al.  Reinforcement Learning Signal Predicts Social Conformity , 2009, Neuron.

[20]  Pierre-Yves Oudeyer,et al.  Intrinsic Motivation Systems for Autonomous Mental Development , 2007, IEEE Transactions on Evolutionary Computation.

[21]  Stefan Wermter,et al.  The effects on adaptive behaviour of negatively valenced signals in reinforcement learning , 2017, 2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).

[22]  Minoru Asada,et al.  Cognitive developmental robotics as a new paradigm for the design of humanoid robots , 2001, Robotics Auton. Syst..

[23]  Yukie Nagai,et al.  Yet another gaze detector: An embodied calibration free system for the iCub robot , 2015, 2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids).

[24]  Minoru Asada,et al.  Autonomous development of goals: From generic rewards to goal and self detection , 2014, 4th International Conference on Development and Learning and on Epigenetic Robotics.

[25]  Brian R. Duffy,et al.  Fundamental Issues in Social Robotics , 2006 .

[26]  Yu Zhao,et al.  Self-calibrating smooth pursuit through active efficient coding , 2015, Robotics Auton. Syst..

[27]  C. Allen,et al.  Moral Machines: Teaching Robots Right from Wrong , 2008 .

[28]  Daijin Kim,et al.  Robust Real-Time Face Detection Using Face Certainty Map , 2007, ICB.

[29]  Giulio Sandini,et al.  Prospection in Cognition: The Case for Joint Episodic-Procedural Memory in Cognitive Robotics , 2015, Front. Robot. AI.

[30]  John Mason,et al.  Robust voice activity detection using cepstral features , 1993, Proceedings of TENCON '93. IEEE Region 10 International Conference on Computers, Communications and Automation.

[31]  Giulio Sandini,et al.  The iCub humanoid robot: an open platform for research in embodied cognition , 2008, PerMIS.

[32]  Minoru Asada,et al.  What are goals? And if so, how many? , 2015, 2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).

[33]  David L. Roberts,et al.  Learning something from nothing: Leveraging implicit human feedback strategies , 2014, The 23rd IEEE International Symposium on Robot and Human Interactive Communication.

[34]  Bryan Casey,et al.  Amoral Machines, Or: How Roboticists Can Learn to Stop Worrying and Love the Law , 2017 .

[35]  M. Hauser,et al.  The tuning of human neonates' preference for speech. , 2010, Child development.

[36]  Sandip Sen,et al.  Proceedings of the fifth international conference on Autonomous agents , 2001 .

[37]  Michael Fisher,et al.  Formal verification of ethical choices in autonomous systems , 2016, Robotics Auton. Syst..

[38]  Giulio Sandini,et al.  Learning about objects through action - initial steps towards artificial cognition , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[39]  Jochen J. Steil,et al.  Rare Neural Correlations Implement Robotic Conditioning with Delayed Rewards and Disturbances , 2013, Front. Neurorobot..

[40]  Takayuki Kanda,et al.  > Replace This Line with Your Paper Identification Number (double-click Here to Edit) < 2 , 2022 .

[41]  이진성 method of breeding robot pet using on-line and off-line systems simulaneously , 2001 .

[42]  Stefan Elfwing,et al.  Parallel reward and punishment control in humans and robots: Safe reinforcement learning using the MaxPain algorithm , 2017, 2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).

[43]  Michael Spranger,et al.  The evolution of grounded spatial language , 2016 .

[44]  Kate M. Johnson,et al.  Cultural differences in moral judgment and behavior, across and within societies. , 2016, Current opinion in psychology.

[45]  Peter Stone,et al.  A social reinforcement learning agent , 2001, AGENTS '01.

[46]  Minoru Asada,et al.  Emergence of mirror neuron system: Immature vision leads to self-other correspondence , 2011, 2011 IEEE International Conference on Development and Learning (ICDL).

[47]  Joseph L. Austerweil,et al.  Networks of Social and Moral Norms in Human and Robot Agents , 2017 .

[48]  Mark H. Johnson Subcortical face processing , 2005, Nature Reviews Neuroscience.

[49]  Pierre Blazevic,et al.  Mechatronic design of NAO humanoid , 2009, 2009 IEEE International Conference on Robotics and Automation.

[50]  Marek P. Michalowski,et al.  Keepon , 2009, Int. J. Soc. Robotics.

[51]  Giulio Sandini,et al.  Developmental robotics: a survey , 2003, Connect. Sci..

[52]  Jochen J. Steil,et al.  Goal Babbling: a New Concept for Early Sensorimotor Exploration , 2012 .

[53]  Minoru Asada,et al.  Where do goals come from? A Generic Approach to Autonomous Goal-System Development , 2014, ArXiv.

[54]  M. Asada Towards Artificial Empathy based on Affective Developmental Robotics , 2014 .

[55]  Chrystopher L. Nehaniv,et al.  An empirical framework for human-robot proxemics , 2009 .

[56]  Nick Bostrom,et al.  Superintelligence: Paths, Dangers, Strategies , 2014 .