Incorporating Rich Social Interactions Into MDPs

Much of what we do as humans is engage socially with other agents, a skill that robots must also eventually possess. We demonstrate that a rich theory of social interactions originating from microsociology and economics can be formalized by extending a nested MDP where agents reason about arbitrary functions of each other’s hidden rewards. This extended Social MDP allows us to encode the five basic interactions that underlie microsociology: cooperation, conflict, coercion, competition, and exchange. The result is a robotic agent capable of executing social interactions zero-shot in new environments; like humans it can engage socially in novel ways even without a single example of that social interaction. Moreover, the judgments of these Social MDPs align closely with those of humans when considering which social interaction is taking place in an environment. This method both sheds light on the nature of social interactions, by providing concrete mathematical definitions, and brings rich social interactions into a mathematical framework that has proven to be natural for robotics, MDPs.

[1]  Jake K. Aggarwal,et al.  Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[2]  Chris L. Baker,et al.  Action understanding as inverse planning , 2009, Cognition.

[3]  Anca D. Dragan,et al.  Cooperative Inverse Reinforcement Learning , 2016, NIPS.

[4]  Noah D. Goodman,et al.  Theory-based Social Goal Inference , 2008 .

[5]  Ian D. Reid,et al.  Structured Learning of Human Interactions in TV Shows , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  U. Frith,et al.  Do triangles play tricks? Attribution of mental states to animated shapes in normal and abnormal development , 2000 .

[7]  E. Goffman The Presentation of Self in Everyday Life , 1959 .

[8]  Prashant Doshi,et al.  Monte Carlo Sampling Methods for Approximating Interactive POMDPs , 2014, J. Artif. Intell. Res..

[9]  Joshua B. Tenenbaum,et al.  Help or Hinder: Bayesian Models of Social Goal Inference , 2009, NIPS.

[10]  Chris L. Baker,et al.  Modeling Human Plan Recognition Using Bayesian Theory of Mind , 2014 .

[11]  Jordan L. Boyd-Graber,et al.  Opponent Modeling in Deep Reinforcement Learning , 2016, ICML.

[12]  Chelsea Finn,et al.  Learning Latent Representations to Influence Multi-Agent Interaction , 2020, CoRL.

[13]  Prashant Doshi,et al.  Generalized Point Based Value Iteration for Interactive POMDPs , 2008, AAAI.

[14]  E. Goffman On face-work; an analysis of ritual elements in social interaction. , 1955, Psychiatry.

[15]  Noah D. Goodman,et al.  The mentalistic basis of core social cognition: experiments in preverbal infants and a computational model. , 2013, Developmental science.

[16]  Joshua B. Tenenbaum,et al.  Plans or Outcomes: How Do We Attribute Intelligence to Others? , 2021, Cogn. Sci..

[17]  Andrew Wang,et al.  Bayes-Adaptive Interactive POMDPs , 2012, AAAI.

[18]  H. Francis Song,et al.  Machine Theory of Mind , 2018, ICML.

[19]  Andrew Zisserman,et al.  Detecting People Looking at Each Other in Videos , 2014, International Journal of Computer Vision.

[20]  J. Tenenbaum,et al.  Adventures in Flatland: Perceiving Social Interactions Under Physical Dynamics , 2020, CogSci.

[21]  Peter Stone,et al.  Autonomous agents modelling other agents: A comprehensive survey and open problems , 2017, Artif. Intell..

[22]  Sanja Fidler,et al.  Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration , 2020, ICLR.

[23]  Joshua B. Tenenbaum,et al.  Coordinate to cooperate or compete: Abstract goals and joint intentions in social interaction , 2016, CogSci.

[24]  Joshua B. Tenenbaum,et al.  Bayesian Theory of Mind: Modeling Joint Belief-Desire Attribution , 2011, CogSci.

[25]  Joshua B. Tenenbaum,et al.  PHASE: PHysically-grounded Abstract Social Events for Machine Social Perception , 2021, AAAI.

[26]  Chunqiao Tan,et al.  Bargaining Game with Altruistic and Spiteful Preferences , 2020, Group Decision and Negotiation.

[27]  M. Argyle Social interactions. , 1976, Science.

[28]  Andrei Barbu,et al.  Social Interactions as Recursive MDPs , 2021, CoRL.

[29]  D. Levine Modeling Altruism and Spitefulness in Experiments , 1998 .

[30]  F. Heider,et al.  An experimental study of apparent behavior , 1944 .