Towards Human–Robot Teams: Model-Based Analysis of Human Decision Making in Two-Alternative Choice Tasks With Social Feedback

With a principled methodology for systematic design of human-robot decision-making teams as a motivating goal, we seek an analytic, model-based description of the influence of team and network design parameters on decision-making performance. Given that there are few reliably predictive models of human decision making, we consider the relatively well-understood two-alternative choice tasks from cognitive psychology, where individuals make sequential decisions with limited information, and we study a stochastic decision-making model, which has been successfully fitted to human behavioral and neural data for a range of such tasks. We use an extension of the model, fitted to experimental data from groups of humans performing the same task simultaneously and receiving feedback on the choices of others in the group. First, we show how the task and model can be regarded as a Markov process. Then, we derive analytically the steady-state probability distributions for decisions and performance as a function of model and design parameters such as the strength and path of the social feedback. Finally, we discuss application to human-robot team and network design and next steps with a multirobot testbed.

[1]  Kristi A. Morgansen,et al.  Modeling and analysis of dynamic decision making in sequential two-choice tasks , 2008, 2008 47th IEEE Conference on Decision and Control.

[2]  Philip Holmes,et al.  A Decision Task in a Social Context: Human Experiments, Models, and Analyses of Behavioral Data , 2012, Proceedings of the IEEE.

[3]  E. Seneta Non-negative Matrices and Markov Chains , 2008 .

[4]  Alexei Makarenko,et al.  Human-robot communication for collaborative decision making - A probabilistic approach , 2010, Robotics Auton. Syst..

[5]  Jean Scholtz,et al.  The Peer-to-Peer Human-Robot Interaction Project , 2005 .

[6]  D. W. Hands The Matching Law: Papers In Psychology And Economics , 1999 .

[7]  Jonathan D. Cohen,et al.  The physics of optimal decision making: a formal analysis of models of performance in two-alternative forced-choice tasks. , 2006, Psychological review.

[8]  W. Marsden I and J , 2012 .

[9]  Donald D. Dudenhoeffer,et al.  Mixed-initiative control for remote characterization of hazardous environments , 2003, 36th Annual Hawaii International Conference on System Sciences, 2003. Proceedings of the.

[10]  Brian D. O. Anderson,et al.  Reaching a Consensus in a Dynamically Changing Environment: Convergence Rates, Measurement Delays, and Asynchronous Events , 2008, SIAM J. Control. Optim..

[11]  Naomi Ehrich Leonard,et al.  Integrating human and robot decision-making dynamics with feedback: Models and convergence analysis , 2008, 2008 47th IEEE Conference on Decision and Control.

[12]  Naomi Ehrich Leonard,et al.  Convergence in human decision-making dynamics , 2010, Syst. Control. Lett..

[13]  B. Øksendal Stochastic differential equations : an introduction with applications , 1987 .

[14]  Roger Ratcliff,et al.  A Theory of Memory Retrieval. , 1978 .

[15]  P. Montague,et al.  Neural Economics and the Biological Substrates of Valuation , 2002, Neuron.

[16]  R. Ratcliff,et al.  Connectionist and diffusion models of reaction time. , 1999, Psychological review.

[17]  Ronald L. Boring,et al.  Shared understanding for collaborative control , 2005, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[18]  Samuel M. McClure,et al.  Short-term memory traces for action bias in human reinforcement learning , 2007, Brain Research.

[19]  Philip Holmes,et al.  A simple decision task in a social context: Experiments, a model, and preliminary analyses of behavioral data , 2008, 2008 47th IEEE Conference on Decision and Control.

[20]  R. Herrnstein Experiments on Stable Suboptimality in Individual Behavior , 1991 .

[21]  Naomi Ehrich Leonard,et al.  Collective Motion, Sensor Networks, and Ocean Sampling , 2007, Proceedings of the IEEE.

[22]  P. Montague,et al.  A Computational Role for Dopamine Delivery in Human Decision-Making , 1998, Journal of Cognitive Neuroscience.

[23]  J. Andel Sequential Analysis , 2022, The SAGE Encyclopedia of Research Design.

[24]  J. Wolfowitz,et al.  Optimum Character of the Sequential Probability Ratio Test , 1948 .

[25]  Naomi Ehrich Leonard,et al.  The role of social feedback in steady-state performance of human decision making for two-alternative choice tasks , 2010, 49th IEEE Conference on Decision and Control (CDC).

[26]  P. Dayan,et al.  A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[27]  Jonathan D. Cohen,et al.  Explicit melioration by a neural diffusion model , 2009, Brain Research.

[28]  Kenneth Dixon,et al.  Introduction to Stochastic Modeling , 2011 .

[29]  Philip L. Smith,et al.  Psychology and neurobiology of simple decisions , 2004, Trends in Neurosciences.

[30]  Naomi Ehrich Leonard,et al.  Steady-state distributions for human decisions in two-alternative choice tasks , 2010, Proceedings of the 2010 American Control Conference.

[31]  R. Herrnstein Rational Choice Theory Necessary but Not Sufficient , 1990 .

[32]  Michael R. Frey,et al.  An Introduction to Stochastic Modeling (2nd Ed.) , 1994 .

[33]  Robin R. Murphy,et al.  Human-robot interaction in rescue robotics , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).