Detecting contingencies: An infomax approach

The ability to detect social contingencies plays an important role in the social and emotional development of infants. Analyzing this problem from a computational perspective may provide important clues for understanding social development, as well as for the synthesis of social behavior in robots. In this paper, we show that the turn-taking behaviors observed in infants during contingency detection situations are tuned to optimally gather information as to whether a person is responsive to them. We show that simple reinforcement learning mechanisms can explain how infants acquire these efficient contingency detection schemas. The key is to use the reduction of uncertainty (information gain) as a reward signal. The result is an interesting form of learning in which the learner rewards itself for conducting actions that help reduce its own sense of uncertainty. This paper illustrates the possibilities of an emerging area of computer science and engineering that focuses on the computational understanding of human behavior and on its synthesis in robots. We believe that the theory of stochastic optimal control will play a key role providing a formal mathematical foundation for this newly emerging discipline.

[1]  Javier R. Movellan,et al.  Optimal scanning for faster object detection , 2009, CVPR.

[2]  R. Wise Dopamine, learning and motivation , 2004, Nature Reviews Neuroscience.

[3]  E. Thoman,et al.  Origins of the infant's social responsiveness , 1979 .

[4]  Phil Howlett,et al.  Stochastic Optimal Control of a Solar Car , 2001 .

[5]  P. Rochat Early Social Cognition : Understanding Others in the First Months of Life , 1999 .

[6]  J. Movellan,et al.  The development of gaze following as a Bayesian systems identification problem , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[7]  Lorraine E. Bahrick,et al.  Detection of Intermodal Proprioceptive-Visual Contingency as a , Potential Basis of Self-Perception in Infancy , 1985 .

[8]  Stan Van Hooft Abstract , 2001 .

[9]  Javier R. Movellan,et al.  Infomax Control of Eye Movements , 2010, IEEE Transactions on Autonomous Mental Development.

[10]  N. Smelser,et al.  International Encyclopedia of the Social and Behavioral Sciences , 2001 .

[11]  Refractor Vision , 2000, The Lancet.

[12]  M. K rn,et al.  Stochastic Optimal Control , 1988 .

[13]  S. Carey,et al.  Whose gaze will infants follow? The elicitation of gaze-following in 12-month-olds , 1998 .

[14]  Ethan S. Bromberg-Martin,et al.  Midbrain Dopamine Neurons Signal Preference for Advance Information about Upcoming Rewards , 2009, Neuron.

[15]  J. Watson Smiling, cooing, and "the game." , 1972 .

[16]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[17]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[18]  David Elkind,et al.  Learning: An Introduction , 1968 .

[19]  A. Inkeles,et al.  International Encyclopedia of the Social Sciences. , 1968 .

[20]  Jonathan D. Cohen,et al.  Computational roles for dopamine in behavioural control , 2004, Nature.

[21]  N. Fox,et al.  Social perception in infants , 1985 .

[22]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[23]  N.J. Butko,et al.  I-POMDP: An infomax model of eye movement , 2008, 2008 7th IEEE International Conference on Development and Learning.

[24]  Javier R. Movellan,et al.  Perception of Directional Attention , 2001 .

[25]  Javier R. Movellan,et al.  Active Inference in Concept Learning , 2000, NIPS.