论文信息 - Eye Movements for Reward Maximization

Eye Movements for Reward Maximization

Recent eye tracking studies in natural tasks suggest that there is a tight link between eye movements and goal directed motor actions. However, most existing models of human eye movements provide a bottom up account that relates visual attention to attributes of the visual scene. The purpose of this paper is to introduce a new model of human eye movements that directly ties eye movements to the ongoing demands of behavior. The basic idea is that eye movements serve to reduce uncertainty about environmental variables that are task relevant. A value is assigned to an eye movement by estimating the expected cost of the uncertainty that will result if the movement is not made. If there are several candidate eye movements, the one with the highest expected value is chosen. The model is illustrated using a humanoid graphic figure that navigates on a sidewalk in a virtual urban environment. Simulations show our protocol is superior to a simple round robin scheduling mechanism.

Dana H. Ballard | Nathan Sprague | D. Ballard | N. Sprague

[1] Rodney A. Brooks,et al. A Robust Layered Control Syste For A Mobile Robot , 2022 .

[2] David N. Lee,et al. Where we look when we steer , 1994, Nature.

[3] Richard S. Sutton,et al. Generalization in ReinforcementLearning : Successful Examples UsingSparse Coarse , 1996 .

[4] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[5] Mark Humphreys,et al. Action selection methods using reinforcement learning , 1997 .

[6] Jonas Karlsson,et al. Learning to Solve Multiple Goals , 1997 .

[7] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .

[8] A. Cassandra,et al. Exact and approximate algorithms for partially observable markov decision processes , 1998 .

[9] M. Hayhoe,et al. The coordination of eye, head, and hand movements in a natural task , 2001, Experimental Brain Research.

[10] C. Koch,et al. Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[11] W. Schultz,et al. Dopamine responses comply with basic assumptions of formal learning theory , 2001, Nature.

[12] T. Başar,et al. A New Approach to Linear Filtering and Prediction Problems , 2001 .

[13] Klaus H. Strobl,et al. Task-Oriented and Situation-Dependent Gaze Control for Vision Guided Humanoid Walking , 2003 .

[14] Dana H. Ballard,et al. Multiple-Goal Reinforcement Learning with Modular Sarsa(0) , 2003, IJCAI.

[15] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.