Paper Information - A POMDP Model of Eye-Hand Coordination

This paper presents a generative model of eye-hand coordination. We use numerical optimization to solve for the joint behavior of an eye and two hands, deriving a predicted motion pattern from first principles, without imposing heuristics. We model the planar scene as a POMDP with 17 continuous state dimensions. Belief-space optimization is facilitated by a nominal-belief heuristic: during planning, we assume that the maximum-likelihood observation is always obtained. Since a globally optimal solution for such a high-dimensional domain is computationally intractable, we employ local optimization in the belief domain. By solving for a locally optimal plan through belief space, we generate a motion pattern of mutual coordination between hands and eye: the eye's saccades disambiguate the scene in a task-relevant manner, and the hands' motions anticipate the eye's saccades. Finally, the model is validated through a behavioral experiment in which human subjects perform the same eye-hand coordination task. We show that the simulation is congruent with the experimental results.
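As a rough illustration of the nominal-belief heuristic mentioned in the abstract, the sketch below propagates a one-dimensional Gaussian belief about a static target while assuming the maximum-likelihood observation is received at every step. This is not the paper's 17-dimensional model: the scalar state, the eccentricity-dependent noise model, and every name and parameter (`observation_variance`, `nominal_belief_step`, `rollout`, `base`, `falloff`) are assumptions made for illustration only. The point it shows is that under this assumption the innovation is zero, so the belief mean evolves deterministically and only the variance depends on where the eye looks.

```python
def observation_variance(gaze, target_mean, base=0.01, falloff=1.0):
    """Hypothetical noise model: variance grows with distance from the gaze point."""
    return base + falloff * (gaze - target_mean) ** 2


def nominal_belief_step(mean, var, gaze, process_var=0.0):
    """One Kalman-style belief update under the maximum-likelihood observation assumption."""
    # Predict: the target is assumed static, so only the variance can grow.
    pred_mean = mean
    pred_var = var + process_var

    # Assumed observation equals the predicted mean, so the innovation vanishes.
    innovation = 0.0
    obs_var = observation_variance(gaze, pred_mean)
    gain = pred_var / (pred_var + obs_var)

    new_mean = pred_mean + gain * innovation  # mean is unchanged
    new_var = (1.0 - gain) * pred_var         # variance shrinks more near the gaze point
    return new_mean, new_var


def rollout(mean, var, gaze_plan):
    """Propagate the belief deterministically along a candidate sequence of gaze points."""
    for gaze in gaze_plan:
        mean, var = nominal_belief_step(mean, var, gaze)
    return mean, var


if __name__ == "__main__":
    # Compare fixating the target (at 1.0) with fixating elsewhere (at 0.0).
    print("fixate target:", rollout(1.0, 0.5, [1.0, 1.0, 1.0]))
    print("look away:    ", rollout(1.0, 0.5, [0.0, 0.0, 0.0]))
```

Because each rollout is deterministic, a planner could score candidate gaze plans by their final variance and improve them with a local optimizer, which is the general role the heuristic plays in belief-space planning.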
