论文信息 - POMDP-based Planning for Visual Processing Management on a Robot

POMDP-based Planning for Visual Processing Management on a Robot

Recent progress in sensor technology [10, 24], and the use of state of the art algorithms to process the input from a variety of sensors, has resulted in the deployment of mobile robots in several specific applications [2, 17, 22]. A key requirement for the widespread deployment of mobile robots is the ability to autonomously tailor the sensory processing to the task at hand. Our work represents a significant effort towards such general-purpose processing of visual input. We pose visual processing management as an instance of probabilistic sequential decision making, and specifically as a Partially Observable Markov Decision Process (POMDP). Our prior work introduced a hierarchical POMDP decomposition that enables a robot to plan a sequence of visual operators that reliably and efficiently analyze the state of the world represented by salient regions-of-interest (ROIs) in input images [20]. Here, we significantly enhance the capabilities of the existing system by: (a) extending our POMDP framework to autonomously adapt to a change in state space dimensions, thereby enabling the robot to effectively process partially overlapping objects in the image; and (b) enabling the robot to autonomously trade-off planning speed and plan quality, by theoretically and empirically evaluating the estimation errors involved in policy caching. All algorithms are implemented and tested on a physical robot platform. We show that the hierarchical planner performs significantly better than a modern planner that has been applied successfully to human-robot interaction domains [1].

R. Dearden | M. Sridharan | J. Wyatt

[1] Horst Bunke,et al. Vision planner for an intelligent multisensory vision system , 1994, Defense, Security, and Sensing.

[2] Sabine Moisan,et al. Use of a real-time perception program supervisor in a driving scenario , 1994, Proceedings of the Intelligent Vehicles '94 Symposium.

[3] Trevor Darrell. Reinforcement Learning of Active Recognition Behaviors , 1997, NIPS 1997.

[4] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[5] Marinette Revenu,et al. Borg: A Knowledge-Based System for Automatic Generation of Image Processing Programs , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[6] Sabine Moisan,et al. What can program supervision do for program reuse? , 2000, IEE Proc. Softw..

[7] David G. Stork,et al. Pattern classification, 2nd Edition , 2000 .

[8] Joelle Pineau,et al. High-level robot behavior control using POMDPs , 2002 .

[9] Robin R. Murphy,et al. Human-robot interactions during the robot-assisted urban search and rescue response at the World Trade Center , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[10] Sabine Moisan. Program Supervision : Yakl and Pegase+ Reference and User Manual , 2003 .

[11] Eric A. Hansen,et al. Synthesis of Hierarchical Finite-State Controllers for POMDPs , 2003, ICAPS.