POMDP-based Planning for Visual Processing Management on a Robot

Recent progress in sensor technology [10, 24], and the use of state of the art algorithms to process the input from a variety of sensors, has resulted in the deployment of mobile robots in several specific applications [2, 17, 22]. A key requirement for the widespread deployment of mobile robots is the ability to autonomously tailor the sensory processing to the task at hand. Our work represents a significant effort towards such general-purpose processing of visual input. We pose visual processing management as an instance of probabilistic sequential decision making, and specifically as a Partially Observable Markov Decision Process (POMDP). Our prior work introduced a hierarchical POMDP decomposition that enables a robot to plan a sequence of visual operators that reliably and efficiently analyze the state of the world represented by salient regions-of-interest (ROIs) in input images [20]. Here, we significantly enhance the capabilities of the existing system by: (a) extending our POMDP framework to autonomously adapt to a change in state space dimensions, thereby enabling the robot to effectively process partially overlapping objects in the image; and (b) enabling the robot to autonomously trade-off planning speed and plan quality, by theoretically and empirically evaluating the estimation errors involved in policy caching. All algorithms are implemented and tested on a physical robot platform. We show that the hierarchical planner performs significantly better than a modern planner that has been applied successfully to human-robot interaction domains [1].

[1]  Horst Bunke,et al.  Vision planner for an intelligent multisensory vision system , 1994, Defense, Security, and Sensing.

[2]  Sabine Moisan,et al.  Use of a real-time perception program supervisor in a driving scenario , 1994, Proceedings of the Intelligent Vehicles '94 Symposium.

[3]  Trevor Darrell Reinforcement Learning of Active Recognition Behaviors , 1997, NIPS 1997.

[4]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[5]  Marinette Revenu,et al.  Borg: A Knowledge-Based System for Automatic Generation of Image Processing Programs , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Sabine Moisan,et al.  What can program supervision do for program reuse? , 2000, IEE Proc. Softw..

[7]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[8]  Joelle Pineau,et al.  High-level robot behavior control using POMDPs , 2002 .

[9]  Robin R. Murphy,et al.  Human-robot interactions during the robot-assisted urban search and rescue response at the World Trade Center , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[10]  Sabine Moisan Program Supervision : Yakl and Pegase+ Reference and User Manual , 2003 .

[11]  Eric A. Hansen,et al.  Synthesis of Hierarchical Finite-State Controllers for POMDPs , 2003, ICAPS.

[12]  Fahiem Bacchus,et al.  Extending the Knowledge-Based Approach to Planning with Incomplete Information and Sensing , 2004, ICAPS.

[13]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[14]  Reid G. Simmons,et al.  Point-Based POMDP Algorithms: Improved Analysis and Implementation , 2005, UAI.

[15]  Sebastian Thrun,et al.  Stanley: The robot that won the DARPA Grand Challenge , 2006, J. Field Robotics.

[16]  Bernhard Nebel,et al.  Continual planning and acting in dynamic multiagent environments , 2006 .

[17]  Sebastian Thrun,et al.  A Personal Account of the Development of Stanley, the Robot That Won the DARPA Grand Challenge , 2006, AI Mag..

[18]  Panos E. Trahanias,et al.  Real-time hierarchical POMDPs for autonomous robot navigation , 2007, Robotics Auton. Syst..

[19]  Aaron Sloman,et al.  Towards an Integrated Robot with Multiple Cognitive Functions , 2007, AAAI.

[20]  Richard Dearden,et al.  HiPPo: Hierarchical POMDPs for Planning Information Processing and Sensing Actions on a Robot , 2008, ICAPS.

[21]  Marc Toussaint,et al.  Hierarchical POMDP Controller Optimization by Likelihood Maximization , 2008, UAI.

[22]  Russell Greiner,et al.  Improving an Adaptive Image Interpretation System by Leveraging , 2008 .

[23]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .