HiPPo: Hierarchical POMDPs for Planning Information Processing and Sensing Actions on a Robot

Flexible general purpose robots need to tailor their visual processing to their task, on the fly. We propose a new approach to this within a planning framework, where the goal is to plan a sequence of visual operators to apply to the regions of interest (ROIs) in a scene. We pose the visual processing problem as a Partially Observable Markov Decision Process (POMDP). This requires probabilistic models of operator effects to quantitatively capture the unreliability of the processing actions, and thus reason precisely about trade-offs between plan execution time and plan reliability. Since planning in practical sized POMDPs is intractable we show how to ameliorate this intractability somewhat for our domain by defining a hierarchical POMDP. We compare the hierarchical POMDP approach with a Continual Planning (CP) approach. On a real robot visual domain, we show empirically that all the planning methods outperform naive application of all visual operators. The key result is that the POMDP methods produce more robust plans than either naive visual processing or the CP approach. In summary, we believe that visual processing problems represent a challenging and worthwhile domain for planning techniques, and that our hierarchical POMDP based approach to them opens up a promising new line of research.

[1]  Bernhard Nebel,et al.  The FF Planning System: Fast Plan Generation Through Heuristic Search , 2011, J. Artif. Intell. Res..

[2]  Joelle Pineau,et al.  High-level robot behavior control using POMDPs , 2002 .

[3]  Reid G. Simmons,et al.  Point-Based POMDP Algorithms: Improved Analysis and Implementation , 2005, UAI.

[4]  Fahiem Bacchus,et al.  Extending the Knowledge-Based Approach to Planning with Incomplete Information and Sensing , 2004, ICAPS.

[5]  Sabine Moisan,et al.  What can program supervision do for program reuse? , 2000, IEE Proc. Softw..

[6]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[7]  Lee Dn,et al.  The optic flow field: the foundation of vision. , 1980 .

[8]  Craig A. Knoblock,et al.  PDDL-the planning domain definition language , 1998 .

[9]  Sabine Moisan,et al.  Use of a real-time perception program supervisor in a driving scenario , 1994, Proceedings of the Intelligent Vehicles '94 Symposium.

[10]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[11]  Marc Toussaint,et al.  Hierarchical POMDP Controller Optimization by Likelihood Maximization , 2008, UAI.

[12]  Aaron Sloman,et al.  Towards an Integrated Robot with Multiple Cognitive Functions , 2007, AAAI.

[13]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[14]  Russell Greiner,et al.  Improving an Adaptive Image Interpretation System by Leveraging , 2008 .

[15]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[16]  Eric A. Hansen,et al.  Synthesis of Hierarchical Finite-State Controllers for POMDPs , 2003, ICAPS.

[17]  Thomas G. Dietterich The MAXQ Method for Hierarchical Reinforcement Learning , 1998, ICML.

[18]  Ian Horswill,et al.  Polly: A Vision-Based Artificial Agent , 1993, AAAI.

[19]  D. N. Lee The optic flow field: the foundation of vision. , 1980, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[20]  Tara A. Estlin,et al.  Automated software module reconfiguration through the use of artificial intelligence planning techniques , 2000, IEE Proc. Softw..

[21]  M. Hayhoe,et al.  In what ways do eye movements contribute to everyday activities? , 2001, Vision Research.

[22]  Bernhard Nebel,et al.  Continual planning and acting in dynamic multiagent environments , 2006 .

[23]  Marinette Revenu,et al.  Borg: A Knowledge-Based System for Automatic Generation of Image Processing Programs , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Sabine Moisan Program Supervision : Yakl and Pegase+ Reference and User Manual , 2003 .