A Monte Carlo Update for Parametric POMDPs

This paper presents the Parameterised POMDP (PPOMDP) algorithm: a method for planning in the space of continuous parameterised functions. The novel contribution is an approach to transitioning parameterised beliefs using Monte Carlo methods. By re-using prediction and observation calculations, the transition function can be computed efficiently. An analysis of scalability suggests that the approach is likely to scale to physically larger environments than algorithms which rely on an underlying discretisation. Experimental results in a simulated robot navigation problem show that the algorithm compares favourably with existing approaches.

[1]  Hugh F. Durrant-Whyte,et al.  Simultaneous localization and mapping: part I , 2006, IEEE Robotics & Automation Magazine.

[2]  Sebastian Thrun,et al.  Monte Carlo POMDPs , 1999, NIPS.

[3]  Illah R. Nourbakhsh,et al.  DERVISH - An Office-Navigating Robot , 1995, AI Mag..

[4]  Hugh Durrant-Whyte,et al.  Simultaneous localization and mapping (SLAM): part II , 2006 .

[5]  Sebastian Thrun,et al.  Probabilistic robotics , 2002, CACM.

[6]  Alexei Makarenko,et al.  Parametric POMDPs for planning in continuous state spaces , 2006, Robotics Auton. Syst..

[7]  Milos Hauskrecht,et al.  Value-Function Approximations for Partially Observable Markov Decision Processes , 2000, J. Artif. Intell. Res..

[8]  Douglas Aberdeen,et al.  Scalable Internal-State Policy-Gradient Methods for POMDPs , 2002, ICML.

[9]  Geoffrey J. Gordon Stable Function Approximation in Dynamic Programming , 1995, ICML.

[10]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[11]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[12]  Geoffrey J. Gordon,et al.  Finding Approximate POMDP solutions Through Belief Compression , 2011, J. Artif. Intell. Res..

[13]  Nikos A. Vlassis,et al.  Perseus: Randomized Point-based Value Iteration for POMDPs , 2005, J. Artif. Intell. Res..

[14]  Joelle Pineau,et al.  Point-based value iteration: An anytime algorithm for POMDPs , 2003, IJCAI.

[15]  D. Moore Simplicial Mesh Generation with Applications , 1992 .