The Importance of a Suitable Distance Function in Belief-Space Planning

Many methods for planning under uncertainty operate in the belief space, i.e., the set of probability distributions over states. Although the problem is computationally hard, recent advances have shown that belief-space planning is becoming practical for many medium size problems. Some of the most successful methods utilize sampling and often rely on distances between beliefs to partially guide the search process. This paper deals with the question of what is a suitable distance function for belief space planning, which despite its importance remains unanswered. This work indicates that the rarely used Wasserstein distance (also known as Earth Mover’s Distance (\(\mathtt{EMD} \))) is a more suitable metric than the commonly used \(\mathtt{L1} \) and Kullback–Leibler (\(\mathtt{KL} \)) for belief-space planning. Simulation results on Non-Observable Markov Decision Problems, i.e., the simplest class of belief-space planning, indicate that as the problem becomes more complex, the differences on the effectiveness of different distance functions become quite prominent. In fact, in state spaces with more than 4 dimensions, by just replacing \(\mathtt{L1} \) or \(\mathtt{KL} \) distance with \(\mathtt{EMD} \), the problems become from virtually unsolvable to solvable within a reasonable time frame. Furthermore, preliminary results on Partially Observable Markov Decision Processes indicate that point-based solvers with \(\mathtt{EMD} \) use a smaller number of samples to generate policies with similar qualities, compared to those with \(\mathtt{L1} \) and \(\mathtt{KL} \). This paper also shows that \(\mathtt{EMD} \) carries the Lipschitz continuity of the state’s cost function to Lipschitz continuity of the expected cost of the beliefs. Such a continuity property is often critical for convergence to optimal solutions.

[1]  Richard Dearden,et al.  Navigation Planning in Probabilistic Roadmaps with Uncertainty , 2009, ICAPS.

[2]  Reid G. Simmons,et al.  Particle RRT for Path Planning with Uncertainty , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[3]  Nikos A. Vlassis,et al.  Perseus: Randomized Point-based Value Iteration for POMDPs , 2005, J. Artif. Intell. Res..

[4]  David Hsu,et al.  Motion planning under uncertainty for robotic tasks with long time horizons , 2010, Int. J. Robotics Res..

[5]  John N. Tsitsiklis,et al.  The Complexity of Markov Decision Processes , 1987, Math. Oper. Res..

[6]  Kostas E. Bekris,et al.  Sparse Methods for Efficient Asymptotically Optimal Kinodynamic Planning , 2014, WAFR.

[7]  Kostas E. Bekris,et al.  Efficient sampling-based motion planning with asymptotic near-optimality guarantees for systems with dynamics , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[8]  Reid G. Simmons,et al.  Point-Based POMDP Algorithms: Improved Analysis and Implementation , 2005, UAI.

[9]  David Hsu,et al.  Monte Carlo Value Iteration for Continuous-State POMDPs , 2010, WAFR.

[10]  Nicholas M. Patrikalakis,et al.  Analysis of Asymptotically Optimal Sampling-based Motion Planning Algorithms for Lipschitz Continuous Dynamical Systems , 2014, ArXiv.

[11]  P. Abbeel,et al.  LQG-MP: Optimized path planning for robots with motion uncertainty and imperfect state information , 2011 .

[12]  Nicholas Roy,et al.  Global Motion Planning under Uncertain Motion, Sensing, and Environment Map , 2012 .

[13]  Kostas E. Bekris,et al.  Asymptotically optimal sampling-based kinodynamic planning , 2014, Int. J. Robotics Res..

[14]  Siddhartha S. Srinivasa,et al.  Pre- and post-contact policy decomposition for planar contact manipulation under uncertainty , 2014, Int. J. Robotics Res..

[15]  Reid G. Simmons,et al.  Approaches for heuristically biasing RRT growth , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[16]  Nicholas M. Patrikalakis,et al.  Global motion planning under uncertain motion, sensing, and environment map , 2011, Auton. Robots.

[17]  David W. Jacobs,et al.  Approximate earth mover’s distance in linear time , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Nicholas Roy,et al.  Unmanned Aircraft Collision Avoidance Using Continuous-State POMDPs , 2012 .

[19]  N. Roy,et al.  The Belief Roadmap: Efficient Planning in Belief Space by Factoring the Covariance , 2009, Int. J. Robotics Res..

[20]  Joel W. Burdick,et al.  Interactive non-prehensile manipulation for grasping via POMDPs , 2013, 2013 IEEE International Conference on Robotics and Automation.

[21]  Nicholas Roy,et al.  Rapidly-exploring Random Belief Trees for motion planning under uncertainty , 2011, 2011 IEEE International Conference on Robotics and Automation.

[22]  Joelle Pineau,et al.  Point-based value iteration: An anytime algorithm for POMDPs , 2003, IJCAI.

[23]  Emilio Frazzoli,et al.  Sampling-based algorithms for optimal motion planning , 2011, Int. J. Robotics Res..

[24]  Pieter Abbeel,et al.  LQG-MP: Optimized path planning for robots with motion uncertainty and imperfect state information , 2010, Int. J. Robotics Res..

[25]  Sebastian Thrun,et al.  Monte Carlo POMDPs , 1999, NIPS.

[26]  Dudley,et al.  Real Analysis and Probability: Measurability: Borel Isomorphism and Analytic Sets , 2002 .

[27]  Pieter Abbeel,et al.  LQG-Based Planning, Sensing, and Control of Steerable Needles , 2010, WAFR.

[28]  Siddhartha S. Srinivasa,et al.  Pre- and post-contact policy decomposition for planar contact manipulation under uncertainty , 2014, Int. J. Robotics Res..

[29]  Michael Werman,et al.  Fast and robust Earth Mover's Distances , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[30]  Nicholas M. Patrikalakis,et al.  Point-Based Policy Transformation: Adapting Policy to Changing POMDP Models , 2012, WAFR.

[31]  David Hsu,et al.  SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces , 2008, Robotics: Science and Systems.

[32]  Simon Parsons,et al.  Principles of Robot Motion: Theory, Algorithms and Implementations by Howie Choset, Kevin M. Lynch, Seth Hutchinson, George Kantor, Wolfram Burgard, Lydia E. Kavraki and Sebastian Thrun, 603 pp., $60.00, ISBN 0-262-033275 , 2007, The Knowledge Engineering Review.

[33]  Leslie Pack Kaelbling,et al.  Non-Gaussian belief space planning: Correctness and complexity , 2012, 2012 IEEE International Conference on Robotics and Automation.