Robust Mobile Robot Navigation using Partially-Observable Semi-Markov Decision Processes

Mobile robots operating in dynamic ooce environments need robust navigation algorithms capable of dealing with uncertainties arising from sensor and actuator errors. Such algorithms also need to be able to synthesize plans that take into account temporal aspects of their environment, such as crowded corridors, and busy intersections. This paper describes a novel mobile robot navigation architecture based on Partially-Observable Semi-Markov Decision Processes (POSMDP). This model extends previous work on POMDP-based navigation architectures, which assumed a discrete-time temporal model of actions. We show that the POSMDP model allows the robot to explicitly model the transition time of diierent actions in various locations in the environment (e.g. passing through intersections takes more time than going through corridors). The paper describes experimental results of using the POSMDP architecture on a real mobile robot called PAVLOV. The results show that the robot is able to traverse paths of several hundred meters and also plan to avoid crowded corridors.

[1]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[2]  Hans P. Moravec Certainty grids for mobile robots , 1987 .

[3]  Avinash C. Kak,et al.  Fast Vision-guided Mobile Robot Navigation Using Model-based Reasoning And Prediction Of Uncertainties , 1992, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4]  Reid G. Simmons,et al.  Probabilistic Robot Navigation in Partially Observable Environments , 1995, IJCAI.

[5]  Benjamin Kuipers,et al.  A robot exploration and mapping strategy based on a semantic hierarchy of spatial representations , 1991, Robotics Auton. Syst..

[6]  Michael L. Littman,et al.  Algorithms for Sequential Decision Making , 1996 .

[7]  Stuart J. Russell,et al.  Approximating Optimal Policies for Partially Observable Stochastic Domains , 1995, IJCAI.

[8]  Leslie Pack Kaelbling,et al.  Learning Policies for Partially Observable Environments: Scaling Up , 1997, ICML.

[9]  Hans P. Moravec,et al.  High resolution maps from wide angle sonar , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[10]  Reid G. Simmons,et al.  Passive Distance Learning for Robot Navigation , 1996, ICML.

[11]  Leslie Pack Kaelbling,et al.  Acting under uncertainty: discrete Bayesian models for mobile-robot navigation , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[12]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[13]  Wolfram Burgard,et al.  Map learning and high-speed navigation in RHINO , 1998 .