Active sensing for motion planning in uncertain environments via mutual information policies

This paper addresses path planning with real-time reaction to environmental uncertainty. The environment is represented as a robotic roadmap, or graph, and is uncertain in that the edges of the graph are unknown to the robot a priori. Instead, the robot’s prior information consists of a distribution over candidate edge sets, modeling the likelihood of certain obstacles in the environment. The robot can locally sense the environment, and at a vertex, can determine the presence or absence of some subset of edges. Within this model, the reactive planning problem provides the robot with a start location and a goal location and asks it to compute a policy that minimizes the expected travel and observation cost. In contrast to computing paths that maximize the probability of success, we focus on complete policies (i.e., policies that are guaranteed to navigate the robot to the goal or determine no such path exists). We prove that the problem is NP-hard and provide a suboptimal, but computationally efficient solution. This solution, based on mutual information, returns a complete policy and a bound on the gap between the policy’s expected cost and the optimal. We test the performance of the policy and the lower bound against that of the optimal policy and explore the effects of errors in the robot’s prior information on performance. Simulations are run on a flexible factory scenario to demonstrate the scalability of the proposed approach. Finally, we present a method to extend this solution to robots with faulty sensors.

[1]  Andreas Krause,et al.  Near Optimal Bayesian Active Learning for Decision Making , 2014, AISTATS.

[2]  Vijay Kumar,et al.  Approximate representations for multi-robot control policies that maximize mutual information , 2014, Robotics: Science and Systems.

[3]  Mihalis Yannakakis,et al.  Shortest Paths Without a Map , 1989, Theor. Comput. Sci..

[4]  Ann Melissa Campbell,et al.  Shortest path problem with arc failure scenarios , 2017, EURO J. Transp. Logist..

[5]  David Hsu,et al.  Adaptive Informative Path Planning in Metric Spaces , 2014, WAFR.

[6]  Giovanni Andreatta,et al.  Stochastic shortest paths with recourse , 1988, Networks.

[7]  Geoffrey A. Hollinger,et al.  Sampling-based robotic information gathering algorithms , 2014, Int. J. Robotics Res..

[8]  Yuri M. Suhov,et al.  Basic inequalities for weighted entropies , 2015, ArXiv.

[9]  Steven M. LaValle,et al.  Planning algorithms , 2006 .

[10]  John N. Tsitsiklis,et al.  Stochastic shortest path problems with recourse , 1996, Networks.

[11]  Gaurav S. Sukhatme,et al.  Branch and bound for informative path planning , 2012, 2012 IEEE International Conference on Robotics and Automation.

[12]  J. Tsitsiklis,et al.  Stochastic shortest path problems with recourse , 1996 .

[13]  Solomon Eyal Shimony,et al.  Canadian traveler problem with remote sensing , 2009, IJCAI 2009.

[14]  Vijay Kumar,et al.  Active Magnetic Anomaly Detection Using Multiple Micro Aerial Vehicles , 2016, IEEE Robotics and Automation Letters.

[15]  HsuDavid,et al.  Integrated perception and planning in the continuous space , 2014 .

[16]  Sven Koenig,et al.  Fast replanning for navigation in unknown terrain , 2005, IEEE Transactions on Robotics.

[17]  Geoffrey A. Hollinger,et al.  Active planning for underwater inspection and the benefit of adaptivity , 2012, Int. J. Robotics Res..

[18]  Srinivasan Seshan,et al.  Active sensing data collection with autonomous mobile robots , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[19]  Mac Schwager,et al.  Correlated Orienteering Problem and its application to informative path planning for persistent monitoring tasks , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20]  John N. Tsitsiklis,et al.  The Complexity of Markov Decision Processes , 1987, Math. Oper. Res..

[21]  Mukesh K. Mohania,et al.  Decision trees for entity identification: approximation algorithms and hardness results , 2007, PODS '07.

[22]  Vijay Kumar,et al.  A decentralized control policy for adaptive information gathering in hazardous environments , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[23]  David Hsu,et al.  Integrated Perception and Planning in the Continuous Space: A POMDP Approach , 2013, Robotics: Science and Systems.

[24]  Vijay Kumar,et al.  Persistent Homology for Path Planning in Uncertain Environments , 2015, IEEE Transactions on Robotics.

[25]  Min Chen,et al.  POMDP-lite for robust robot planning under uncertainty , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[26]  David Furcy,et al.  Lifelong Planning A , 2004, Artif. Intell..

[27]  Stephen L. Smith,et al.  Reactive Motion Planning in Uncertain Environments via Mutual Information Policies , 2016, WAFR.

[28]  Jan Faigl,et al.  Multi-robot path planning for budgeted active perception with self-organising maps , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[29]  Ron Alterovitz,et al.  Motion planning under uncertainty using iterative local optimization in belief space , 2012, Int. J. Robotics Res..

[30]  Daniela Rus,et al.  On mutual information-based control of range sensing robots for mapping applications , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[31]  Leslie Pack Kaelbling,et al.  Integrated task and motion planning in belief space , 2013, Int. J. Robotics Res..