Belief Change Maximisation for Hydrothermal Vent Hunting Using Occupancy Grids

The problem of where a mobile robot should go to efficiently build a map of its surroundings is frequently addressed using entropy reduction techniques. However, in exploration problems where the goal is to find an object or objects of interest, such techniques can be a useful heuristic but are optimising the wrong quantity. An example of such a problem is an autonomous underwater vehicle (AUV) searching the sea floor for hydrothermal vents. The state of the art in these problems is information lookahead in the action-observation space which is computationally expensive. We present an original belief-maximisation algorithm for this problem, and use a simulation of the AUV problem to show that our method outperforms straightforward entropy reduction and runs much faster than information lookahead while approaching it in terms of performance. We further introduce a heuristic using an orienteering-problem (OP) solver, which improves the performance of both our belief-maximisation algorithm and information lookahead.

[1]  T. Tsiligirides,et al.  Heuristic Methods Applied to Orienteering , 1984 .

[2]  R. Vohra,et al.  The Orienteering Problem , 1987 .

[3]  Alberto Elfes,et al.  Using occupancy grids for mobile robot perception and navigation , 1989, Computer.

[4]  Andrew G. Barto,et al.  Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..

[5]  A. Sokal Monte Carlo methods for the self-avoiding walk , 1994, hep-lat/9509032.

[6]  Hans P. Moravec,et al.  Robot Evidence Grids. , 1996 .

[7]  Bruce L. Golden,et al.  A fast and effective heuristic for the orienteering problem , 1996 .

[8]  Shlomo Zilberstein,et al.  LAO*: A heuristic search algorithm that finds solutions with loops , 2001, Artif. Intell..

[9]  Alexei Makarenko,et al.  Information based adaptive robotic exploration , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[10]  David R. Karger,et al.  Approximation algorithms for orienteering and discounted-reward TSP , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..

[11]  R. Andrew Russell,et al.  A comparison of reactive robot chemotaxis algorithms , 2003, Robotics Auton. Syst..

[12]  J. Farrell,et al.  Chemical plume tracing experimental results with a REMUS AUV , 2003, Oceans 2003. Celebrating the Past ... Teaming Toward the Future (IEEE Cat. No.03CH37492).

[13]  Rui P. Rocha,et al.  Cooperative Multi-Robot Systems A study of Vision-based 3-D Mapping using Information Theory , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[14]  Wolfram Burgard,et al.  Information Gain-based Exploration Using Rao-Blackwellized Particle Filters , 2005, Robotics: Science and Systems.

[15]  Andreas Krause,et al.  Near-optimal Observation Selection using Submodular Functions , 2007, AAAI.

[16]  Massimo Vergassola,et al.  ‘Infotaxis’ as a strategy for searching without gradients , 2007, Nature.

[17]  Michael V. Jakuba,et al.  Stochastic mapping for chemical plume source localization with application to autonomous hydrothermal vent discovery , 2007 .

[18]  Richard Dearden,et al.  Planning for AUVs: Dealing with a Continuous Partially-Observable Environment , 2007 .

[19]  Richard Dearden,et al.  Information-Lookahead Planning for AUV Mapping , 2009, IJCAI.

[20]  Kian Hsiang Low,et al.  Information-Theoretic Approach to Efficient Adaptive Path Planning for Mobile Robotic Environmental Sensing , 2009, ICAPS.