Visual Search and Multirobot Collaboration Based on Hierarchical Planning

Mobile robots are increasingly being used in the real world due to the availability of high-fidelity sensors and sophisticated information processing algorithms. A key challenge to the widespread deployment of robots is the ability to accurately sense the environment and to collaborate towards a common objective. Probabilistic sequential decision-making methods can address this challenge because they encapsulate the partial observability and non-determinism of robot domains. However, such formulations quickly become intractable in domains with complex state spaces that require real-time operation. Our prior work enabled a mobile robot to use hierarchical partially observable Markov decision processes (POMDPs) to automatically tailor visual sensing and information processing to the task at hand (Zhang, Sridharan, & Li 2011). This paper introduces adaptive observation functions and policy re-weighting in a three-layered POMDP hierarchy to enable reliable and efficient visual processing in dynamic domains. In addition, each robot merges its beliefs with those communicated by its teammates, enabling a team of robots to collaborate robustly. All algorithms are evaluated in simulated domains and on physical robots tasked with locating target objects in indoor environments.
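The belief-merging step mentioned in the abstract can be illustrated with a small sketch. The fusion rule below, a normalized element-wise product of the robots' belief distributions over candidate target locations, is one standard way to combine independent probabilistic estimates; the paper's actual merging rule is not given here, so `merge_beliefs` and its arguments are hypothetical illustrations only.

```python
def merge_beliefs(own_belief, teammate_beliefs, eps=1e-12):
    """Fuse a robot's belief over target locations with beliefs
    communicated by teammates.

    Uses a normalized element-wise product (assumes the robots'
    observations are conditionally independent); this is a common
    fusion rule, not necessarily the paper's exact method.
    """
    merged = list(own_belief)
    for belief in teammate_beliefs:
        merged = [m * b for m, b in zip(merged, belief)]
    total = sum(merged)
    if total < eps:
        # Beliefs conflict entirely; fall back to the robot's own estimate.
        return list(own_belief)
    return [m / total for m in merged]

# Two robots searching three rooms for a target object:
b1 = [0.6, 0.3, 0.1]
b2 = [0.5, 0.4, 0.1]
print(merge_beliefs(b1, [b2]))
```

Note that the product rule sharpens agreement (both robots favor room 0, so the merged belief favors it even more strongly) while the fallback guards against degenerate, fully conflicting inputs.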

[1] Maria Gini, et al. Communication Strategies in Multi-robot Search and Retrieval: Experiences with MinDART, 2004, DARS.

[2] Peter Norvig, et al. Artificial Intelligence: A Modern Approach, 1995.

[3] Russell Greiner, et al. Improving an Adaptive Image Interpretation System by Leveraging, 2008.

[4] Sebastian Thrun, et al. Stanley: The robot that won the DARPA Grand Challenge, 2006, J. Field Robotics.

[5] Panos E. Trahanias, et al. Real-time hierarchical POMDPs for autonomous robot navigation, 2007, Robotics Auton. Syst.

[6] Leslie Pack Kaelbling, et al. Planning and Acting in Partially Observable Stochastic Domains, 1998, Artif. Intell.

[7] N. J. Butko, et al. I-POMDP: An infomax model of eye movement, 2008, 7th IEEE International Conference on Development and Learning.

[8] Jesse Hoey, et al. Automated handwashing assistance for persons with dementia using video and a partially observable Markov decision process, 2010, Comput. Vis. Image Underst.

[9] Joelle Pineau, et al. Towards robotic assistants in nursing homes: Challenges and results, 2003, Robotics Auton. Syst.

[10] Xiang Li, et al. To look or not to look: A hierarchical representation for visual planning on mobile robots, 2011, IEEE International Conference on Robotics and Automation.

[11] Neil Immerman, et al. The Complexity of Decentralized Control of Markov Decision Processes, 2000, UAI.

[12] Sean Luke, et al. Cooperative Multi-Agent Learning: The State of the Art, 2005, Autonomous Agents and Multi-Agent Systems.

[13] Rong Yang, et al. Teamwork and Coordination under Model Uncertainty in DEC-POMDPs, 2010, Interactive Decision Theory and Game Theory.

[14] Andreas Krause, et al. Near-Optimal Sensor Placements in Gaussian Processes: Theory, Efficient Algorithms and Empirical Studies, 2008, J. Mach. Learn. Res.

[15] Richard Dearden, et al. Planning to see: A hierarchical approach to planning visual actions on a robot using POMDPs, 2010, Artif. Intell.

[16] Nils J. Nilsson, et al. Artificial Intelligence, 1974, IFIP Congress.

[17] Joelle Pineau, et al. A Bayesian Method for Learning POMDP Observation Parameters for Robot Interaction Management Systems, 2010.

[18] Olivier Buffet, et al. The factored policy-gradient planner, 2009, Artif. Intell.

[19] Joelle Pineau, et al. Online Planning Algorithms for POMDPs, 2008, J. Artif. Intell. Res.

[20] Mathijs de Weerdt, et al. Introduction to planning in multiagent systems, 2009, Multiagent Grid Syst.

[21] Andrea Vedaldi, et al. Vlfeat: an open and portable library of computer vision algorithms, 2010, ACM Multimedia.

[22] Hugh F. Durrant-Whyte, et al. A solution to the simultaneous localization and map building (SLAM) problem, 2001, IEEE Trans. Robotics Autom.

[23] Alfred O. Hero, et al. Sensor management using an active sensing approach, 2005, Signal Process.