Physical search problems with probabilistic knowledge

This paper considers the problem of an agent or a team of agents searching for a resource or tangible good in a physical environment, where the resource or good may possibly be obtained at one of several locations. The cost of acquiring the resource or good at a given location is uncertain (a priori), and the agents can observe the true cost only when physically arriving at this location. Sample applications include agents in exploration and patrol missions (e.g., an agent seeking to find the best location to deploy sensing equipment along its path). The uniqueness of these settings is in that the cost of observing a new location is determined by distance from the current one, impacting the consideration for the optimal search order. Although this model captures many real world scenarios, it has not been investigated so far. We analyze three variants of the problem, differing in their objective: minimizing the total expected cost, maximizing the success probability given an initial budget, and minimizing the budget necessary to obtain a given success probability. For each variant, we first introduce and analyze the problem with a single agent, either providing a polynomial solution to the problem or proving it is NP-complete. We also introduce a fully polynomial time approximation scheme algorithm for the minimum budget variant. In the multi-agent case, we analyze two models for managing resources, shared and private budget models. We present polynomial algorithms that work for any fixed number of agents, in the shared or private budget model. For non-communicating agents in the private budget model, we present a polynomial algorithm that is suitable for any number of agents. We also analyze the difference between homogeneous and heterogeneous agents, both with respect to their allotted resources and with respect to their capabilities. Finally, we define our problem in an environment with self-interested agents. We show how to find a Nash equilibrium in polynomial time, and prove that the bound on the performance of our algorithms, with respect to the social welfare, is tight.

[1]  George Papageorgiou,et al.  The Complexity of the Travelling Repairman Problem , 1986, RAIRO Theor. Informatics Appl..

[2]  Nikolaos V. Sahinidis,et al.  Heuristic Bounds and Test Problem Generation for the Time-Dependent Traveling Salesman Problem , 1995, Transp. Sci..

[3]  Joel W. Burdick,et al.  Multi-robot boundary coverage with plan revision , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[4]  Sarit Kraus,et al.  Physical Search Problems Applying Economic Search Models , 2008, AAAI.

[5]  Madhu Sudan,et al.  The minimum latency problem , 1994, STOC '94.

[6]  Satish Rao,et al.  The k-traveling repairman problem , 2003, SODA '03.

[7]  Noa Agmon,et al.  Multiagent Patrol Generalized to Complex Environmental Conditions , 2011, AAAI.

[8]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[9]  B. O. Koopman Search and Screening: General Principles and Historical Applications , 1980 .

[10]  Mihalis Yannakakis,et al.  Searching a Fixed Graph , 1996, ICALP.

[11]  S. Lippman,et al.  THE ECONOMICS OF JOB SEARCH: A SURVEY* , 1976 .

[12]  R. Bellman A Markovian Decision Process , 1957 .

[13]  Sanjeev Arora,et al.  A 2+epsilon approximation algorithm for the k-MST problem , 2000, SODA.

[14]  P. Hudson Search Games , 1982 .

[15]  J. Christopher Beck,et al.  Proactive Algorithms for Job Shop Scheduling with Probabilistic Durations , 2011, J. Artif. Intell. Res..

[16]  Alfredo García Olaverri,et al.  A note on the traveling repairman problem , 2002, Networks.

[17]  Lucio Bianco,et al.  The traveling salesman problem with cumulative costs , 1993, Networks.

[18]  Ariel Rubinstein,et al.  A Course in Game Theory , 1995 .

[19]  Sarit Kraus,et al.  Multi-robot perimeter patrol in adversarial settings , 2008, 2008 IEEE International Conference on Robotics and Automation.

[20]  Naveen Garg,et al.  Saving an epsilon: a 2-approximation for the k-MST problem in graphs , 2005, STOC '05.

[21]  Mark S. Daskin,et al.  The orienteering problem with stochastic profits , 2008 .

[22]  Sarit Kraus,et al.  Managing parallel inquiries in agents' two-sided search , 2008, Artif. Intell..

[23]  Steven Y. Goldsmith,et al.  Exhaustive Geographic Search with Mobile Robots Along Space-Filling Curves , 1998, CRW.

[24]  Elon Rimon,et al.  Spanning-tree based coverage of continuous areas by a mobile robot , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[25]  G. Kaminka,et al.  Frequency-Based Multi-Robot Fence Patrolling , 2008 .

[26]  Willy Herroelen,et al.  Project scheduling under uncertainty: Survey and research potentials , 2005, Eur. J. Oper. Res..

[27]  Matteo Fischetti,et al.  The Delivery Man Problem and Cumulative Matroids , 1993, Oper. Res..

[28]  Teofilo F. Gonzalez,et al.  P-Complete Problems and Approximate Solutions , 1974, SWAT.

[29]  Naveen Garg,et al.  A 3-approximation for the minimum tree spanning k vertices , 1996, Proceedings of 37th Conference on Foundations of Computer Science.

[30]  Douglas B. West,et al.  Extremal results and algorithms for degree sequences of graphs , 1993 .

[31]  Abilio Lucena,et al.  Time-dependent traveling salesman problem-the deliveryman case , 1990, Networks.

[32]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[33]  J. Christopher Beck,et al.  Job Shop Scheduling with Probabilistic Durations , 2004, ECAI.

[34]  Santosh S. Vempala,et al.  A Constant-Factor Approximation Algorithm for the k-MST Problem , 1999, J. Comput. Syst. Sci..

[35]  Satish Rao,et al.  Paths, trees, and minimum latency tours , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..

[36]  David Simchi-Levi,et al.  Minimizing the Total Flow Time of n Jobs on a Network , 1991 .

[37]  Santosh S. Vempala,et al.  Improved approximation guarantees for minimum-weight k-trees and prize-collecting salesmen , 1995, STOC '95.

[38]  Sven Koenig,et al.  Dynamic fringe-saving A* , 2009, AAMAS.

[39]  Noam Hazon,et al.  Redundancy, Efficiency and Robustness in Multi-Robot Coverage , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[40]  Steven A. Lippman,et al.  The Economics of Job Search: A Survey: Part I , 1976 .

[41]  Jeffrey O. Kephart,et al.  Shopbot Economics , 1999, AGENTS '99.

[42]  Jeffrey O. Kephart,et al.  Dynamic pricing by software agents , 2000, Comput. Networks.

[43]  John McMillan,et al.  Chapter 27 Search , 1994 .

[44]  D. Gibson,et al.  Redundancy , 1984 .

[45]  Santosh S. Vempala,et al.  New Approximation Guarantees for Minimum-Weight k-Trees and Prize-Collecting Salesmen , 1999, SIAM J. Comput..

[46]  Neil Immerman,et al.  The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.

[47]  Sanjeev Arora,et al.  A 2 + ɛ approximation algorithm for the k-MST problem , 2000, SODA '00.

[48]  David Sarne,et al.  Coordinated Exploration with a Shared Goal in Costly Environments , 2012, ECAI.

[49]  Anthony Stentz,et al.  The Focussed D* Algorithm for Real-Time Replanning , 1995, IJCAI.

[50]  Ian R. Webb,et al.  Depth-First Solutions for the Deliveryman Problem on Tree-Like Networks: An Evaluation Using a Permutation Model , 1996, Transp. Sci..

[51]  E. Minieka The delivery man problem on a tree network , 1990 .

[52]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[53]  Hoong Chuin Lau,et al.  Towards Finding Robust Execution Strategies for RCPSP/max with Durational Uncertainty , 2010, ICAPS.

[54]  M. Weitzman Optimal search for the best alternative , 1978 .

[55]  Sanjeev Arora,et al.  Approximation schemes for minimum latency problems , 1999, STOC '99.

[56]  Thomas S. Ferguson,et al.  Who Solved the Secretary Problem , 1989 .

[57]  Giorgio Ausiello,et al.  On Salesmen, Repairmen, Spiders, and Other Traveling Agents , 2000, CIAC.

[58]  Egon Balas,et al.  The prize collecting traveling salesman problem , 1989, Networks.

[59]  Jon M. Kleinberg,et al.  An improved approximation ratio for the minimum latency problem , 1996, SODA '96.

[60]  Metin Sitti,et al.  Design and Rolling Locomotion of a Magnetically Actuated Soft Capsule Endoscope , 2012, IEEE Transactions on Robotics.

[61]  Sarit Kraus,et al.  Collaborative Multi Agent Physical Search with Probabilistic Knowledge , 2009, IJCAI.

[62]  René Sitters,et al.  The Minimum Latency Problem Is NP-Hard for Weighted Trees , 2002, IPCO.

[63]  Yehuda Elmaliach,et al.  A realistic model of frequency-based multi-robot polyline patrolling , 2008, AAMAS.

[64]  Michel Gendreau,et al.  The orienteering problem with stochastic travel and service times , 2011, Ann. Oper. Res..

[65]  Sven Koenig,et al.  Fast replanning for navigation in unknown terrain , 2005, IEEE Transactions on Robotics.

[66]  T. Tsiligirides,et al.  Heuristic Methods Applied to Orienteering , 1984 .

[67]  S. Hart,et al.  Handbook of Game Theory with Economic Applications , 1992 .

[68]  Satish Rao,et al.  The k-traveling repairmen problem , 2007, ACM Trans. Algorithms.