论文信息 - Preallocation and Planning Under Stochastic Resource Constraints

Preallocation and Planning Under Stochastic Resource Constraints

Resource constraints frequently complicate multi-agent planning problems. Existing algorithms for resource-constrained, multi-agent planning problems rely on the assumption that the constraints are deterministic. However, frequently resource constraints are themselves subject to uncertainty from external influences. Uncertainty about constraints is especially challenging when agents must execute in an environment where communication is unreliable, making on-line coordination difficult. In those cases, it is a significant challenge to find coordinated allocations at plan time depending on availability at run time. To address these limitations, we propose to extend algorithms for constrained multi-agent planning problems to handle stochastic resource constraints. We show how to factorize resource limit uncertainty and use this to develop novel algorithms to plan policies for stochastic constraints. We evaluate the algorithms on a search-and-rescue problem and on a power-constrained planning domain where the resource constraints are decided by nature. We show that plans taking into account all potential realizations of the constraint obtain significantly better utility than planning for the expectation, while causing fewer constraint violations.

[1] Mathijs de Weerdt,et al. Best-Response Planning of Thermostatically Controlled Loads under Power Constraints , 2015, AAAI.

[2] Nicholas R. Jennings,et al. Intention-aware routing to minimise delays at electric vehicle charging stations: the research related to this demonstration has been published at IJCAI 2013 [1] , 2013, AIIP '13.

[3] Claudia V. Goldman,et al. Solving Transition Independent Decentralized Markov Decision Processes , 2004, J. Artif. Intell. Res..

[4] Stephen F. Smith,et al. Scheduling with Uncertain Resources: Search for a Near-Optimal Solution , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[5] Edmund H. Durfee,et al. Resource-Driven Mission-Phasing Techniques for Constrained Agents in Stochastic Environments , 2010, J. Artif. Intell. Res..

[6] Craig Boutilier,et al. Planning, Learning and Coordination in Multiagent Decision Processes , 1996, TARK.

[7] E. Altman. Constrained Markov Decision Processes , 1999 .

[8] Ronen I. Brafman,et al. Planning with Continuous Resources in Stochastic Domains , 2005, IJCAI.

[9] A. Testa,et al. Very short-term probabilistic wind power forecasting based on Markov chain models , 2010, 2010 IEEE 11th International Conference on Probabilistic Methods Applied to Power Systems.

[10] Robert Fitch,et al. Probabilistic Temporal Logic for Motion Planning with Resource Threshold Constraints , 2012, Robotics: Science and Systems.

[11] Daniel Adelman,et al. Relaxations of Weakly Coupled Stochastic Dynamic Programs , 2008, Oper. Res..

[12] Nikos A. Vlassis,et al. Multiagent Planning Under Uncertainty with Stochastic Communication Delays , 2008, ICAPS.

[13] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[14] R. Bellman. A Markovian Decision Process , 1957 .

[15] Steve A. Chien,et al. Probabilistic Reasoning for Plan Robustness , 2005, IJCAI.

[16] Frans A. Oliehoek,et al. Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication , 2012, AAAI.

[17] Pradeep Varakantham,et al. Scalable Greedy Algorithms for Task/Resource Constrained Multi-Agent Stochastic Planning , 2016, IJCAI.

[18] Mathijs de Weerdt,et al. Bounding the Probability of Resource Constraint Violations in Multi-Agent MDPs , 2017, AAAI.

[19] Patrick Jaillet,et al. Decentralized Stochastic Planning with Anonymity in Interactions , 2014, AAAI.

[20] Makoto Yokoo,et al. Communications for improving policy computation in distributed POMDPs , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[21] Leslie Pack Kaelbling,et al. Influence-Based Abstraction for Multiagent Systems , 2012, AAAI.

[22] Kee-Eung Kim,et al. Solving Very Large Weakly Coupled Markov Decision Processes , 1998, AAAI/IAAI.

[23] Hoong Chuin Lau,et al. Lagrangian Relaxation for Large-Scale Multi-agent Planning , 2012, 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[24] G. Papaefthymiou,et al. Probabilistic tools for planning and operating power systems with distributed energy storage , 2008, Elektrotech. Informationstechnik.