论文信息 - Balancing exploration and exploitation in incomplete Min/Max-sum inference for distributed constraint optimization - 字舞流文

Balancing exploration and exploitation in incomplete Min/Max-sum inference for distributed constraint optimization

Distributed Constraint Optimization Problems (DCOPs) are NP-hard and therefore the number of studies that consider incomplete algorithms for solving them is growing. Specifically, the Max-sum algorithm has drawn attention in recent years and has been applied to a number of realistic applications. Unfortunately, in many cases Max-sum does not produce high-quality solutions. More specifically, Max-sum does not converge and explores solutions of low quality when run on problems whose constraint graph representation contains multiple cycles of different sizes. In this paper we advance the state-of-the-art in incomplete algorithms for DCOPs by: (1) proposing a version of the Max-sum algorithm that operates on an alternating directed acyclic graph (Max-sum_AD), which guarantees convergence in linear time; (2) solving a major weakness of Max-sum and Max-sum_AD that causes inconsistent costs/utilities to be propagated and affect the assignment selection, by introducing value propagation to Max-sum_AD (Max-sum_ADVP); and (3) proposing exploration heuristic methods that evidently improve the algorithms performance further. We prove that Max-sum_ADVP converges to monotonically improving states after each change of direction, and that it is guaranteed to converge in pseudo-polynomial time to a stable solution that does not change with further changes of direction. Our empirical study reveals a large improvement in the quality of the solutions produced by Max-sum_ADVP on various benchmarks, compared to the solutions produced by the standard Max-sum algorithm, Bounded Max-sum and Max-sum_AD with no value propagation. It is found to be the best guaranteed convergence inference algorithm for DCOPs. The exploration methods we propose for Max-sum_ADVP improve its performance further. However, anytime results demonstrate that their exploration level is not as efficient as a version of Max-sum, which uses Damping.

Steven Okamoto | Roie Zivan | Liel Cohen | Tomer Parash | Hilla Peled | Roie Zivan | Hilla Peled | Steven Okamoto | Liel Cohen | Tomer Parash

[1] Weixiong Zhang,et al. Distributed stochastic search and distributed breakout: properties, comparison and applications to constraint optimization problems in sensor networks , 2005, Artif. Intell..

[2] Robert J. McEliece,et al. The generalized distributive law , 2000, IEEE Trans. Inf. Theory.

[3] Steven Okamoto,et al. Distributed constraint optimization for teams of mobile sensing agents , 2014, Autonomous Agents and Multi-Agent Systems.

[4] Meritxell Vinyals,et al. Divide-and-coordinate: DCOPs by agreement , 2010, AAMAS.

[5] Roger Mailler,et al. Getting What You Pay For: Is Exploration in Distributed Hill Climbing Really Worth it? , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[6] Nicholas R. Jennings,et al. Decentralised coordination of low-power embedded devices using the max-sum algorithm , 2008, AAMAS.

[7] Amnon Meisels,et al. Asynchronous Forward Bounding for Distributed COPs , 2014, J. Artif. Intell. Res..

[8] Meritxell Vinyals,et al. Constructing a unifying theory of dynamic programming DCOP algorithms via the generalized distributive law , 2010, Autonomous Agents and Multi-Agent Systems.

[9] Makoto Yokoo,et al. An approach to over-constrained distributed constraint satisfaction problems: distributed hierarchical constraint satisfaction , 2000, Proceedings Fourth International Conference on MultiAgent Systems.

[10] Milind Tambe,et al. Quality guarantees for region optimal DCOP algorithms , 2011, AAMAS.

[11] Thomas Schiex,et al. Solving weighted CSP by maintaining arc consistency , 2004, Artif. Intell..

[12] Amnon Meisels. Asynchronous Forward-Bounding , 2008 .

[13] Makoto Yokoo,et al. Pseudo-Tree-Based Incomplete Algorithm for Distributed Constraint Optimization with Quality Bounds , 2011, CP.

[14] Pedro Meseguer,et al. Improving DPOP with function filtering , 2010, AAMAS.

[15] Carmel Domshlak,et al. Sensor networks and distributed CSP: communication, computation and complexity , 2005, Artif. Intell..

[16] Milind Tambe,et al. Distributed Algorithms for DCOP: A Graphical-Game-Based Approach , 2004, PDCS.

[17] Roie Zivan,et al. Max/min-sum distributed constraint optimization through value propagation on an alternating DAG , 2012, AAMAS.

[18] X. Jin. Factor graphs and the Sum-Product Algorithm , 2002 .

[19] Milind Tambe,et al. Quality Guarantees on k-Optimal Solutions for Distributed Constraint Optimization Problems , 2007, IJCAI.

[20] Yair Weiss,et al. Linear Programming Relaxations and Belief Propagation - An Empirical Study , 2006, J. Mach. Learn. Res..

[21] Nicholas R. Jennings,et al. Decentralised coordination of continuously valued control parameters using the max-sum algorithm , 2009, AAMAS.

[22] Milind Tambe,et al. Taking DCOP to the real world: efficient complete solutions for distributed multi-event scheduling , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[23] Amnon Meisels,et al. Concurrent forward bounding for distributed constraint optimization problems , 2012, Artif. Intell..

[24] Tamir Hazan,et al. Norm-Product Belief Propagation: Primal-Dual Message-Passing for Approximate Inference , 2009, IEEE Transactions on Information Theory.

[25] Steven Okamoto,et al. Explorative anytime local search for distributed constraint optimization , 2014, Artif. Intell..

[26] Makoto Yokoo,et al. Adopt: asynchronous distributed constraint optimization with quality guarantees , 2005, Artif. Intell..

[27] Javier Larrosa,et al. Improved Bounded Max-Sum for Distributed Constraint Optimization , 2012, CP.

[28] Amnon Meisels,et al. Distributed constraint satisfaction with partially known constraints , 2009, Constraints.

[29] Tommi S. Jaakkola,et al. Fixing Max-Product: Convergent Message Passing Algorithms for MAP LP-Relaxations , 2007, NIPS.

[30] Sven Koenig,et al. BnB-ADOPT: an asynchronous branch-and-bound DCOP algorithm , 2008, AAMAS.

[31] Katsutoshi Hirayama,et al. DeQED: an efficient divide-and-coordinate algorithm for DCOP , 2013, AAMAS.

[32] Milind Tambe,et al. Asynchronous algorithms for approximate distributed constraint optimization with quality bounds , 2010, AAMAS.

[33] Nicholas R. Jennings,et al. Max-Sum Decentralised Coordination for Sensor Systems (Demo Paper) , 2008 .

[34] Nicholas R. Jennings,et al. Bounded approximate decentralised coordination via the max-sum algorithm , 2009, Artif. Intell..

[35] Sarvapali D. Ramchurn,et al. Decentralized Coordination in RoboCup Rescue , 2010, Comput. J..

[36] Boi Faltings,et al. A Scalable Method for Multiagent Constraint Optimization , 2005, IJCAI.

[37] Tommi S. Jaakkola,et al. Tightening LP Relaxations for MAP using Message Passing , 2008, UAI.

[38] Brendan J. Frey,et al. Solving the Uncapacitated Facility Location Problem Using Message Passing Algorithms , 2010, AISTATS.

[39] Subhash Khot. On the power of unique 2-prover 1-round games , 2002, STOC '02.

[40] Amnon Meisels,et al. Scheduling Meetings by Agents , 2008 .

[41] Rina Dechter,et al. Bucket Elimination: A Unifying Framework for Reasoning , 1999, Artif. Intell..

[42] Javier Larrosa,et al. Intelligent variable orderings and re-orderings in DAC-based solvers for WCSP , 2006, J. Heuristics.

[43] Boi Faltings,et al. Approximations in Distributed Optimization , 2005, CP.

[44] C. Reeves. Modern heuristic techniques for combinatorial problems , 1993 .