Collective Intelligence

Many systems of self-interested agents have an associated performance criterion that rates the dynamic behavior of the overall system. This paper presents an introduction to the science of such systems. Formally, this paper concerns collectives, which are defined as any system having the following two characteristics: First, the system must contain one or more agents, each of which we view as trying to maximize an associated private utility. Second, the system must have an associated world utility function that rates the possible behaviors of that overall system [38, 39, 40, 37, 28]. In practice, collectives are often very large, distributed, and support little, if any, centralized communication and control, although those characteristics are not part of their formal definition.
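The two-part definition above can be sketched as a small data structure. This is a minimal illustration, not the paper's formalism: the names `Agent`, `Collective`, `best_response`, `alone_on_resource`, and `resources_in_use` are hypothetical, and the toy utilities (two agents choosing between two resources) are assumptions chosen only to show a private utility and a world utility side by side.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence

JointAction = Dict[str, int]  # maps agent name -> chosen action


@dataclass
class Agent:
    """An agent viewed as trying to maximize its own private utility."""
    name: str
    actions: Sequence[int]
    private_utility: Callable[[JointAction], float]

    def best_response(self, joint: JointAction) -> int:
        # Pick the action that maximizes this agent's private utility,
        # holding every other agent's action fixed.
        return max(
            self.actions,
            key=lambda a: self.private_utility({**joint, self.name: a}),
        )


@dataclass
class Collective:
    """Agents plus a world utility that rates the whole system's behavior."""
    agents: List[Agent]
    world_utility: Callable[[JointAction], float]

    def rate(self, joint: JointAction) -> float:
        return self.world_utility(joint)


def alone_on_resource(name: str) -> Callable[[JointAction], float]:
    # Hypothetical private utility: 1.0 if no other agent shares the
    # resource this agent chose, otherwise 0.0.
    def utility(joint: JointAction) -> float:
        sharers = sum(1 for choice in joint.values() if choice == joint[name])
        return 1.0 if sharers == 1 else 0.0
    return utility


def resources_in_use(joint: JointAction) -> float:
    # Hypothetical world utility: number of distinct resources in use.
    return float(len(set(joint.values())))


agents = [Agent(n, (0, 1), alone_on_resource(n)) for n in ("a", "b")]
collective = Collective(agents, resources_in_use)

joint = {"a": 0, "b": 0}                      # both agents start on resource 0
before = collective.rate(joint)               # world utility before any move
joint["a"] = agents[0].best_response(joint)   # agent "a" acts selfishly
after = collective.rate(joint)                # world utility after the move
```

In this toy case the private and world utilities happen to be aligned, so a selfish move by one agent also raises the world utility. Nothing in the definition guarantees that alignment in general.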

David H. Wolpert

[1] Kagan Tumer, et al. Reinforcement Learning in Distributed Domains: Beyond Team Games, 2001, IJCAI.

[2] Michael P. Wellman. A Market-Oriented Programming Environment and its Application to Distributed Multicommodity Flow Problems, 1993, J. Artif. Intell. Res.

[3] Kagan Tumer, et al. Optimal Payoff Functions for Members of Collectives, 2001, Adv. Complex Syst.

[4] Onn Shehory, et al. Anytime Coalition Structure Generation with Worst Case Guarantees, 1998, AAAI/IAAI.

[5] G. Hardin, et al. The Tragedy of the Commons, 1968, Green Planet Blues.

[6] Yicheng Zhang, et al. On the minority game: Analytical and numerical studies, 1998, cond-mat/9805084.

[7] Craig Boutilier, et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, 1998, AAAI/IAAI.

[8] Michael L. Littman, et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning, 1994, ICML.

[9] Peter Dayan, et al. Q-learning, 1992, Machine Learning.

[10] M. Marsili, et al. A Prototype Model of Stock Exchange, 1997, cond-mat/9709118.

[11] Kagan Tumer, et al. Using Collective Intelligence to Route Internet Traffic, 1998, NIPS.

[12] Kagan Tumer, et al. General principles of learning-based multi-agent systems, 1999, AGENTS '99.

[13] Kagan Tumer, et al. A Survey of Collective Intelligence, 2013.

[14] Ron Lavi, et al. Algorithmic Mechanism Design, 2008, Encyclopedia of Algorithms.

[15] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.

[16] Kagan Tumer, et al. Collective Intelligence for Control of Distributed Dynamical Systems, 1999, ArXiv.

[17] Nicholas R. Jennings, et al. A Roadmap of Agent Research and Development, 2004, Autonomous Agents and Multi-Agent Systems.

[18] Craig Boutilier. Multiagent Systems: Challenges and Opportunities for Decision-Theoretic Planning, 1999, AI Mag.

[19] Andrew G. Barto, et al. Improving Elevator Performance Using Reinforcement Learning, 1995, NIPS.

[20] Y. Shoham, et al. Editorial: economic principles of multi-agent systems, 1997.

[21] David C. Parkes, et al. Iterative Combinatorial Auctions: Theory and Practice, 2000, AAAI/IAAI.

[22] Kagan Tumer, et al. An Introduction to Collective Intelligence, 1999, ArXiv.

[23] L. Shapley, et al. Potential Games, 1994.

[24] S. Griffis. EDITOR, 1997, Journal of Navigation.

[25] Kagan Tumer, et al. Learning sequences of actions in collectives of autonomous agents, 2002, AAMAS '02.

[26] David H. Wolpert, et al. Designing agent collectives for systems with markovian dynamics, 2002, AAMAS '02.

[27] Kagan Tumer, et al. Improving Simulated Annealing by Recasting it as a Non-Cooperative Game, 2001.

[28] M. Dufwenberg. Game theory, 2011, Wiley interdisciplinary reviews. Cognitive science.

[29] Michael P. Wellman, et al. Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm, 1998, ICML.

[30] W. Arthur. Complexity in economic theory: inductive reasoning and bounded rationality, 1994.

[31] P. M. Hui, et al. Volatility and agent adaptability in a self-organizing market, 1998, cond-mat/9802177.

[32] A. Mas-Colell, et al. Microeconomic Theory, 1995.

[33] Richard S. Sutton, et al. Learning to predict by the methods of temporal differences, 1988, Machine Learning.

[34] Robert H. Crites, et al. Multiagent reinforcement learning in the Iterated Prisoner's Dilemma, 1996, Bio Systems.

[35] Michael R. Genesereth, et al. Software agents, 1994, CACM.

[36] Andrew W. Moore, et al. Reinforcement Learning: A Survey, 1996, J. Artif. Intell. Res.

[37] Kagan Tumer, et al. Collective Intelligence and Braess' Paradox, 2000, AAAI/IAAI.

[38] Yicheng Zhang. Modeling Market Mechanism with Evolutionary Games, 1998, cond-mat/9803308.

[39] Agostino Poggi, et al. Multiagent Systems, 2006, Intelligenza Artificiale.

[40] Kagan Tumer, et al. Collective Intelligence, Data Routing and Braess' Paradox, 2002, J. Artif. Intell. Res.