论文信息 - Every Team Deserves a Second Chance: Identifying When Things Go Wrong (Student Abstract Version) - 字舞流文

Every Team Deserves a Second Chance: Identifying When Things Go Wrong (Student Abstract Version)

Voting among different agents is a powerful tool in problem solving, and it has been widely applied to improve the performance in finding the correct answer to complex problems. We present a novel benefit of voting, that has not been observed before: we can use the voting patterns to assess the performance of a team and predict their final outcome. This prediction can be executed at any moment during problem-solving and it is completely domain independent. We present a theoretical explanation of why our prediction method works. Further, contrary to what would be expected based on a simpler explanation using classical voting models, we argue that we can make accurate predictions irrespective of the strength (i.e., performance) of the teams, and that in fact, the prediction can work better for diverse teams composed of different agents than uniform teams made of copies of the best agent. We perform experiments in the Computer Go domain, where we obtain a high accuracy in predicting the final outcome of the games. We analyze the prediction accuracy for three different teams with different levels of diversity and strength, and we show that the prediction works significantly better for a diverse team. Since our approach is domain independent, it can be easily applied to a variety of domains.

Leandro Soriano Marcolino | Milind Tambe | Vaishnavh Nagarajan

[1] Gal A. Kaminka. Handling Coordination Failures in Large-Scale Multi-Agent Systems , 2006 .

[2] Milind Tambe,et al. Automated assistants to aid humans in understanding team behaviors , 2000, AGENTS '00.

[3] Olivier Teytaud,et al. Modification of UCT with Patterns in Monte-Carlo Go , 2006 .

[4] Ariel D. Procaccia,et al. When do noisy votes reveal the truth? , 2013, EC '13.

[5] Cha Zhang,et al. Ensemble Machine Learning: Methods and Applications , 2012 .

[6] Pieter Spronck,et al. Opponent Modeling in Real-Time Strategy Games , 2007, GAMEON.

[7] Petr Baudis,et al. PACHI: State of the Art Open Source Go Program , 2011, ACG.

[8] Meir Kalech,et al. COORDINATION DIAGNOSTIC ALGORITHMS FOR TEAMS OF SITUATED AGENTS: SCALING UP , 2011, Comput. Intell..

[9] Meir Kalech,et al. A hybrid approach for fault detection in autonomous physical agents , 2014, AAMAS.

[10] Noa Agmon,et al. Effective, Quantitative, Obscured Observation-Based Fault Detection in Multi-Agent Systems (Extended Abstract) , 2014 .

[11] H. Jaap van den Herik,et al. Opponent modelling for case-based adaptive game AI , 2009, Entertain. Comput..

[12] Pedro U. Lima,et al. Abnormality detection in multiagent systems inspired by the adaptive immune system , 2013, AAMAS.

[13] Alessio Lomuscio,et al. Automatic verification of parameterised multi-agent systems , 2013, AAMAS.

[14] Takeshi Ito,et al. Consultation Algorithm for Computer Shogi: Move Decisions by Majority , 2010, Computers and Games.

[15] Muhammad Khusairi Osman,et al. Weather Forecasting Using Photovoltaic System and Neural Network , 2010, 2010 2nd International Conference on Computational Intelligence, Communication Systems and Networks.

[16] Martin Müller,et al. Fuego—An Open-Source Framework for Board Games and Go Engine Based on Monte Carlo Tree Search , 2010, IEEE Transactions on Computational Intelligence and AI in Games.

[17] Alessio Lomuscio,et al. Automatic Verification of Parameterised Interleaved Multi-Agent Systems , 2013, ArXiv.

[18] S. Legg. Machine super intelligence , 2008 .

[19] Vincent Conitzer,et al. Common Voting Rules as Maximum Likelihood Estimators , 2005, UAI.

[20] Matjaz Gams,et al. Discovering Strategic Behaviour of Multi-Agent Systems in Adversary Settings , 2014, Comput. Informatics.

[21] Risto Miikkulainen,et al. Evolving explicit opponent models in game playing , 2007, GECCO '07.

[22] Matthew E. Taylor,et al. Teaching on a budget: agents advising agents in reinforcement learning , 2013, AAMAS.

[23] Leandro Soriano Marcolino,et al. Multi-Agent Team Formation: Diversity Beats Strength? , 2013, IJCAI.

[24] Meir Kalech,et al. On the design of coordination diagnosis algorithms for teams of situated agents , 2007, Artif. Intell..

[25] Franco Raimondi,et al. A synergistic and extensible framework for multi-agent system verification , 2013, AAMAS.

[26] Leandro Soriano Marcolino,et al. Give a Hard Problem to a Diverse Team: Exploring Large Action Spaces , 2014, AAAI.

[27] Marcus Hutter,et al. Universal Artificial Intelligence: Sequential Decisions Based on Algorithmic Probability (Texts in Theoretical Computer Science. An EATCS Series) , 2006 .

[28] Michael R. Genesereth,et al. General Game Playing: Overview of the AAAI Competition , 2005, AI Mag..

[29] Tuomas Sandholm,et al. Game theory-based opponent modeling in large imperfect-information games , 2011, AAMAS.

[30] Fernando Ramos,et al. Discovering tactical behavior patterns supported by topological structures in soccer agent domains , 2008, AAMAS.

[31] Gjergji Kasneci,et al. Crowd IQ: aggregating opinions to boost performance , 2012, AAMAS.

[32] Mehdi Dastani,et al. Monitoring norm violations in multi-agent systems , 2013, AAMAS.

[33] Ariel D. Procaccia,et al. Better Human Computation Through Principled Voting , 2013, AAAI.

[34] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents (Extended Abstract) , 2012, IJCAI.

[35] Michael H. Bowling,et al. Bayes' Bluff: Opponent Modelling in Poker , 2005, UAI 2005.

[36] Doan Thu Trang,et al. Verifying heterogeneous multi-agent programs , 2014, AAMAS.

[37] C. List,et al. Epistemic democracy : generalizing the Condorcet jury theorem , 2001 .

[38] Leandro Soriano Marcolino,et al. Diverse Randomized Agents Vote to Win , 2014, NIPS.

[39] Milind Tambe,et al. What Is Wrong With Us? Improving Robustness Through Social Diagnosis , 1998, AAAI/IAAI.

[40] Victor R. Lesser,et al. Coordinating multi-agent reinforcement learning with limited communication , 2013, AAMAS.