论文信息 - Cooperative Games in a Stochastic Environment

Cooperative Games in a Stochastic Environment

We introduce a very complex game based on an approximate solution of a NP-hard problem, so that the probability of victory grows monotonically, but of an unknown amount, with the resources each player employs. We formulate this model in the computational learning framework and focus on the problem of computing a confidence interval for the losing probability. We deal with the problem of reducing the width of this interval under a given threshold in both batch and on-line modality. While the former leads to a feasible polynomial complexity, the on-line learning strategy may get stuck in an indeterminacy trap: the more we play the game the broader becomes the confidence interval. In order to avoid this indeterminacy we organise in a better way the knowledge, introducing the notion of virtual game to achieve the goal efficiently. Then we extend the one-player to a team mode game. Namely, we improve the success of a team by redistributing the resources among the players and exploiting their mutual cooperation to treat the indeterminacy phenomenon suitably.

Bruno Apolloni | Simone Bassis | Dario Malchiodi | Sabrina Gaito

[1] Sartaj Sahni,et al. Approximate Algorithms for the 0/1 Knapsack Problem , 1975, JACM.

[2] Donald Fraser,et al. Nonparametric Estimation IV , 1951 .

[3] J. Nash,et al. NON-COOPERATIVE GAMES , 1951, Classics in Game Theory.

[4] M. A. Girshick,et al. Theory of games and statistical decisions , 1955 .

[5] Shai Ben-David,et al. Online Learning versus Offline Learning , 1995, Machine Learning.

[6] Bruno Apolloni,et al. PAC Learning of Concept Classes Through the Boundaries of Their Items , 1997, Theor. Comput. Sci..

[7] William J. Cook,et al. Combinatorial optimization , 1997 .

[8] Bruno Apolloni,et al. The Statistical Bases of Learning , 2002 .

[9] N. Littlestone. Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[10] D. Angluin. Queries and Concept Learning , 1988 .

[11] J. Tukey. Non-Parametric Estimation II. Statistically Equivalent Blocks and Tolerance Regions--The Continuous Case , 1947 .

[12] Bruno Apolloni,et al. From synapses to rules , 2002, Cognitive Systems Research.

[13] Sartaj Sahni. Some Related Problems from Network Flows, Game Theory and Integer Programming , 1972, SWAT.