论文信息 - Efficient Dynamic Allocation with Uncertain Valuations

Efficient Dynamic Allocation with Uncertain Valuations

In this paper we consider the problem of efficiently allocating a given resource or object repeatedly over time. The agents, who may temporarily receive access to the resource, learn more about its value through its use. When the agents' beliefs about their valuations at any given time are public information, this problem reduces to the classic multi-armed bandit problem, the solution to which is obtained by determining a Gittins index for every agent. In the setting we study, agents observe their valuations privately, and the efficient dynamic resource allocation problem under asymmetric information becomes a problem of truthfully eliciting every agent's Gittins index. We introduce two bounding mechanisms, under which agents announce types corresponding to Gittins indices either at least as high or at most as high as their true Gittins indices. Using an announcement-contingent affine combination of the bounding mechanisms it is possible to implement the efficient dynamic allocation policy. We provide necessary and sufficient conditions for global Bayesian incentive compatibility, guaranteeing a truthful efficient allocation of the resource. Using essentially the same method it is possible to approximately implement truthful mechanisms corresponding to a large variety of surplus distribution objectives the principal might have, for instance a dynamic second-price Gittins index auction, which maximizes the principal's revenue subject to implementing an efficient allocation policy.

Thomas A. Weber | A. Bapna

[1] D. Hilbert. Ueber die stetige Abbildung einer Line auf ein Flächenstück , 1891 .

[2] W. R. Thompson. ON THE LIKELIHOOD THAT ONE UNKNOWN PROBABILITY EXCEEDS ANOTHER IN VIEW OF THE EVIDENCE OF TWO SAMPLES , 1933 .

[3] W. R. Thompson. On the Theory of Apportionment , 1935 .

[4] H Robbins,et al. A SEQUENTIAL DECISION PROBLEM WITH A FINITE MEMORY. , 1956, Proceedings of the National Academy of Sciences of the United States of America.

[5] R. N. Bradt,et al. On Sequential Designs for Maximizing the Sum of $n$ Observations , 1956 .

[6] N. G. Parke,et al. Ordinary Differential Equations. , 1958 .

[7] H. Raiffa,et al. Applied Statistical Decision Theory. , 1961 .

[8] William Vickrey,et al. Counterspeculation, Auctions, And Competitive Sealed Tenders , 1961 .

[9] Richard A. Silverman,et al. Ordinary Differential Equations , 1968, The Mathematical Gazette.

[10] M. Degroot. Optimal Statistical Decisions , 1970 .

[11] E. H. Clarke. Multipart pricing of public goods , 1971 .

[12] Theodore Groves,et al. Incentives in Teams , 1973 .

[13] A. Gibbard. Manipulation of Voting Schemes: A General Result , 1973 .

[14] Gideon Weiss,et al. Multiple feedback at a single server station , 1975, Advances in Applied Probability.

[15] C. d'Aspremont,et al. Incentives and incomplete information , 1979 .

[16] K. Arrow. The Property Rights Doctrine and Demand Revelation under Incomplete Information**This work was supported by National Science Foundation under Grant No. SOC75-21820 at the Institute for Mathematical Studies in the Social Sciences, Stanford University. , 1979 .