Optimal Bidding in Repeated Wireless Spectrum Auctions with Budget Constraints

Small operators who take part in secondary wireless spectrum markets typically have strict budget limits. In this paper, we study the bidding problem of a budget constrained operator in repeated secondary spectrum auctions. In existing truthful auctions, truthful bidding is the optimal strategy of a bidder. However, budget limits impact bidding behaviors and make bidding decisions complicated, since bidders may behave differently to avoid running out of money. We formulate the problem as a dynamic auction game between operators, where knowledge of other operators is limited due to the distributed nature of wireless networks/markets. We first present a Markov Decision Process (MDP) formulation of the problem and characterize the optimal bidding strategy of an operator, provided that opponents' bids are i.i.d. Next, we generalize the formulation to a Markov game that, in conjunction with model-free reinforcement learning approaches, enables an operator to make inferences about its opponents based on local observations. Finally, we present a fully distributed learning-based bidding algorithm which relies only on local information. Our numerical results show that our proposed learning-based bidding results in a better utility than truthful bidding.

[1]  Zhu Han,et al.  Dynamics of Multiple-Seller and Multiple-Buyer Spectrum Trading in Cognitive Radio Networks: A Game-Theoretic Modeling Approach , 2009, IEEE Transactions on Mobile Computing.

[2]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[3]  Michael L. Littman,et al.  Value-function reinforcement learning in Markov games , 2001, Cognitive Systems Research.

[4]  Mihaela van der Schaar,et al.  Learning to Compete for Resources in Wireless Stochastic Games , 2009, IEEE Transactions on Vehicular Technology.

[5]  Dusit Niyato,et al.  Competitive Pricing for Spectrum Sharing in Cognitive Radio Networks: Dynamic Game, Inefficiency of Nash Equilibrium, and Collusion , 2008, IEEE Journal on Selected Areas in Communications.

[6]  Tommi S. Jaakkola,et al.  Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms , 2000, Machine Learning.

[7]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[8]  Tim Roughgarden,et al.  Algorithmic Game Theory , 2007 .

[9]  Alhussein A. Abouzeid,et al.  Dynamic Spectrum Sharing Auction With Time-Evolving Channel Qualities , 2015, IEEE Transactions on Wireless Communications.

[10]  Jon Feldman,et al.  Budget optimization in search-based advertising auctions , 2006, EC '07.

[11]  Alhussein A. Abouzeid,et al.  Auction-based spectrum sharing in cognitive radio networks with heterogeneous channels , 2013, 2013 Information Theory and Applications Workshop (ITA).

[12]  Martin Hoefer,et al.  Secondary spectrum auctions for symmetric and submodular bidders , 2011, EC '12.

[13]  Michael L. Honig,et al.  Sequential Bandwidth and Power Auctions for Distributed Spectrum Sharing , 2008, IEEE Journal on Selected Areas in Communications.

[14]  Anton Schwaighofer,et al.  Budget Optimization for Sponsored Search: Censored Learning in MDPs , 2012, UAI.

[15]  P. Cramton The FCC Spectrum Auctions: An Early Assessment , 1997 .

[16]  Jianwei Huang,et al.  Competition with Dynamic Spectrum Leasing , 2010, 2010 IEEE Symposium on New Frontiers in Dynamic Spectrum (DySPAN).

[17]  Alexandre Proutière,et al.  Optimal bidding strategies in dynamic auctions with budget constraints , 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[18]  Kang G. Shin,et al.  Exploiting Spectrum Heterogeneity in Dynamic Spectrum Market , 2012, IEEE Transactions on Mobile Computing.

[19]  Dusit Niyato,et al.  Distributed resource allocation in wireless networks under uncertainty and application of Bayesian game , 2011, IEEE Communications Magazine.

[20]  Nicole Immorlica,et al.  Dynamics of bid optimization in online advertisement auctions , 2007, WWW '07.

[21]  K. J. Ray Liu,et al.  Multi-Stage Pricing Game for Collusion-Resistant Dynamic Spectrum Allocation , 2008, IEEE Journal on Selected Areas in Communications.

[22]  Rong Zheng,et al.  Repeated Auctions with Bayesian Nonparametric Learning for Spectrum Access in Cognitive Radio Networks , 2011, IEEE Transactions on Wireless Communications.