Learning through Reinforcement for Repeated Power Control Game in Cognitive Radio Networks

This paper studies the repeated power control game in cognitive radio (CR) networks through reinforcement learning without channel and power strategy information exchange among CR users. Unlike traditional game-theoretical approaches on CR power control, this research solves the incomplete information power control problems for selfish and autonomous CR users for the first time. Each CR user in the problem only knows its own channel and power strategy while the information of primary users (PUs) and other different types of CR users are unknown. The formulated power control problem is a constrained repeated stochastic game with learning automaton. The objective of this repeated game is to maximize the average utility of each CR user under the interference power constraints of PUs. At each time step, the CR user only knows its own utility and the interference functions after the play but no further information. This power control game is proved to be asymptotically equivalent to the traditional game theory approaches. The properties of existence, diagonal concavity and uniqueness for this game are illustrated in detail. A Bush-Mosteller reinforcement learning procedure is designed for the power control algorithm. Finally, the learning based power control algorithm is implemented, and the simulation results with detailed analysis are shown to enforce the effectiveness of the proposed algorithms.

[1]  Mihaela van der Schaar,et al.  Decentralized Knowledge and Learning in Strategic Multi-user Communication , 2008, ArXiv.

[2]  Alexander S. Poznyak,et al.  Self-Learning Control of Finite Markov Chains , 2000 .

[3]  Ian F. Akyildiz,et al.  NeXt generation/dynamic spectrum access/cognitive radio wireless networks: A survey , 2006, Comput. Networks.

[4]  Bo Li,et al.  Non-cooperative power control for wireless ad hoc networks with repeated games , 2007, IEEE Journal on Selected Areas in Communications.

[5]  N. N. Vorobʹev,et al.  Foundations of Game Theory: Noncooperative Games , 1994 .

[6]  L. Hurwicz,et al.  Constraint Qualifications in Maximization Problems , 1961 .

[7]  Kaddour Najim,et al.  Learning automata and stochastic optimization , 1997 .

[8]  Wei Yuan,et al.  Joint Power and Rate Control in Cognitive Radio Networks: A Game-Theoretical Approach , 2008, 2008 IEEE International Conference on Communications.

[9]  R. Chandramouli,et al.  Stochastic Learning Solution for Distributed Discrete Power Control Game in Wireless Data Networks , 2008, IEEE/ACM Transactions on Networking.

[10]  J. Aubin Mathematical methods of game and economic theory , 1979 .

[11]  馬場 則夫 New topics in learning automata theory and applications , 1984 .

[12]  Dirk Ifenthaler,et al.  Stochastic Models of Learning , 2012 .

[13]  Kaddour Najim,et al.  Learning Automata: Theory and Applications , 1994 .

[14]  J. Goodman Note on Existence and Uniqueness of Equilibrium Points for Concave N-Person Games , 1965 .