Distributed channel assignment for network MIMO: game-theoretic formulation and stochastic learning

Abstract The cooperative frequency reuse among base stations (BSs) can improve the system spectral efficiency by reducing the intercell interference through channel assignment and precoding. This paper presents a game-theoretic study of channel assignment for realizing network multiple-input multiple-output (MIMO) operation under time-varying wireless channel. We propose a new joint precoding scheme that carries enhanced interference mitigation and capacity improvement abilities for network MIMO systems. We formulate the channel assignment problem from a game-theoretic perspective with BSs as the players, and show that our game is an exact potential game given the proposed utility function. A distributed, stochastic learning-based algorithm is proposed where each BS progressively moves toward the Nash equilibrium (NE) strategy based on its own action-reward history only. The convergence properties of the proposed learning algorithm toward an NE point are theoretically and numerically verified for different network topologies. The proposed learning algorithm also demonstrates an improved capacity and fairness performance as compared to other schemes through extensive link-level simulations.

[1]  Jie Zhang,et al.  OFDMA femtocells: A roadmap on interference avoidance , 2009, IEEE Communications Magazine.

[2]  L. Shapley,et al.  REGULAR ARTICLEPotential Games , 1996 .

[3]  V. Borkar Stochastic approximation with two time scales , 1997 .

[4]  Roberto Cominetti,et al.  Author's Personal Copy Games and Economic Behavior a Payoff-based Learning Procedure and Its Application to Traffic Games , 2022 .

[5]  M. Benaïm Dynamics of stochastic approximation algorithms , 1999 .

[6]  P. Billingsley,et al.  Probability and Measure , 1980 .

[7]  David Gesbert,et al.  Distributed Beamforming Coordination in Multicell MIMO Channels , 2009, VTC Spring 2009 - IEEE 69th Vehicular Technology Conference.

[8]  Lars Thiele,et al.  Distributed Base Station Cooperation via Block-Diagonalization and Dual-Decomposition , 2008, IEEE GLOBECOM 2008 - 2008 IEEE Global Telecommunications Conference.

[9]  Sampath Rangarajan,et al.  Efficient resource management in OFDMA Femto cells , 2009, MobiHoc '09.

[10]  Husheng Li,et al.  Multiagent -Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems , 2010 .

[11]  Raj Jain,et al.  A Quantitative Measure Of Fairness And Discrimination For Resource Allocation In Shared Computer Systems , 1998, ArXiv.

[12]  E. J. Collins,et al.  Convergent multiple-timescales reinforcement learning algorithms in normal form games , 2003 .

[13]  H. Kushner,et al.  Stochastic Approximation and Recursive Algorithms and Applications , 2003 .

[14]  Guy Pujolle,et al.  FCRA: Femtocell Cluster-Based Resource Allocation Scheme for OFDMA Networks , 2011, 2011 IEEE International Conference on Communications (ICC).

[15]  Johanne Cohen,et al.  Learning Equilibria in Games by Stochastic Distributed Algorithms , 2009, ISCIS.

[16]  Ali H. Sayed,et al.  A Leakage-Based Precoding Scheme for Downlink Multi-User MIMO Channels , 2007, IEEE Transactions on Wireless Communications.

[17]  Mario Bravo An Adjusted Payoff-Based Procedure for Normal Form Games , 2016, Math. Oper. Res..

[18]  Shlomo Shamai,et al.  Linear MMSE Precoding and Equalization for Network MIMO with Partial Cooperation , 2011, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[19]  Martin Haardt,et al.  Zero-forcing methods for downlink spatial multiplexing in multiuser MIMO channels , 2004, IEEE Transactions on Signal Processing.

[20]  Alagan Anpalagan,et al.  Opportunistic Spectrum Access in Unknown Dynamic Environment: A Game-Theoretic Stochastic Learning Solution , 2012, IEEE Transactions on Wireless Communications.

[21]  Cristina Comaniciu,et al.  Adaptive channel allocation spectrum etiquette for cognitive radio networks , 2005 .

[22]  Hamidou Tembine Distributed Strategic Learning for Wireless Engineers , 2017 .

[23]  Pierre Coucheney,et al.  Penalty-Regulated Dynamics and Robust Learning Procedures in Games , 2013, Math. Oper. Res..

[24]  Hamidou Tembine,et al.  Dynamic Robust Games in MIMO Systems , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[25]  Ana Galindo-Serrano,et al.  Femtocell systems with self organization capabilities , 2011, International Conference on NETwork Games, Control and Optimization (NetGCooP 2011).

[26]  Emil Björnson,et al.  Optimality Properties, Distributed Strategies, and Measurement-Based Evaluation of Coordinated Multicell OFDMA Transmission , 2011, IEEE Transactions on Signal Processing.

[27]  David S. Leslie,et al.  Individual Q-Learning in Normal Form Games , 2005, SIAM J. Control. Optim..

[28]  David Gesbert,et al.  Distributed Multicell-MISO Precoding Using the Layered Virtual SINR Framework , 2010, IEEE Transactions on Wireless Communications.

[29]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[30]  Alexander Shapiro,et al.  Stochastic Approximation approach to Stochastic Programming , 2013 .

[31]  David S. Leslie,et al.  Generalised weakened fictitious play , 2006, Games Econ. Behav..

[32]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[33]  Huaiyu Dai,et al.  Base Station Cooperation for Multiuser MIMO: Joint Transmission and BS Selection , 2004 .

[34]  Meixia Tao,et al.  Game Theoretic Multimode Precoding Strategy Selection for MIMO Multiple Access Channels , 2010, IEEE Signal Processing Letters.

[35]  Paul de Kerret,et al.  Sparse precoding in multicell MIMO systems , 2012, 2012 IEEE Wireless Communications and Networking Conference (WCNC).

[36]  Paul de Kerret,et al.  The multiplexing gain of a two-cell MIMO channel with unequal CSI , 2011, 2011 IEEE International Symposium on Information Theory Proceedings.

[37]  M. Hirsch,et al.  Stochastic approximation algorithms with constant step size whose average is cooperative , 1999 .

[38]  Pierre Priouret,et al.  Adaptive Algorithms and Stochastic Approximations , 1990, Applications of Mathematics.

[39]  Sean A. Ramprashad,et al.  Rethinking network MIMO: Cost of CSIT, performance analysis, and architecture comparisons , 2010, 2010 Information Theory and Applications Workshop (ITA).

[40]  V. V. Phansalkar,et al.  Decentralized Learning of Nash Equilibria in Multi-Person Stochastic Games With Incomplete Information , 1994, IEEE Trans. Syst. Man Cybern. Syst..

[41]  Athanasios V. Vasilakos,et al.  Game Dynamics and Cost of Learning in Heterogeneous 4G Networks , 2012, IEEE Journal on Selected Areas in Communications.

[42]  Michael Bloem,et al.  A stackelberg game for power control and channel allocation in cognitive radio networks , 2007, Valuetools 2007.

[43]  Andrea J. Goldsmith,et al.  Variable-rate variable-power MQAM for fading channels , 1997, IEEE Trans. Commun..

[44]  Jeffrey G. Andrews,et al.  Networked MIMO with clustered linear precoding , 2008, IEEE Transactions on Wireless Communications.

[45]  C.-C. Jay Kuo,et al.  Multicell OFDMA Downlink Resource Allocation Using a Graphic Framework , 2009, IEEE Transactions on Vehicular Technology.