Reinforcement Learning for Link Adaptation in MIMO-OFDM Wireless Systems

Machine learning algorithms have recently attracted considerable interest for link adaptation because they are flexible and implicitly capture more environmental effects than classical adaptation algorithms. However, past applications have been limited to rather simple configurations, such as identifying the channel condition or performing link adaptation in fixed or slowly varying channels. More recently, more sophisticated approaches based on offline supervised learning have been proposed for link adaptation in complex configurations such as MIMO-OFDM, but their time complexity and offline training phase hamper their real-world applicability. Approaches based on online learning have shown good throughput performance, but their high memory requirements make them inefficient or even impractical. In this paper, we propose a new online learning algorithm for link adaptation. Our computations show that the algorithm performs comparably to existing online learning approaches while requiring minimal storage and time, which makes it more practical. Moreover, it adapts quickly to changes in the channel distribution.
