Transfer Learning for Mixed-Integer Resource Allocation Problems in Wireless Networks

Effective resource allocation plays a pivotal role in wireless networks. Unfortunately, typical resource allocation problems are mixed-integer nonlinear programming (MINLP) problems, which are NP-hard. Machine learning based methods recently emerge as a disruptive way to obtain near-optimal performance for MINLP problems with affordable computational complexity. However, they suffer from severe performance deterioration when the network parameters change, which commonly happens in practice and can be characterized as the task mismatch issue. In this paper, we propose a transfer learning method via self-imitation, to address this issue for effective resource allocation in wireless networks. It is based on a general “learning to optimize” framework for solving MINLP problems. A unique advantage of the proposed method is that it can tackle the task mismatch issue with a few additional unlabeled training samples, which is especially important when transferring to large-size problems. Numerical experiments demonstrate that the proposed method, with much less training time, achieves comparable performance with the model trained from scratch based on sufficient labeled samples. To the best of our knowledge, this is the first work that applies transfer learning for resource allocation in wireless networks.

[1]  Woongsup Lee,et al.  Deep Power Control: Transmit Power Control Scheme Based on Convolutional Neural Network , 2018, IEEE Communications Letters.

[2]  N. Sidiropoulos,et al.  Learning to Optimize: Training Deep Neural Networks for Interference Management , 2017, IEEE Transactions on Signal Processing.

[3]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[4]  Yuanming Shi,et al.  Group Sparse Beamforming for Green Cloud-RAN , 2013, IEEE Transactions on Wireless Communications.

[5]  Leonidas J. Guibas,et al.  Taskonomy: Disentangling Task Transfer Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[6]  Jeffrey G. Andrews,et al.  User Association for Load Balancing in Heterogeneous Cellular Networks , 2012, IEEE Transactions on Wireless Communications.

[7]  Geoffrey J. Gordon,et al.  A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.

[8]  Claus Nebauer,et al.  Evaluation of convolutional neural networks for visual recognition , 1998, IEEE Trans. Neural Networks.

[9]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[10]  Le Song,et al.  2 Common Formulation for Greedy Algorithms on Graphs , 2018 .

[11]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[12]  Zhu Han,et al.  Resource Allocation for Wireless Networks: Basics, Techniques, and Applications , 2008 .

[13]  Jialin Song Learning to Search via Self-Imitation with Application to Risk-Aware Planning , 2017 .

[14]  Cong Shen,et al.  Towards Optimal Power Control via Ensembling Deep Neural Networks , 2018, IEEE Transactions on Communications.

[15]  Yuanming Shi,et al.  SCALABLE NETWORK ADAPTATION FOR CLOUD-RANS: AN IMITATION LEARNING APPROACH , 2018, 2018 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[16]  Victor C. M. Leung,et al.  Energy Efficient User Association and Power Allocation in Millimeter-Wave-Based Ultra Dense Networks With Energy Harvesting Base Stations , 2017, IEEE Journal on Selected Areas in Communications.

[17]  Khaled Ben Letaief,et al.  Multiuser OFDM with adaptive subcarrier, bit, and power allocation , 1999, IEEE J. Sel. Areas Commun..

[18]  He He,et al.  Learning to Search in Branch and Bound Algorithms , 2014, NIPS.