Learning to Continuously Optimize Wireless Resource in a Dynamic Environment: A Bilevel Optimization Perspective

There has been a growing interest in developing data-driven, and in particular deep neural network (DNN) based methods for modern communication tasks. These methods achieve state-of-the-art performance for a few popular wireless resource allocation problems, while requiring less computational efforts, less resources for acquiring channel state information (CSI), etc. However, it is often challenging for these approaches to learn in a dynamic environment. This work develops a new approach that enables data-driven methods to continuously learn and optimize wireless resource allocation in a dynamic environment. Specifically, we consider an “episodically dynamic” setting where the environment statistics change in “episodes,” and in each episode the environment is stationary. We propose to build the notion of continual learning (CL) into wireless system design, so that the learning model can incrementally adapt to the new episodes, without forgetting knowledge learned from the previous episodes. We demonstrate the effectiveness of the CL approach by integrating it with three popular DNN based models for power control, beamforming and multi-user MIMO, respectively, and testing using both synthetic and ray-tracing based data sets. These numerical results show that the proposed CL approach is not only able to adapt to the new scenarios quickly and seamlessly, but importantly, it also maintains high performance over the previously encountered scenarios as well.

[1]  Mingyi Hong,et al.  To Supervise or Not to Supervise: How to Effectively Learn Wireless Interference Management Models? , 2021, 2021 IEEE 22nd International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[2]  Mingyi Hong,et al.  Learning to Continuously Optimize Wireless Resource in Episodically Dynamic Environment , 2020, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  Bjorn Ottersten,et al.  Transfer Learning and Meta Learning-Based Fast Downlink Beamforming Adaptation , 2020, IEEE Transactions on Wireless Communications.

[4]  Wotao Yin,et al.  Solving Stochastic Compositional Optimization is Nearly as Easy as Solving Stochastic Optimization , 2020, IEEE Transactions on Signal Processing.

[5]  Zhi Ding,et al.  Iterative Algorithm Induced Deep-Unfolding Neural Networks: Precoding Design for Multiuser MIMO Systems , 2020, IEEE Transactions on Wireless Communications.

[6]  A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic , 2020, ArXiv.

[7]  Maziar Sanjabi,et al.  Nonconvex Min-Max Optimization: Applications, Challenges, and Recent Theoretical Advances , 2020, IEEE Signal Processing Magazine.

[8]  Calypso Herrera,et al.  Estimating Full Lipschitz Constants of Deep Neural Networks , 2020, ArXiv.

[9]  Guan Gui,et al.  Fast Beamforming Design via Deep Learning , 2020, IEEE Transactions on Vehicular Technology.

[10]  Alejandro Ribeiro,et al.  Optimal Wireless Resource Allocation With Random Edge Graph Neural Networks , 2019, IEEE Transactions on Signal Processing.

[11]  Michael I. Jordan,et al.  On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems , 2019, ICML.

[12]  Yuanming Shi,et al.  LORM: Learning to Optimize for Resource Management in Wireless Networks With Few Training Samples , 2018, IEEE Transactions on Wireless Communications.

[13]  Cong Shen,et al.  Towards Optimal Power Control via Ensembling Deep Neural Networks , 2018, IEEE Transactions on Communications.

[14]  Harish Viswanathan,et al.  Deep Learning Based Preamble Detection and TOA Estimation , 2019, 2019 IEEE Global Communications Conference (GLOBECOM).

[15]  Tinne Tuytelaars,et al.  Online Continual Learning with Maximally Interfered Retrieval , 2019, ArXiv.

[16]  Yuanming Shi,et al.  A Graph Neural Network Approach for Scalable Wireless Power Control , 2019, 2019 IEEE Globecom Workshops (GC Wkshps).

[17]  Alexios Balatsoukas-Stimming,et al.  Deep Unfolding for Communications Systems: A Survey and Some New Directions , 2019, 2019 IEEE International Workshop on Signal Processing Systems (SiPS).

[18]  Yoshua Bengio,et al.  Gradient based sample selection for online continual learning , 2019, NeurIPS.

[19]  Ahmed Alkhateeb,et al.  DeepMIMO: A Generic Deep Learning Dataset for Millimeter Wave and Massive MIMO Applications , 2019, ArXiv.

[20]  David Rolnick,et al.  Experience Replay for Continual Learning , 2018, NeurIPS.

[21]  Nathan D. Cahill,et al.  Memory Efficient Experience Replay for Streaming Learning , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[22]  Alessio Zappone,et al.  Model-Aided Wireless Artificial Intelligence: Embedding Expert Knowledge in Deep Neural Networks for Wireless System Optimization , 2019, IEEE Vehicular Technology Magazine.

[23]  Dongning Guo,et al.  Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks , 2018, IEEE Journal on Selected Areas in Communications.

[24]  Alejandro Ribeiro,et al.  Learning Optimal Resource Allocations in Wireless Systems , 2018, IEEE Transactions on Signal Processing.

[25]  Ami Wiesel,et al.  Learning to Detect , 2018, IEEE Transactions on Signal Processing.

[26]  Stefan Wermter,et al.  Continual Lifelong Learning with Neural Networks: A Review , 2018, Neural Networks.

[27]  Geoffrey Ye Li,et al.  A Model-Driven Deep Learning Network for MIMO Detection , 2018, 2018 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[28]  Wei Cui,et al.  Spatial Deep Learning for Wireless Scheduling , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[29]  Mingyi Hong,et al.  Limited Feedback Double Directional Massive MIMO Channel Estimation: From Low-Rank Modeling to Deep Learning , 2018, 2018 IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[30]  Ying Li,et al.  Deep Learning Coordinated Beamforming for Highly-Mobile Millimeter Wave Systems , 2018, IEEE Access.

[31]  Woongsup Lee,et al.  Deep Power Control: Transmit Power Control Scheme Based on Convolutional Neural Network , 2018, IEEE Communications Letters.

[32]  David Isele,et al.  Selective Experience Replay for Lifelong Learning , 2018, AAAI.

[33]  Nikolaos Doulamis,et al.  Deep Learning for Computer Vision: A Brief Review , 2018, Comput. Intell. Neurosci..

[34]  Shi Jin,et al.  Deep Learning for Massive MIMO CSI Feedback , 2017, IEEE Wireless Communications Letters.

[35]  Geoffrey Ye Li,et al.  Power of Deep Learning for Channel Estimation and Signal Detection in OFDM Systems , 2017, IEEE Wireless Communications Letters.

[36]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[37]  Sung Ju Hwang,et al.  Lifelong Learning with Dynamically Expandable Networks , 2017, ICLR.

[38]  Stephan ten Brink,et al.  Deep Learning Based Communication Over the Air , 2017, IEEE Journal of Selected Topics in Signal Processing.

[39]  David Burshtein,et al.  Deep Learning Methods for Improved Decoding of Linear Codes , 2017, IEEE Journal of Selected Topics in Signal Processing.

[40]  Derek Hoiem,et al.  Learning without Forgetting , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  N. Sidiropoulos,et al.  Learning to Optimize: Training Deep Neural Networks for Interference Management , 2017, IEEE Transactions on Signal Processing.

[42]  Ami Wiesel,et al.  Deep MIMO detection , 2017, 2017 IEEE 18th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[43]  Marc'Aurelio Ranzato,et al.  Gradient Episodic Memory for Continual Learning , 2017, NIPS.

[44]  Jiwon Kim,et al.  Continual Learning with Deep Generative Replay , 2017, NIPS.

[45]  Surya Ganguli,et al.  Continual Learning Through Synaptic Intelligence , 2017, ICML.

[46]  Stephan ten Brink,et al.  Scaling Deep Learning-Based Decoding of Polar Codes via Partitioning , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[47]  Jakob Hoydis,et al.  An Introduction to Deep Learning for the Physical Layer , 2017, IEEE Transactions on Cognitive Communications and Networking.

[48]  Sundeep Rangan,et al.  AMP-Inspired Deep Networks for Sparse Linear Inverse Problems , 2016, IEEE Transactions on Signal Processing.

[49]  Razvan Pascanu,et al.  Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.

[50]  Christoph H. Lampert,et al.  iCaRL: Incremental Classifier and Representation Learning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  Mengdi Wang,et al.  Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions , 2014, Mathematical Programming.

[52]  Razvan Pascanu,et al.  Progressive Neural Networks , 2016, ArXiv.

[53]  Dit-Yan Yeung,et al.  Collaborative Deep Learning for Recommender Systems , 2014, KDD.

[54]  Jonathan Le Roux,et al.  Deep Unfolding: Model-Based Inspiration of Novel Deep Architectures , 2014, ArXiv.

[55]  Bjorn Ottersten,et al.  Optimal Downlink Beamforming Using Semidefinite Optimization , 2014 .

[56]  Guillermo Sapiro,et al.  Supervised Sparse Analysis and Synthesis Operators , 2013, NIPS.

[57]  Zhi-Quan Luo,et al.  Signal Processing and Optimal Resource Allocation for the Interference Channel , 2012, ArXiv.

[58]  Shai Shalev-Shwartz,et al.  Online Learning and Online Convex Optimization , 2012, Found. Trends Mach. Learn..

[59]  Zhi-Quan Luo,et al.  Linear transceiver design for a MIMO interfering broadcast channel achieving max-min fairness , 2011, 2011 Conference Record of the Forty Fifth Asilomar Conference on Signals, Systems and Computers (ASILOMAR).

[60]  Zhi-Quan Luo,et al.  An iteratively weighted MMSE approach to distributed sum-utility maximization for a MIMO interfering broadcast channel , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[61]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[62]  Yann LeCun,et al.  Learning Fast Approximations of Sparse Coding , 2010, ICML.

[63]  Zhi-Quan Luo,et al.  Dynamic Spectrum Management: Complexity and Duality , 2008, IEEE Journal of Selected Topics in Signal Processing.

[64]  Yurii Nesterov,et al.  Introductory Lectures on Convex Optimization - A Basic Course , 2014, Applied Optimization.

[65]  Jean C. Walrand,et al.  Fair end-to-end window-based congestion control , 2000, TNET.

[66]  Anthony V. Robins,et al.  Catastrophic Forgetting, Rehearsal and Pseudorehearsal , 1995, Connect. Sci..

[67]  Mark B. Ring Continual learning in reinforcement environments , 1995, GMD-Bericht.

[68]  Michael McCloskey,et al.  Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem , 1989 .