Learning to Continuously Optimize Wireless Resource in Episodically Dynamic Environment

There has been a growing interest in developing data-driven, in particular deep neural network (DNN) based methods for modern communication tasks. For a few popular tasks such as power control, beamforming, and MIMO detection, these methods achieve state-of-the-art performance while requiring less computational efforts, less channel state information (CSI), etc. However, it is often challenging for these approaches to learn in a dynamic environment where parameters such as CSIs keep changing.This work develops a methodology that enables data-driven methods to continuously learn and optimize in a dynamic environment. Specifically, we consider an "episodically dynamic" setting where the environment changes in "episodes", and in each episode the environment is stationary. We propose a continual learning (CL) framework for wireless systems, which can incrementally adapt the learning models to the new episodes, without forgetting models learned from the previous episodes. Our design is based on a novel min-max formulation which ensures certain "fairness" across different episodes. Finally, we demonstrate the effectiveness of the CL approach by customizing it to a popular DNN based model for power control, and testing using both synthetic and real data.

[1]  Guan Gui,et al.  Fast Beamforming Design via Deep Learning , 2020, IEEE Transactions on Vehicular Technology.

[2]  Shi Jin,et al.  Deep Learning for Massive MIMO CSI Feedback , 2017, IEEE Wireless Communications Letters.

[3]  Nikolaos Doulamis,et al.  Deep Learning for Computer Vision: A Brief Review , 2018, Comput. Intell. Neurosci..

[4]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[5]  Geoffrey Ye Li,et al.  Power of Deep Learning for Channel Estimation and Signal Detection in OFDM Systems , 2017, IEEE Wireless Communications Letters.

[6]  Jiwon Kim,et al.  Continual Learning with Deep Generative Replay , 2017, NIPS.

[7]  Jason D. Lee,et al.  Solving a Class of Non-Convex Min-Max Games Using Iterative First Order Methods , 2019, NeurIPS.

[8]  Marc'Aurelio Ranzato,et al.  Gradient Episodic Memory for Continual Learning , 2017, NIPS.

[9]  Ami Wiesel,et al.  Learning to Detect , 2018, IEEE Transactions on Signal Processing.

[10]  Dongning Guo,et al.  Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks , 2018, IEEE Journal on Selected Areas in Communications.

[11]  Tinne Tuytelaars,et al.  A Continual Learning Survey: Defying Forgetting in Classification Tasks , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Shai Shalev-Shwartz,et al.  Online Learning and Online Convex Optimization , 2012, Found. Trends Mach. Learn..

[13]  Guillermo Sapiro,et al.  Supervised Sparse Analysis and Synthesis Operators , 2013, NIPS.

[14]  David Isele,et al.  Selective Experience Replay for Lifelong Learning , 2018, AAAI.

[15]  Songtao Lu,et al.  Block Alternating Optimization for Non-convex Min-max Problems: Algorithms and Applications in Signal Processing and Communications , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Mark B. Ring Continual learning in reinforcement environments , 1995, GMD-Bericht.

[17]  David Burshtein,et al.  Deep Learning Methods for Improved Decoding of Linear Codes , 2017, IEEE Journal of Selected Topics in Signal Processing.

[18]  Yuanming Shi,et al.  A Graph Neural Network Approach for Scalable Wireless Power Control , 2019, 2019 IEEE Globecom Workshops (GC Wkshps).

[19]  Alejandro Ribeiro,et al.  Learning Optimal Resource Allocations in Wireless Systems , 2018, IEEE Transactions on Signal Processing.

[20]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[21]  Michael McCloskey,et al.  Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem , 1989 .

[22]  Alessio Zappone,et al.  Model-Aided Wireless Artificial Intelligence: Embedding Expert Knowledge in Deep Neural Networks for Wireless System Optimization , 2019, IEEE Vehicular Technology Magazine.

[23]  Zhi-Quan Luo,et al.  Dynamic Spectrum Management: Complexity and Duality , 2008, IEEE Journal of Selected Topics in Signal Processing.

[24]  Woongsup Lee,et al.  Deep Power Control: Transmit Power Control Scheme Based on Convolutional Neural Network , 2018, IEEE Communications Letters.

[25]  Wei Cui,et al.  Spatial Deep Learning for Wireless Scheduling , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[26]  Stefan Wermter,et al.  Continual Lifelong Learning with Neural Networks: A Review , 2019, Neural Networks.

[27]  Stephan ten Brink,et al.  On deep learning-based communication over the air , 2017, 2017 51st Asilomar Conference on Signals, Systems, and Computers.

[28]  Jean C. Walrand,et al.  Fair end-to-end window-based congestion control , 2000, TNET.

[29]  Mingyi Hong,et al.  To Supervise or Not to Supervise: How to Effectively Learn Wireless Interference Management Models? , 2021, 2021 IEEE 22nd International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[30]  Tsung-Hui Chang,et al.  Optimization Inspired Learning Network for Multiuser Robust Beamforming , 2020, 2020 IEEE 11th Sensor Array and Multichannel Signal Processing Workshop (SAM).

[31]  Jakob Hoydis,et al.  An Introduction to Deep Learning for the Physical Layer , 2017, IEEE Transactions on Cognitive Communications and Networking.

[32]  Sundeep Rangan,et al.  AMP-Inspired Deep Networks for Sparse Linear Inverse Problems , 2016, IEEE Transactions on Signal Processing.

[33]  Mingyi Hong,et al.  Learning to Beamform in Heterogeneous Massive MIMO Networks , 2020, IEEE Transactions on Wireless Communications.

[34]  Yarin Gal,et al.  Towards Robust Evaluations of Continual Learning , 2018, ArXiv.

[35]  Jonathan Le Roux,et al.  Deep Unfolding: Model-Based Inspiration of Novel Deep Architectures , 2014, ArXiv.

[36]  Tinne Tuytelaars,et al.  Online Continual Learning with Maximally Interfered Retrieval , 2019, ArXiv.

[37]  N. Sidiropoulos,et al.  Learning to Optimize: Training Deep Neural Networks for Interference Management , 2017, IEEE Transactions on Signal Processing.

[38]  Alexios Balatsoukas-Stimming,et al.  Deep Unfolding for Communications Systems: A Survey and Some New Directions , 2019, 2019 IEEE International Workshop on Signal Processing Systems (SiPS).

[39]  Zhi-Quan Luo,et al.  Signal Processing and Optimal Resource Allocation for the Interference Channel , 2012, ArXiv.

[40]  Razvan Pascanu,et al.  Progressive Neural Networks , 2016, ArXiv.

[41]  Alejandro Ribeiro,et al.  Optimal Wireless Resource Allocation With Random Edge Graph Neural Networks , 2019, IEEE Transactions on Signal Processing.

[42]  Derek Hoiem,et al.  Learning without Forgetting , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Geoffrey Ye Li,et al.  A Model-Driven Deep Learning Network for MIMO Detection , 2018, 2018 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[44]  Stephan ten Brink,et al.  Deep Learning Based Communication Over the Air , 2017, IEEE Journal of Selected Topics in Signal Processing.

[45]  Yoshua Bengio,et al.  Gradient based sample selection for online continual learning , 2019, NeurIPS.

[46]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[47]  Bjorn Ottersten,et al.  Optimal Downlink Beamforming Using Semidefinite Optimization , 2014 .

[48]  Dit-Yan Yeung,et al.  Collaborative Deep Learning for Recommender Systems , 2014, KDD.

[49]  Zhi-Quan Luo,et al.  Linear transceiver design for a MIMO interfering broadcast channel achieving max-min fairness , 2011, 2011 Conference Record of the Forty Fifth Asilomar Conference on Signals, Systems and Computers (ASILOMAR).

[50]  Mingyi Hong,et al.  Nonconvex Min-Max Optimization: Applications, Challenges, and Recent Theoretical Advances , 2020, IEEE Signal Processing Magazine.

[51]  Yann LeCun,et al.  Learning Fast Approximations of Sparse Coding , 2010, ICML.

[52]  OctoMiao Overcoming catastrophic forgetting in neural networks , 2016 .

[53]  Nathan D. Cahill,et al.  Memory Efficient Experience Replay for Streaming Learning , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[54]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[55]  Harish Viswanathan,et al.  Deep Learning Based Preamble Detection and TOA Estimation , 2019, 2019 IEEE Global Communications Conference (GLOBECOM).

[56]  Mingyi Hong,et al.  Limited Feedback Double Directional Massive MIMO Channel Estimation: From Low-Rank Modeling to Deep Learning , 2018, 2018 IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[57]  Zhi Ding,et al.  Iterative Algorithm Induced Deep-Unfolding Neural Networks: Precoding Design for Multiuser MIMO Systems , 2020, IEEE Transactions on Wireless Communications.

[58]  Cong Shen,et al.  Towards Optimal Power Control via Ensembling Deep Neural Networks , 2018, IEEE Transactions on Communications.

[59]  Ahmed Alkhateeb,et al.  DeepMIMO: A Generic Deep Learning Dataset for Millimeter Wave and Massive MIMO Applications , 2019, ArXiv.

[60]  Ami Wiesel,et al.  Deep MIMO detection , 2017, 2017 IEEE 18th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[61]  Bjorn Ottersten,et al.  Transfer Learning and Meta Learning-Based Fast Downlink Beamforming Adaptation , 2020, IEEE Transactions on Wireless Communications.

[62]  Anthony V. Robins,et al.  Catastrophic Forgetting, Rehearsal and Pseudorehearsal , 1995, Connect. Sci..

[63]  Stephan ten Brink,et al.  Scaling Deep Learning-Based Decoding of Polar Codes via Partitioning , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[64]  H. Vincent Poor,et al.  On unbounded path-loss models: effects of singularity on wireless network performance , 2009, IEEE Journal on Selected Areas in Communications.

[65]  Sung Ju Hwang,et al.  Lifelong Learning with Dynamically Expandable Networks , 2017, ICLR.

[66]  Surya Ganguli,et al.  Continual Learning Through Synaptic Intelligence , 2017, ICML.

[67]  Yuanming Shi,et al.  LORM: Learning to Optimize for Resource Management in Wireless Networks With Few Training Samples , 2018, IEEE Transactions on Wireless Communications.

[68]  Zhi-Quan Luo,et al.  An iteratively weighted MMSE approach to distributed sum-utility maximization for a MIMO interfering broadcast channel , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[69]  Christoph H. Lampert,et al.  iCaRL: Incremental Classifier and Representation Learning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[70]  David Rolnick,et al.  Experience Replay for Continual Learning , 2018, NeurIPS.