Performance Improvements for Wireless Mobile Networks via Deep Reinforcement Learning

Deep reinforcement learning (DRL) is evolved from a collection of powerful techniques in artificial intelligence areas, and has been extensively used in different areas. In DRL, an agent learns to take actions that would yield the most reward by interacting with the environment without prior knowledge of an exact mathematical model of the environment. In this work, we investigate the performance improvements for wireless mobile networks via DRL. We firstly present a DRL approach in cache-enabled opportunistic interference alignment wireless networks. Most existing related works assume that the wireless channels are invariant, which is unrealistic. We consider time-varying channels, and therefore the complexity of the system is very high. We use Google TensorFlow to implement DRL in this chapter to obtain the optimal user selection policy in cache-enabled opportunistic interference alignment wireless networks. Simulation results are presented to show that the network's sum rate and energy efficiency can be significantly improved by using the proposed approach. Secondly, we design a software-defined framework for connected vehicles, which integrates communication, caching and mobile edge computing. A deep reinforcement learning-based resource allocation scheme is proposed for the connected vehicles. The dynamic change processes of the resources are modeled as Markov chains, respectively. Without any assumptions about the objective functions or any low-complexity preprocessing, the proposed scheme can directly solve the resource allocation problems with large-scale state space. Simulation results verify that the proposed scheme can converge at a fast speed, and improve the network operator's total utilities. Thirdly, we study trust-based social networks with mobile edge computing, in-network caching and device-to-device communications. An optimization problem is formulated to maximize the network operator's utility with comprehensive considerations of trust values, computation capabilities, wireless channel qualities, and the cache status of all the available nodes. We apply a DRL approach to automatically make a decision for allocating the network resources. The decision is made purely through observing the network's states, rather than any handcrafted or explicit control rules, which makes it adaptive to variable network conditions. Simulation results with different network parameters are presented to show the effectiveness of the proposed scheme.

[1]  Yi Sun,et al.  Interference Alignment Based on Antenna Selection With Imperfect Channel State Information in Cognitive Radio Networks , 2016, IEEE Transactions on Vehicular Technology.

[2]  Depeng Jin,et al.  Vehicular Fog Computing: A Viewpoint of Vehicles as the Infrastructures , 2016, IEEE Transactions on Vehicular Technology.

[3]  Syed Ali Jafar,et al.  A Distributed Numerical Approach to Interference Alignment and Applications to Wireless Interference Networks , 2011, IEEE Transactions on Information Theory.

[4]  Zhou Su,et al.  Content distribution over content centric mobile social networks in 5G , 2015, IEEE Communications Magazine.

[5]  Wei Yu,et al.  Optimized Backhaul Compression for Uplink Cloud Radio Access Network , 2013, IEEE Journal on Selected Areas in Communications.

[6]  Wenzhong Li,et al.  Efficient Multi-User Computation Offloading for Mobile-Edge Cloud Computing , 2015, IEEE/ACM Transactions on Networking.

[7]  Pao-Chi Chang,et al.  On verifying the first-order Markovian assumption for a Rayleigh fading channel model , 1996 .

[8]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[9]  Nan Zhao,et al.  Adaptive Power Allocation Schemes for Spectrum Sharing in Interference-Alignment-Based Cognitive Radio Networks , 2016, IEEE Transactions on Vehicular Technology.

[10]  F. Richard Yu,et al.  Information-Centric Virtualized Cellular Networks With Device-to-Device Communications , 2016, IEEE Transactions on Vehicular Technology.

[11]  Marco Di Renzo,et al.  Power-Availability-Aware Cell Association for Energy-Harvesting Small-Cell Base Stations , 2016, IEEE Transactions on Wireless Communications.

[12]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Zhu Han,et al.  Social-Aware Data Dissemination via Device-to-Device Communications: Fusing Social and Mobile Networks with Incentive Constraints , 2019, IEEE Transactions on Services Computing.

[14]  Serge Fdida,et al.  A survey on predicting the popularity of web content , 2014, Journal of Internet Services and Applications.

[15]  E. Biglieri,et al.  Space-time decoding with imperfect channel estimation , 2003, IEEE Transactions on Wireless Communications.

[16]  F. Richard Yu,et al.  Wireless Network Virtualization: A Survey, Some Research Issues and Challenges , 2015, IEEE Communications Surveys & Tutorials.

[17]  Dong Liu,et al.  Caching at the wireless edge: design aspects, challenges, and future directions , 2016, IEEE Communications Magazine.

[18]  Laizhong Cui,et al.  When big data meets software-defined networking: SDN for big data and big data for SDN , 2016, IEEE Network.

[19]  Zaher Dawy,et al.  Social Network Aware Device-to-Device Communication in Wireless Networks , 2015, IEEE Transactions on Wireless Communications.

[20]  Cecilio Pimentel,et al.  Finite-state Markov modeling of correlated Rician-fading channels , 2004, IEEE Transactions on Vehicular Technology.

[21]  Antonella Molinaro,et al.  Information-centric networking for connected vehicles: a survey and future perspectives , 2016, IEEE Communications Magazine.

[22]  Ambedkar Dukkipati,et al.  Learning by Stretching Deep Networks , 2014, ICML.

[23]  M. V. Velzen,et al.  Self-organizing maps , 2007 .

[24]  Tao Tang,et al.  Finite-State Markov Modeling for Wireless Channels in Tunnel Communication-Based Train Control Systems , 2014, IEEE Transactions on Intelligent Transportation Systems.

[25]  Victor C. M. Leung,et al.  Delay-Optimal Virtualized Radio Resource Scheduling in Software-Defined Vehicular Networks via Stochastic Learning , 2016, IEEE Transactions on Vehicular Technology.

[26]  F. Richard Yu,et al.  Optimization of cache-enabled opportunistic interference alignment wireless networks: A big data deep reinforcement learning approach , 2017, 2017 IEEE International Conference on Communications (ICC).

[27]  Hao Chen,et al.  Enabling cyber-physical communication in 5G cellular networks: challenges, spatial spectrum sensing, and cyber-security , 2017, IET Cyper-Phys. Syst.: Theory & Appl..

[28]  Hong Shen Wang,et al.  Finite-state Markov channel-a useful model for radio communication channels , 1995 .

[29]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[30]  Thomas M. Chen,et al.  Dempster-Shafer theory for intrusion detection in ad hoc networks , 2005, IEEE Internet Computing.

[31]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[32]  Patrick Crowley,et al.  Named data networking , 2014, CCRV.

[33]  Kun Yang,et al.  Mobile Social Networks: Architectures, Social Properties, and Key Research Challenges , 2013, IEEE Communications Surveys & Tutorials.

[34]  Victor C. M. Leung,et al.  Deep-Reinforcement-Learning-Based Optimization for Cache-Enabled Opportunistic Interference Alignment Wireless Networks , 2017, IEEE Transactions on Vehicular Technology.

[35]  Giuseppe Caire,et al.  Wireless caching: technical misconceptions and business barriers , 2016, IEEE Communications Magazine.

[36]  Joseph Kee-Yin Ng,et al.  Network-Coding-Assisted Data Dissemination via Cooperative Vehicle-to-Vehicle/-Infrastructure Communications , 2016, IEEE Transactions on Intelligent Transportation Systems.

[37]  Xi Zhang,et al.  Information-centric network function virtualization over 5g mobile wireless networks , 2015, IEEE Network.

[38]  Antonella Molinaro,et al.  From Theory to Experimental Evaluation: Resource Management in Software-Defined Vehicular Networks , 2017, IEEE Access.

[39]  Xiangang Li,et al.  Constructing long short-term memory based deep recurrent neural networks for large vocabulary speech recognition , 2014, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[40]  Hsiao-Hwa Chen,et al.  Cooperative Device-to-Device Communications: Social Networking Perspectives , 2017, IEEE Network.

[41]  David E. Booth,et al.  A comparison of supervised and unsupervised neural networks in predicting bankruptcy of Korean firms , 2005, Expert Syst. Appl..

[42]  Zhu Han,et al.  Caching based socially-aware D2D communications in wireless content delivery networks: a hypergraph framework , 2016, IEEE Wireless Communications.

[43]  Fernando A. Kuipers,et al.  SDN and Virtualization Solutions for the Internet of Things: A Survey , 2016, IEEE Access.

[44]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[45]  Bang Chul Jung,et al.  Opportunistic Interference Alignment for Interference-Limited Cellular TDD Uplink , 2011, IEEE Communications Letters.

[46]  Leonard J. Cimini,et al.  MobiCacher: Mobility-aware content caching in small-cell networks , 2014, 2014 IEEE Global Communications Conference.

[47]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[48]  S. Haykin,et al.  A Q-learning-based dynamic channel assignment technique for mobile communication systems , 1999 .

[49]  Yang Yi,et al.  Reservoir Computing Meets Smart Grids: Attack Detection Using Delayed Feedback Networks , 2018, IEEE Transactions on Industrial Informatics.

[50]  Zhu Han,et al.  Distributed Interference and Energy-Aware Power Control for Ultra-Dense D2D Networks: A Mean Field Game , 2017, IEEE Transactions on Wireless Communications.

[51]  David Tse,et al.  Fundamentals of Wireless Communication , 2005 .

[52]  Qianbin Chen,et al.  Integration of Networking, Caching, and Computing in Wireless Systems: A Survey, Some Research Issues, and Challenges , 2018, IEEE Communications Surveys & Tutorials.

[53]  Carey L. Williamson,et al.  Estimating Instantaneous Cache Hit Ratio Using Markov Chain Analysis , 2013, IEEE/ACM Transactions on Networking.

[54]  A. H. Kayran,et al.  On Feasibility of Interference Alignment in MIMO Interference Networks , 2009, IEEE Transactions on Signal Processing.

[55]  Joseph Kee-Yin Ng,et al.  Cooperative Data Scheduling in Hybrid Vehicular Ad Hoc Networks: VANET as a Software Defined Network , 2016, IEEE/ACM Transactions on Networking.

[56]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[57]  Brian L. Mark,et al.  A quantitative trust establishment framework for reliable data packet delivery in MANETs , 2005, SASN '05.

[58]  Sergey Levine,et al.  Continuous Deep Q-Learning with Model-based Acceleration , 2016, ICML.

[59]  Victor C. M. Leung,et al.  Software-Defined Networks with Mobile Edge Computing and Caching for Smart Cities: A Big Data Deep Reinforcement Learning Approach , 2017, IEEE Communications Magazine.

[60]  F. Richard Yu,et al.  Predictive Control for Energy Efficiency in Wireless Cellular Networks , 2012, 2012 IEEE 75th Vehicular Technology Conference (VTC Spring).

[61]  Sergey Levine,et al.  End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[62]  Syed Ali Jafar,et al.  Interference Alignment and Degrees of Freedom of the $K$-User Interference Channel , 2008, IEEE Transactions on Information Theory.

[63]  Hongxi Yin,et al.  Multiuser-diversity-based interference alignment in cognitive radio networks , 2016 .

[64]  Andrew W. Senior,et al.  Long short-term memory recurrent neural network architectures for large scale acoustic modeling , 2014, INTERSPEECH.

[65]  Xiaolin Li,et al.  Detection and defense of DDoS attack–based on deep learning in OpenFlow‐based SDN , 2018, Int. J. Commun. Syst..

[66]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[67]  Urs Niesen,et al.  Cache-aided interference channels , 2015, 2015 IEEE International Symposium on Information Theory (ISIT).

[68]  Mérouane Debbah,et al.  From Spectrum Pooling to Space Pooling: Opportunistic Interference Alignment in MIMO Cognitive Networks , 2009, IEEE Transactions on Signal Processing.

[69]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[70]  Qing Wang,et al.  A Survey on Device-to-Device Communication in Cellular Networks , 2013, IEEE Communications Surveys & Tutorials.

[71]  Qianbin Chen,et al.  Computation Offloading and Resource Allocation in Wireless Cellular Networks With Mobile Edge Computing , 2017, IEEE Transactions on Wireless Communications.

[72]  Nan Zhao,et al.  Integrated Networking, Caching, and Computing for Connected Vehicles: A Deep Reinforcement Learning Approach , 2018, IEEE Transactions on Vehicular Technology.

[73]  Xiaofei Wang,et al.  Cache in the air: exploiting content caching and delivery techniques for 5G systems , 2014, IEEE Communications Magazine.

[74]  Changho Suh,et al.  Interference Alignment for Cellular Networks , 2008, 2008 46th Annual Allerton Conference on Communication, Control, and Computing.

[75]  Weihua Zhuang,et al.  Interworking of DSRC and Cellular Network Technologies for V2X Communications: A Survey , 2016, IEEE Transactions on Vehicular Technology.

[76]  R. Sutton Introduction: The challenge of reinforcement learning , 1992, Machine Learning.

[77]  Boleslaw K. Szymanski,et al.  Friendship Based Routing in Delay Tolerant Mobile Social Networks , 2010, 2010 IEEE Global Telecommunications Conference GLOBECOM 2010.

[78]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[79]  R. Clarke A statistical theory of mobile-radio reception , 1968 .

[80]  Victor C. M. Leung,et al.  Cross-Layer Design for TCP Performance Improvement in Cognitive Radio Networks , 2010, IEEE Transactions on Vehicular Technology.

[81]  Pan Hui,et al.  BUBBLE Rap: Social-Based Forwarding in Delay-Tolerant Networks , 2008, IEEE Transactions on Mobile Computing.

[82]  Sherali Zeadally,et al.  Vehicular delay-tolerant networks for smart grid data management using mobile edge computing , 2016, IEEE Communications Magazine.

[83]  Matias Richart,et al.  Resource Slicing in Virtual Wireless Networks: A Survey , 2016, IEEE Transactions on Network and Service Management.

[84]  Carl E. Rasmussen,et al.  PILCO: A Model-Based and Data-Efficient Approach to Policy Search , 2011, ICML.

[85]  Qichao Xu,et al.  Security-Aware Resource Allocation for Mobile Social Big Data: A Matching-Coalitional Game Solution , 2017, IEEE Transactions on Big Data.

[86]  Peter Stone,et al.  Deep Reinforcement Learning in Parameterized Action Space , 2015, ICLR.

[87]  Antonella Molinaro,et al.  From today's VANETs to tomorrow's planning and the bets for the day after , 2015, Veh. Commun..

[88]  Sergio Barbarossa,et al.  Joint Optimization of Radio and Computational Resources for Multicell Mobile-Edge Computing , 2014, IEEE Transactions on Signal and Information Processing over Networks.

[89]  F. Richard Yu,et al.  Optimal Joint Session Admission Control in Integrated WLAN and CDMA Cellular Networks with Vertical Handoff , 2007, IEEE Transactions on Mobile Computing.

[90]  Teuvo Kohonen,et al.  The self-organizing map , 1990, Neurocomputing.

[91]  Yan Yu,et al.  Power Allocation for Cache-Aided Small-Cell Networks With Limited Backhaul , 2017, IEEE Access.

[92]  K. B. Letaief,et al.  A Survey on Mobile Edge Computing: The Communication Perspective , 2017, IEEE Communications Surveys & Tutorials.

[93]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[94]  Mérouane Debbah,et al.  On the benefits of edge caching for MIMO interference alignment , 2015, 2015 IEEE 16th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[95]  Ioannis Lambadaris,et al.  Trust establishment in cooperative wireless networks , 2010, 2010 - MILCOM 2010 MILITARY COMMUNICATIONS CONFERENCE.

[96]  Deniz Gündüz,et al.  Learning-based optimization of cache content in a small cell base station , 2014, 2014 IEEE International Conference on Communications (ICC).

[97]  Peter Xiaoping Liu,et al.  Distributed Combined Authentication and Intrusion Detection With Data Fusion in High-Security Mobile Ad Hoc Networks , 2010, IEEE Transactions on Vehicular Technology.

[98]  Nikos Fotiou,et al.  A Survey of Information-Centric Networking Research , 2014, IEEE Communications Surveys & Tutorials.

[99]  Hao Yi Ong,et al.  Distributed Deep Q-Learning , 2015, ArXiv.

[100]  M. Kubát An Introduction to Machine Learning , 2017, Springer International Publishing.

[101]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[102]  Xing Zhang,et al.  A Survey on Mobile Edge Networks: Convergence of Computing, Caching and Communications , 2017, IEEE Access.

[103]  Taskin Koçak,et al.  Survey of random neural network applications , 2000, Eur. J. Oper. Res..

[104]  Yuan Fei A NEW METHOD TO SUPPORT UMTS / WLAN VERTICAL HANDOVER USING SCTP , 2022 .

[105]  F. Richard Yu,et al.  Energy-efficient resource allocation in software-defined mobile networks with mobile edge computing and caching , 2017, 2017 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[106]  Stelios Timotheou,et al.  The Random Neural Network: A Survey , 2010, Comput. J..

[107]  Sami Muhaidat,et al.  On the Performance of Imperfect Channel Estimation for Vehicular Ad-Hoc Networks , 2010, 2010 IEEE 72nd Vehicular Technology Conference - Fall.

[108]  Robert W. Heath,et al.  The practical challenges of interference alignment , 2012, IEEE Wireless Communications.

[109]  Haipeng Yao,et al.  Big Data Analytics in Mobile Cellular Networks , 2016, IEEE Access.

[110]  Khaled Ben Letaief,et al.  Delay-optimal computation task scheduling for mobile-edge computing systems , 2016, 2016 IEEE International Symposium on Information Theory (ISIT).

[111]  Gerardo Rubino,et al.  A Tutorial about Random Neural Networks in Supervised Learning , 2016, ArXiv.

[112]  F. Richard Yu,et al.  Distributed Optimal Relay Selection in Wireless Cooperative Networks With Finite-State Markov Channels , 2010, IEEE Transactions on Vehicular Technology.

[113]  R. Michael Buehrer,et al.  Learning distributed caching strategies in small cell networks , 2014, 2014 11th International Symposium on Wireless Communications Systems (ISWCS).

[114]  Ke Zhang,et al.  Energy-Efficient Offloading for Mobile Edge Computing in 5G Heterogeneous Networks , 2016, IEEE Access.

[115]  B. Aazhang,et al.  Cellular networks with an overlaid device to device network , 2008, 2008 42nd Asilomar Conference on Signals, Systems and Computers.

[116]  Haipeng Yao,et al.  A Survey of Mobile Information-Centric Networking: Research Issues and Challenges , 2018, IEEE Communications Surveys & Tutorials.

[117]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[118]  Xiaojiang Du,et al.  Toward Vehicle-Assisted Cloud Computing for Smartphones , 2015, IEEE Transactions on Vehicular Technology.

[119]  Jia Guo,et al.  Trust-Based Service Management for Social Internet of Things Systems , 2016, IEEE Transactions on Dependable and Secure Computing.

[120]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[121]  Wei Wang,et al.  Proactive storage at caching-enable base stations in cellular networks , 2013, 2013 IEEE 24th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC).

[122]  Xianfu Chen,et al.  Energy-Efficiency Oriented Traffic Offloading in Wireless Networks: A Brief Survey and a Learning Approach for Heterogeneous Cellular Networks , 2015, IEEE Journal on Selected Areas in Communications.

[123]  Amir K. Khandani,et al.  Statistical decision making in adaptive modulation and coding for 3G wireless systems , 2005, IEEE Transactions on Vehicular Technology.

[124]  Wha Sook Jeon,et al.  Two-Stage Semi-Distributed Resource Management for Device-to-Device Communication in Cellular Networks , 2014, IEEE Transactions on Wireless Communications.

[125]  F. Richard Yu,et al.  Software-Defined Device-to-Device (D2D) Communications in Virtual Wireless Networks With Imperfect Network State Information (NSI) , 2016, IEEE Transactions on Vehicular Technology.