Learning-Based Proactive Resource Allocation for Delay-Sensitive Packet Transmission

In this article, a learning-based proactive resource sharing scheme is proposed for the next-generation core communication networks, where the available forwarding resources at a switch are proactively allocated to the traffic flows in order to maximize the efficiency of resource utilization with delay satisfaction. The resource sharing scheme consists of two joint modules, estimation of resource demands and allocation of available resources. For service provisioning, resource demand of each traffic flow is estimated based on the predicted packet arrival rate. Considering the distinct features of each traffic flow, a linear regression algorithm is developed for resource demand estimation, utilizing the mapping relation between traffic flow status and required resources, upon which a network switch makes decision on allocating available resources for delay satisfaction and efficient resource utilization. To learn the implicit relation between the allocated resources and delay, a multi-armed bandit learning-based resource allocation scheme is proposed, which enables fast resource allocation adjustment to traffic arrival dynamics. The proposed algorithm is proved to be asymptotically approaching the optimal strategy, with polynomial time complexity. Extensive simulation results are presented to demonstrate the effectiveness of the proposed resource sharing scheme in terms of delay satisfaction, traffic adaptiveness, and resource allocation gain.

[1]  Michael Menth,et al.  Deficit Round Robin with Limited Deficit Savings (DRR-LDS) for Fairness Among TCP Users , 2018, MMB.

[2]  Peter Auer,et al.  Using Confidence Bounds for Exploitation-Exploration Trade-offs , 2003, J. Mach. Learn. Res..

[3]  Zheng Wang,et al.  An Architecture for Differentiated Services , 1998, RFC.

[4]  Sunil Kumar,et al.  Intelligent Software-Defined Mesh Networks With Link-Failure Adaptive Traffic Balancing , 2018, IEEE Transactions on Cognitive Communications and Networking.

[5]  Parameswaran Ramanathan,et al.  Proportional differentiated services: delay differentiation and packet scheduling , 1999, SIGCOMM '99.

[6]  Li Zhou,et al.  A Survey on Contextual Multi-armed Bandits , 2015, ArXiv.

[7]  Daniel P. Palomar,et al.  A Signal Processing Perspective of Financial Engineering , 2016, Found. Trends Signal Process..

[8]  Ning Zhang,et al.  Content Popularity Prediction Towards Location-Aware Mobile Edge Caching , 2018, IEEE Transactions on Multimedia.

[9]  Mo Dong,et al.  PCC: Re-architecting Congestion Control for Consistent High Performance , 2014, NSDI.

[10]  Xin Yan,et al.  Linear Regression Analysis: Theory and Computing , 2009 .

[11]  Gaurav Raina,et al.  Stability and Performance of Compound TCP With a Proportional Integral Queue Policy , 2019, IEEE Transactions on Control Systems Technology.

[12]  Van Jacobson,et al.  BBR: Congestion-Based Congestion Control , 2016, ACM Queue.

[13]  Weihua Zhuang,et al.  Robust Online Composition, Routing and NF Placement for NFV-Enabled Services , 2020, IEEE Journal on Selected Areas in Communications.

[14]  Weihua Zhuang,et al.  A Virtual Network Customization Framework for Multicast Services in NFV-Enabled Core Networks , 2020, IEEE Journal on Selected Areas in Communications.

[15]  Mehryar Mohri,et al.  Multi-armed Bandit Algorithms and Empirical Evaluation , 2005, ECML.

[16]  George Varghese,et al.  Efficient fair queueing using deficit round-robin , 1996, TNET.

[17]  Stefano Salsano,et al.  Joint Energy Efficient and QoS-Aware Path Allocation and VNF Placement for Service Function Chaining , 2017, IEEE Transactions on Network and Service Management.

[18]  Nick McKeown,et al.  PISCES: A Programmable, Protocol-Independent Software Switch , 2016, SIGCOMM.

[19]  Luca De Cicco,et al.  Controlling queuing delays for real-time communication: the interplay of E2E and AQM algorithms , 2018, CCRV.

[20]  Jian Tang,et al.  How Would you Like Your Packets Delivered? An SDN-Enabled Open Platform for QoS Routing , 2018, 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS).

[21]  Weihua Zhuang,et al.  End-to-End Quality of Service in 5G Networks: Examining the Effectiveness of a Network Slicing Framework , 2018, IEEE Vehicular Technology Magazine.

[22]  Scott Shenker,et al.  Analysis and simulation of a fair queueing algorithm , 1989, SIGCOMM '89.

[23]  Weihua Zhuang,et al.  End-to-End Delay Modeling for Embedded VNF Chains in 5G Core Networks , 2019, IEEE Internet of Things Journal.

[24]  Mohammad S. Obaidat,et al.  Resource-Optimized Multiarmed Bandit-Based Offload Path Selection in Edge UAV Swarms , 2019, IEEE Internet of Things Journal.

[25]  Andrea Zanella,et al.  QoE Multi-Stage Machine Learning for Dynamic Video Streaming , 2018, IEEE Transactions on Cognitive Communications and Networking.

[26]  Wolfgang Kellerer,et al.  DetServ: Network Models for Real-Time QoS Provisioning in SDN-Based Industrial Environments , 2017, IEEE Transactions on Network and Service Management.

[27]  Meng Chang Chen,et al.  Proportional delay differentiation service based on weighted fair queuing , 2000, Proceedings Ninth International Conference on Computer Communications and Networks (Cat.No.00EX440).

[28]  Peilin Hong,et al.  Measurement and Redesign of BBR-based MPTCP , 2019, SIGCOMM Posters and Demos.

[29]  Xuemin Shen,et al.  Delay-Aware VNF Scheduling: A Reinforcement Learning Approach With Variable Action Set , 2021, IEEE Transactions on Cognitive Communications and Networking.

[30]  Wei Chu,et al.  Contextual Bandits with Linear Payoff Functions , 2011, AISTATS.

[31]  Kun Chen,et al.  Timely-Throughput Optimal Scheduling with Prediction , 2017, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[32]  Peter L. Bartlett,et al.  Adaptive Online Gradient Descent , 2007, NIPS.

[33]  Xin Su,et al.  A Reinforcement Learning Approach for Multipath TCP Data Scheduling , 2019, 2019 IEEE 9th Annual Computing and Communication Workshop and Conference (CCWC).

[34]  Shie Mannor,et al.  Multi-User Communication Networks: A Coordinated Multi-Armed Bandit Approach , 2018, IEEE/ACM Transactions on Networking.

[35]  David L. Black,et al.  The Addition of Explicit Congestion Notification (ECN) to IP , 2001, RFC.

[36]  Yonghui Song,et al.  A New Deep-Q-Learning-Based Transmission Scheduling Mechanism for the Cognitive Internet of Things , 2018, IEEE Internet of Things Journal.

[37]  Weihua Zhuang,et al.  SDATP: An SDN-Based Traffic-Adaptive and Service-Oriented Transmission Protocol , 2020, IEEE Transactions on Cognitive Communications and Networking.

[38]  Wei Chu,et al.  A contextual-bandit approach to personalized news article recommendation , 2010, WWW '10.

[39]  Sijing Zhang,et al.  Towards 5G: A Reinforcement Learning-Based Scheduling Solution for Data Traffic Management , 2018, IEEE Transactions on Network and Service Management.

[40]  Biswanath Mukherjee,et al.  Scheduling with machine-learning-based flow detection for packet-switched optical data center networks , 2018, IEEE/OSA Journal of Optical Communications and Networking.

[41]  Ning Zhang,et al.  Fast mmwave Beam Alignment via Correlated Bandit Learning , 2019, IEEE Transactions on Wireless Communications.