Reinforcement Learning Based Congestion Control in a Real Environment

Congestion control plays an important role in the Internet to handle real-world network traffic. It has been dominated by hand-crafted heuristics for decades. Recently, reinforcement learning shows great potentials to automatically learn optimal or near-optimal control policies to enhance the performance of congestion control. However, existing solutions train agents in either simulators or emulators, which cannot fully reflect the real-world environment and degrade the performance of network communication. In order to eliminate the performance degradation caused by training in the simulated environment, we first highlight the necessity and challenges to train a learningbased agent in real-world networks. Then we propose a framework, ARC, for learning congestion control policies in a real environment based on asynchronous execution and demonstrate its effectiveness in accelerating the training. We evaluate our scheme on the real testbed and compare it with state-of-the-art congestion control schemes. Experimental results demonstrate that our schemes can achieve higher throughput and lower latency in comparison with existing schemes.

[1]  Van Jacobson,et al.  BBR: Congestion-Based Congestion Control , 2016, ACM Queue.

[2]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[3]  Qian Zhang,et al.  A Compound TCP Approach for High-Speed and Long Distance Networks , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[4]  Xin Wang,et al.  Neural Network Meets DCN: Traffic-driven Topology Adaptation with Deep Learning , 2018, SIGMETRICS.

[5]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[6]  Mo Dong,et al.  PCC: Re-architecting Congestion Control for Consistent High Performance , 2014, NSDI.

[7]  Yong Jiang,et al.  Machine Learning for Internet Congestion Control: Techniques and Challenges , 2019, IEEE Internet Computing.

[8]  Ben Leong,et al.  TCP Congestion Control Beyond Bandwidth-Delay Product for Mobile Cellular Networks , 2017, CoNEXT.

[9]  Hongzi Mao,et al.  Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.

[10]  Hari Balakrishnan,et al.  Copa: Practical Delay-Based Congestion Control for the Internet , 2018, ANRW.

[11]  Joelle Pineau,et al.  MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions , 2019, ArXiv.

[12]  Mao Yang,et al.  The Case for Learning-and-System Co-design , 2019, ACM SIGOPS Oper. Syst. Rev..

[13]  Mike Hibler,et al.  An integrated experimental environment for distributed systems and networks , 2002, OPSR.

[14]  Hari Balakrishnan,et al.  Mahimahi: Accurate Record-and-Replay for HTTP , 2015, USENIX Annual Technical Conference.

[15]  Dafna Shahaf,et al.  Learning to Route , 2017, HotNets.

[16]  Philip Levis,et al.  Pantheon: the training ground for Internet congestion-control research , 2018, USENIX Annual Technical Conference.

[17]  Yi Li,et al.  L3VTP: A Low-Latency Live Video Transmission Platform , 2019, SIGCOMM Posters and Demos.

[18]  Xin Wang,et al.  Low-Latency Networking: Architecture, Techniques, and Opportunities , 2018, IEEE Internet Computing.

[19]  Yuandong Tian,et al.  Real-world Video Adaptation with Reinforcement Learning , 2019, ArXiv.

[20]  Xin Wang,et al.  STMS: Improving MPTCP Throughput Under Heterogeneous Networks , 2018, USENIX Annual Technical Conference.

[21]  Van Jacobson,et al.  Congestion avoidance and control , 1988, SIGCOMM '88.

[22]  Ulrich Herberg 在事件層平行方法下 NS2 網路模擬器的效能量測 , 2006 .

[23]  Tim Kraska,et al.  Park: An Open Platform for Learning-Augmented Computer Systems , 2019, NeurIPS.

[24]  Brighten Godfrey,et al.  A Deep Reinforcement Learning Perspective on Internet Congestion Control , 2019, ICML.

[25]  Lakshminarayanan Subramanian,et al.  Adaptive Congestion Control for Unpredictable Cellular Networks , 2015, Comput. Commun. Rev..

[26]  Xin Wang,et al.  Machine Learning for Networking: Workflow, Advances and Opportunities , 2017, IEEE Network.

[27]  Johannes Gehrke,et al.  Reinforcement learning for bandwidth estimation and congestion control in real-time communications , 2019, ArXiv.

[28]  Hari Balakrishnan,et al.  TCP ex machina: computer-generated congestion control , 2013, SIGCOMM.

[29]  Vyas Sekar,et al.  Unleashing the Potential of Data-Driven Networking , 2017, COMSNETS.

[30]  Brighten Godfrey,et al.  Internet Congestion Control via Deep Reinforcement Learning , 2018, ArXiv.

[31]  Injong Rhee,et al.  CUBIC: a new TCP-friendly high-speed TCP variant , 2008, OPSR.