SmartCC: A Reinforcement Learning Approach for Multipath TCP Congestion Control in Heterogeneous Networks

The Multipath TCP (MPTCP) protocol has been standardized by the IETF as an extension of conventional TCP that enables multi-homed devices to establish multiple paths for simultaneous data transmission. Congestion control is a fundamental mechanism in the design and implementation of MPTCP. Because heterogeneous links have diverse QoS characteristics, existing multipath congestion control mechanisms suffer from performance problems such as bufferbloat and suboptimal bandwidth usage. In this paper, we propose a learning-based multipath congestion control approach called SmartCC to deal with the diversity of multiple communication paths in heterogeneous networks. SmartCC adopts an asynchronous reinforcement learning framework to learn a set of congestion rules, which allow the sender to observe the environment and adaptively adjust the subflows' congestion windows to fit different network situations. To deal with the problem of infinite states in a high-dimensional space, we propose a hierarchical tile coding algorithm for state aggregation and a function estimation approach for $Q$-learning, which can derive the optimal policy efficiently. Owing to the asynchronous design of SmartCC, model training and execution are decoupled, so the learning process introduces no extra delay or overhead into the decision-making process of MPTCP congestion control. We conduct extensive experiments for performance evaluation, which show that SmartCC improves the aggregate throughput significantly and outperforms the state-of-the-art mechanisms on a variety of performance metrics.
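
To illustrate how tile coding can aggregate a continuous state space for $Q$-learning with function estimation, the following is a minimal sketch and not the SmartCC implementation. It uses plain single-level tile coding with a linear estimator, not the hierarchical variant proposed here. The state features (a normalized RTT and a congestion window size), the three-valued action set, and all class names and parameters are assumptions introduced only for this example.

```python
# Hypothetical sketch: plain tile coding + linear Q-learning.
# Not the paper's hierarchical algorithm; names and parameters are assumed.

ACTIONS = (-1, 0, 1)  # assumed actions: shrink, hold, or grow a subflow's window


class TileCoder:
    """Maps a continuous state to one active tile per tiling."""

    def __init__(self, lows, highs, tiles_per_dim=8, num_tilings=4):
        self.lows, self.highs = lows, highs
        self.tiles_per_dim = tiles_per_dim
        self.num_tilings = num_tilings
        self.dims = len(lows)
        # One extra cell per dimension absorbs the per-tiling offset.
        self.cells_per_tiling = (tiles_per_dim + 1) ** self.dims
        self.num_features = num_tilings * self.cells_per_tiling

    def active_tiles(self, state):
        indices = []
        for t in range(self.num_tilings):
            offset = t / self.num_tilings  # shift, in fractions of a tile width
            index = 0
            for d in range(self.dims):
                span = self.highs[d] - self.lows[d]
                clipped = min(max(state[d], self.lows[d]), self.highs[d])
                scaled = (clipped - self.lows[d]) / span * self.tiles_per_dim
                cell = min(int(scaled + offset), self.tiles_per_dim)
                index = index * (self.tiles_per_dim + 1) + cell
            indices.append(t * self.cells_per_tiling + index)
        return indices


class LinearQ:
    """Q(s, a) estimated as the sum of the weights of the active tiles."""

    def __init__(self, coder, num_actions, alpha=0.1, gamma=0.9):
        self.coder = coder
        self.alpha = alpha / coder.num_tilings  # step size split across tilings
        self.gamma = gamma
        self.weights = [[0.0] * coder.num_features for _ in range(num_actions)]

    def value(self, state, action):
        return sum(self.weights[action][i] for i in self.coder.active_tiles(state))

    def greedy_action(self, state):
        return max(range(len(self.weights)), key=lambda a: self.value(state, a))

    def update(self, state, action, reward, next_state):
        # Standard Q-learning target: r + gamma * max_a' Q(s', a').
        best_next = max(self.value(next_state, a) for a in range(len(self.weights)))
        td_error = reward + self.gamma * best_next - self.value(state, action)
        for i in self.coder.active_tiles(state):
            self.weights[action][i] += self.alpha * td_error
        return td_error


if __name__ == "__main__":
    # Assumed state: (normalized RTT in [0, 1], congestion window in [0, 100]).
    coder = TileCoder(lows=[0.0, 0.0], highs=[1.0, 100.0])
    q = LinearQ(coder, num_actions=len(ACTIONS))
    state, next_state = [0.5, 50.0], [0.5, 51.0]
    q.update(state, action=2, reward=1.0, next_state=next_state)
    print("greedy window change:", ACTIONS[q.greedy_action(state)])
```

Because nearby states activate the same tiles, one update generalizes to neighboring RTT and window values, which is what keeps the table finite when the raw state space is continuous. Table updates like `update` could run in a separate training process while the sender only calls `greedy_action`, in the spirit of the decoupled training and execution described above.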
