Deep-RL: Deep Reinforcement Learning for Marking-Aware via per-Port in Data Centers

In this paper, we propose Deep-RL—a marking decision with Deep Reinforcement Learning (DRL) via per-port for solving the erroneously marking problems in multi-queue multi service scenarios of Data Center Networks (DCNs). We formulate the statement as a DRL problem and use Deep Neural Network (DNN) to achieve the best possible policy for the agent. In this way we can model the complex DCNs in order to obtain the optimal threshold in the output port when the marked packets from queue buffers are not a concern. Unlike prior research that focused on mathematical models or used machine learning in DCN, Deep-RL is a novel DRL based method, which optimizes the real value of threshold with continues action space. Thus, this fact makes our work incomparable with previous research. To the best of our knowledge, we are the first to discuss the problem with DRL and DNN. Simulation results demonstrate that Deep-RL utilizes the buffer capacity at exactly 30% and achieves near optimal flow completion time.

[1]  Amin Vahdat,et al.  Helios: a hybrid electrical/optical switch architecture for modular data centers , 2010, SIGCOMM '10.

[2]  Hamid Mirvaziri,et al.  BDCC: Backpressure routing and dynamic prioritization for congestion control in WMSNs , 2014 .

[3]  Keqin Li,et al.  DemePro: DEcouple packet Marking from Enqueuing for multiple services with PROactive congestion control , 2017 .

[4]  Albert G. Greenberg,et al.  Data center TCP (DCTCP) , 2010, SIGCOMM '10.

[5]  Guihai Chen,et al.  Adaptive Routing Reconfigurations to Minimize Flow Cost in SDN-Based Data Center Networks , 2019, ICPP.

[6]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[7]  Fengyuan Ren,et al.  Improving ECN marking scheme with micro-burst traffic in data center networks , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[8]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[9]  Feng Liu,et al.  AuTO: scaling deep reinforcement learning for datacenter-scale automatic traffic optimization , 2018, SIGCOMM.

[10]  Haitao Wu,et al.  Enabling ECN in Multi-Service Multi-Queue Data Centers , 2016, NSDI.

[11]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[12]  Shalabh Bhatnagar,et al.  Incremental Natural Actor-Critic Algorithms , 2007, NIPS.

[13]  Guihai Chen,et al.  Support ECN in Multi-Queue Datacenter Networks via Per-Port Marking with Selective Blindness , 2018, 2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS).