A Deep Reinforcement Learning-based Routing Scheme with Two Modes for Dynamic Networks

With the development of communication and transmission technologies, more and more applications, like Internet of vehicles and tele-medicine, become more sensitive to network latency and accuracy, which requires routing schemes to be more efficient. In order to meet such urgent need, learning-based routing strategies emerges, with the advantages of high flexibility and accuracy. These strategies can be divided into two categories, centralized and distributed, enjoying the advantages of high precision and high efficiency, respectively. However, routing become more complex in dynamic network, where the link connections and access states are time-varying, so these learning-based routing mechanisms are required to be able to adapt to network changes in real time. In this paper, we designed and implemented both two of centralized and distributed reinforcement learning-based routing schemes (RLR-T). By conducting a series of experiments, we deeply analyzed the results and gave the conclusion that the centralized is better to cope with dynamic networks due to its faster reconvergence, while the distributed is better to handle with large-scale networks by its high scalability.

[1]  Brighten Godfrey,et al.  DRILL: Micro Load Balancing for Low-latency Data Center Networks , 2017, SIGCOMM.

[2]  Wenjie Liu,et al.  Neural Packet Routing , 2020, NetAI@SIGCOMM.

[3]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[4]  Zhiyang Li,et al.  Deep-Reinforcement-Learning-Based QoS-Aware Secure Routing for SDN-IoT , 2020, IEEE Internet of Things Journal.

[5]  Chau Yuen,et al.  Intelligent Task Offloading for Heterogeneous V2X Communications , 2020, IEEE Transactions on Intelligent Transportation Systems.

[6]  Yong Liang Guan,et al.  Communication and Computing Resource Optimization for Connected Autonomous Driving , 2020, IEEE Transactions on Vehicular Technology.

[7]  Ejder Bastug,et al.  Hierarchical Deep Double Q-Routing , 2020, ICC 2020 - 2020 IEEE International Conference on Communications (ICC).

[8]  Song Guo,et al.  Experience-Driven Computational Resource Allocation of Federated Learning by Deep Reinforcement Learning , 2020, 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[9]  Dafna Shahaf,et al.  Learning to Route , 2017, HotNets.

[10]  Ke Xu,et al.  AutoSight: Distributed Edge Caching in Short Video Network , 2020, IEEE Network.

[11]  Ke Zhang,et al.  Deep Learning Empowered Task Offloading for Mobile Edge Computing in Urban Informatics , 2019, IEEE Internet of Things Journal.

[12]  Matthew Roughan,et al.  The Internet Topology Zoo , 2011, IEEE Journal on Selected Areas in Communications.

[13]  Xianfu Chen,et al.  GAN-Based Deep Distributional Reinforcement Learning for Resource Management in Network Slicing , 2019, 2019 IEEE Global Communications Conference (GLOBECOM).

[14]  Ke Zhang,et al.  Deep Reinforcement Learning for Social-Aware Edge Computing and Caching in Urban Informatics , 2020, IEEE Transactions on Industrial Informatics.

[15]  Mohammad S. Obaidat,et al.  Reinforcement Learning-Based Routing Protocol for Opportunistic Networks , 2020, ICC 2020 - 2020 IEEE International Conference on Communications (ICC).

[16]  Nei Kato,et al.  Routing or Computing? The Paradigm Shift Towards Intelligent Computer Network Packet Transmission Based on Deep Learning , 2017, IEEE Transactions on Computers.

[17]  Yan Zhang,et al.  Deep Reinforcement Learning for Stochastic Computation Offloading in Digital Twin Networks , 2020, IEEE Transactions on Industrial Informatics.

[18]  Jingyu Wang,et al.  Toward Greater Intelligence in Route Planning: A Graph-Aware Deep Learning Approach , 2020, IEEE Systems Journal.

[19]  Xin Jin,et al.  Neural packet classification , 2019, SIGCOMM.

[20]  Bo Yang,et al.  Edge Intelligence for Autonomous Driving in 6G Wireless System: Design Challenges and Solutions , 2020, IEEE Wireless Communications.

[21]  Robert Soulé,et al.  Semi-Oblivious Traffic Engineering: The Road Not Taken , 2018, NSDI.

[22]  Yuchao Zhang,et al.  DND: The Controllability of Dynamic Temporal Network in Smart Transportations , 2019, 2019 IEEE Globecom Workshops (GC Wkshps).