论文信息 - Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies

Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies

The recent growth of emergent network applications (e.g., satellite networks, vehicular networks) is increasing the complexity of managing modern communication networks. As a result, the community proposed the Digital Twin Networks (DTN) as a key enabler of efficient network management. Network operators can leverage the DTN to perform different optimization tasks (e.g., Traffic Engineering, Network Planning).Deep Reinforcement Learning (DRL) showed a high performance when applied to solve network optimization problems. In the context of DTN, DRL can be leveraged to solve optimization problems without directly impacting the real-world network behavior. However, DRL scales poorly with the problem size and complexity. In this paper, we explore the use of Evolutionary Strategies (ES) to train DRL agents for solving a routing optimization problem. The experimental results show that ES achieved a training time speed-up of 128 and 6 for the NSFNET and GEANT2 topologies respectively.

[1] Diego Perino,et al. Digital Twin Network: Opportunities and Challenges , 2022, ArXiv.

[2] Albert Cabellos-Aparicio,et al. Is Machine Learning Ready for Traffic Engineering Optimization? , 2021, 2021 IEEE 29th International Conference on Network Protocols (ICNP).

[3] Satyajeet Singh Ahuja,et al. Network planning with deep reinforcement learning , 2021, SIGCOMM.

[4] Lisandro Dalcin,et al. mpi4py: Status Update After 12 Years of Development , 2021, Computing in Science & Engineering.

[5] Jaime Fern'andez del R'io,et al. Array programming with NumPy , 2020, Nature.

[6] Stefano Nolfi,et al. Efficacy of Modern Neuro-Evolutionary Strategies for Continuous Control Optimization , 2019, Frontiers in Robotics and AI.

[7] A. Cabellos-Aparicio,et al. Deep reinforcement learning meets graph neural networks: Exploring a routing optimization use case , 2019, Comput. Commun..

[8] Albert Cabellos-Aparicio,et al. Routing in optical transport networks with deep reinforcement learning , 2019, IEEE/OSA Journal of Optical Communications and Networking.

[9] Kagan Tumer,et al. Evolutionary Reinforcement Learning , 2018, NIPS 2018.

[10] Roberto Proietti,et al. Deep-RMSA: A Deep-Reinforcement-Learning Routing, Modulation and Spectrum Assignment Agent for Elastic Optical Networks , 2018, 2018 Optical Fiber Communications Conference and Exposition (OFC).

[11] Matthew W. Hoffman,et al. Distributed Distributional Deterministic Policy Gradients , 2018, ICLR.

[12] Dan Horgan,et al. Distributed Prioritized Experience Replay , 2018, ICLR.

[13] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.

[14] Hongzi Mao,et al. Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.

[15] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.

[16] Arjun Chandra,et al. Efficient Parallel Methods for Deep Reinforcement Learning , 2017, ArXiv.

[17] Xi Chen,et al. Evolution Strategies as a Scalable Alternative to Reinforcement Learning , 2017, ArXiv.

[18] Stephen Tyree,et al. Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU , 2016, ICLR.

[19] J. Schulman,et al. OpenAI Gym , 2016, ArXiv.

[20] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[21] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.

[22] Tiejun J. Xia,et al. How will optical transport deal with future network traffic growth? , 2014, 2014 The European Conference on Optical Communication (ECOC).

[23] Fernando Barreto,et al. Fast emergency paths schema to overcome transient link failures in ospf routing , 2012, ArXiv.

[24] Xavier Glorot,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[25] Jun Zhang,et al. Wavelength converter placement in least-load-routing-based optical networks using genetic algorithms , 2004 .

[26] Angela Chiu,et al. Issues for routing in the optical layer , 2001, IEEE Commun. Mag..

[27] Xiang Shi,et al. ENERO: Efficient Real-Time Routing Optimization , 2021, ArXiv.

[28] NetworkX , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..