论文信息 - A Machine Learning Approach to Routing

A Machine Learning Approach to Routing

Can ideas and techniques from machine learning be leveraged to automatically generate "good" routing configurations? We investigate the power of data-driven routing protocols. Our results suggest that applying ideas and techniques from deep reinforcement learning to this context yields high performance, motivating further research along these lines.

[1] Mikkel Thorup,et al. Traffic engineering with traditional IP routing protocols , 2002, IEEE Commun. Mag..

[2] Anne-Marie Kermarrec,et al. The many faces of publish/subscribe , 2003, CSUR.

[3] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[4] Srikanth Kandula,et al. Walking the tightrope: responsive yet stable traffic engineering , 2005, SIGCOMM '05.

[5] Marco Chiesa,et al. Lying Your Way to Better Traffic Engineering , 2016, CoNEXT.

[6] Nikhil R. Devanur,et al. ProjecToR: Agile Reconfigurable Data Center Interconnect , 2016, SIGCOMM.

[7] Yuval Shavitt,et al. Maximum Flow Routing with Weighted Max-Min Fairness , 2004, QofIS.

[8] Paramvir Bahl,et al. Augmenting data center networks with multi-gigabit wireless links , 2011, SIGCOMM.

[9] Michael L. Littman,et al. Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach , 1993, NIPS.

[10] Farhad Shahrokhi,et al. The maximum concurrent flow problem , 1990, JACM.

[11] Himanshu Shah,et al. FireFly , 2014, SIGCOMM.

[12] Hari Balakrishnan,et al. Resilient overlay networks , 2001, SOSP.

[13] Mikkel Thorup,et al. Optimizing OSPF/IS-IS weights in a changing world , 2002, IEEE J. Sel. Areas Commun..

[14] Amin Vahdat,et al. Hedera: Dynamic Flow Scheduling for Data Center Networks , 2010, NSDI.

[15] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.

[16] Jan Peters,et al. Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[17] Elman Mansimov,et al. Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation , 2017, NIPS.

[18] Matthew Roughan,et al. The Internet Topology Zoo , 2011, IEEE Journal on Selected Areas in Communications.

[19] Paramvir Bahl,et al. Augmenting data center networks with multi-gigabit wireless links , 2011, SIGCOMM 2011.

[20] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.

[21] Ao Tang,et al. HALO: Hop-by-Hop Adaptive Link-State Optimal Routing , 2015, IEEE/ACM Transactions on Networking.

[22] Jürgen Teich,et al. Packet routing in dynamically changing networks on chip , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[23] Ben Y. Zhao,et al. Mirror mirror on the ceiling: flexible wireless links for data centers , 2012, CCRV.

[24] T. Chow,et al. Nonlinear autoregressive integrated neural network model for short-term load forecasting , 1996 .

[25] Kavé Salamatian,et al. Traffic matrix estimation: existing techniques and new directions , 2002, SIGCOMM '02.

[26] Hao Wang,et al. Lube: Mitigating Bottlenecks in Wide Area Data Analytics , 2017, HotCloud.

[27] Robert Soulé,et al. Kulfi: Robust Traffic Engineering Using Semi-Oblivious Routing , 2016, ArXiv.

[28] Mung Chiang,et al. Link-State Routing with Hop-by-Hop Forwarding Can Achieve Optimal Traffic Engineering , 2008, IEEE INFOCOM 2008 - The 27th Conference on Computer Communications.

[29] Wojciech Czarnecki,et al. On Loss Functions for Deep Neural Networks in Classification , 2017, ArXiv.

[30] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.

[31] Peter Dayan,et al. Q-learning , 1992, Machine Learning.

[32] Srikanth Kandula,et al. Resource Management with Deep Reinforcement Learning , 2016, HotNets.

[33] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[34] Hari Balakrishnan,et al. TCP ex machina: computer-generated congestion control , 2013, SIGCOMM.

[35] Vyas Sekar,et al. Unleashing the Potential of Data-Driven Networking , 2017, COMSNETS.

[36] Ameet Talwalkar,et al. Foundations of Machine Learning , 2012, Adaptive computation and machine learning.

[37] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[38] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[39] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[40] Albert G. Greenberg,et al. Data center TCP (DCTCP) , 2010, SIGCOMM '10.

[41] Mo Dong,et al. PCC: Re-architecting Congestion Control for Consistent High Performance , 2014, NSDI.

[42] Edith Cohen,et al. Optimal oblivious routing in polynomial time , 2003, STOC '03.

[43] Christopher M. Bishop,et al. Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[44] Albert G. Greenberg,et al. Experience in measuring backbone traffic variability: models, metrics, measurements and meaning , 2002, IMW '02.

[45] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[46] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[47] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[48] Mikkel Thorup,et al. Increasing Internet Capacity Using Local Search , 2004, Comput. Optim. Appl..