AuTO: scaling deep reinforcement learning for datacenter-scale automatic traffic optimization
暂无分享,去创建一个
Feng Liu | Kai Chen | Li Chen | Justinas Lingys | Kai Chen | Li Chen | Feng Liu | Feng Liu | Justinas Lingys
[1] Ali Munir,et al. Minimizing flow completion times in data centers , 2013, 2013 Proceedings IEEE INFOCOM.
[2] Haitao Wu,et al. Enabling ECN in Multi-Service Multi-Queue Data Centers , 2016, NSDI.
[3] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[4] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[5] Hari Balakrishnan,et al. TCP ex machina: computer-generated congestion control , 2013, SIGCOMM.
[6] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[7] Junda Liu,et al. Multi-enterprise networking , 2000 .
[8] Kai Chen,et al. Scheduling Mix-flows in Commodity Datacenters with Karuna , 2016, SIGCOMM.
[9] George Varghese,et al. CONGA: distributed congestion-aware load balancing for datacenters , 2015, SIGCOMM.
[10] Christian E. Hopps,et al. Analysis of an Equal-Cost Multi-Path Algorithm , 2000, RFC.
[11] Randy H. Katz,et al. Wrangler: Predictable and Faster Jobs using Fewer Resources , 2014, SoCC.
[12] Zhiqiang Ma,et al. HadoopWatch: A first step towards comprehensive traffic forecasting in cloud computing , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.
[13] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.
[14] Albert G. Greenberg,et al. Data center TCP (DCTCP) , 2010, SIGCOMM '10.
[15] Sergey Levine,et al. Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[16] Nick McKeown,et al. pFabric: minimal near-optimal datacenter transport , 2013, SIGCOMM.
[17] Nick McKeown,et al. OpenFlow: enabling innovation in campus networks , 2008, CCRV.
[18] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[19] Ion Stoica,et al. Efficient Coflow Scheduling Without Prior Knowledge , 2015, SIGCOMM.
[20] Haitao Wu,et al. Enabling ECN over Generic Packet Scheduling , 2016, CoNEXT.
[21] Stephen Tyree,et al. Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU , 2016, ICLR.
[22] Ion Stoica,et al. Efficient coflow scheduling with Varys , 2015, SIGCOMM.
[23] Ralph B Dell,et al. Sample size determination. , 2002, ILAR journal.
[24] Michael I. Jordan,et al. Ray: A Distributed Framework for Emerging AI Applications , 2017, OSDI.
[25] David A. Maltz,et al. Network traffic characteristics of data centers in the wild , 2010, IMC '10.
[26] David E. Culler,et al. SEDA: an architecture for well-conditioned, scalable internet services , 2001, SOSP.
[27] Hong Liu,et al. Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google's Datacenter Network , 2015, Comput. Commun. Rev..
[28] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[29] Yanhui Geng,et al. CODA: Toward Automatically Identifying and Scheduling Coflows in the Dark , 2016, SIGCOMM.
[30] R. Weisberg. A-N-D , 2011 .
[31] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[32] Haitao Wu,et al. Explicit Path Control in Commodity Data Centers: Design and Applications , 2016, IEEE/ACM Transactions on Networking.
[33] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .
[34] Kurt Hornik,et al. Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.
[35] Shalabh Bhatnagar,et al. Incremental Natural Actor-Critic Algorithms , 2007, NIPS.
[36] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[37] Behnaz Arzani,et al. Taking the Blame Game out of Data Centers Operations with NetPoirot , 2016, SIGCOMM.
[38] Albert G. Greenberg,et al. The nature of data center traffic: measurements & analysis , 2009, IMC '09.
[39] Hongzi Mao,et al. Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.
[40] Srikanth Kandula,et al. Resource Management with Deep Reinforcement Learning , 2016, HotNets.
[41] Alex C. Snoeren,et al. Inside the Social Network's (Datacenter) Network , 2015, Comput. Commun. Rev..
[42] Danijar Hafner,et al. Parallel Trust Region Policy Optimization with Multiple Actors , 2016 .
[43] Kai Chen,et al. PowerMan: An Out-of-Band Management Network for Datacenters Using Power Line Communication , 2018, NSDI.
[44] Amin Vahdat,et al. Hedera: Dynamic Flow Scheduling for Data Center Networks , 2010, NSDI.
[45] Albert G. Greenberg,et al. VL2: a scalable and flexible data center network , 2009, SIGCOMM '09.
[46] Wei Bai,et al. Information-Agnostic Flow Scheduling for Commodity Data Centers , 2015, NSDI.
[47] Amin Vahdat,et al. Helios: a hybrid electrical/optical switch architecture for modular data centers , 2010, SIGCOMM '10.
[48] Other Contributors Are Indicated Where They Contribute. Python Software Foundation , 2017 .
[49] Russell V. Lenth,et al. Some Practical Guidelines for Effective Sample Size Determination , 2001 .
[50] Michael L. Littman,et al. Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach , 1993, NIPS.
[51] Brighten Godfrey,et al. Finishing flows quickly with preemptive scheduling , 2012, CCRV.