Learning scheduling algorithms for data processing clusters
暂无分享,去创建一个
Hongzi Mao | Malte Schwarzkopf | Mohammad Alizadeh | Shaileshh Bojja Venkatakrishnan | Zili Meng | Malte Schwarzkopf | Hongzi Mao | M. Alizadeh | Zili Meng | S. Venkatakrishnan | Mohammad Alizadeh
[1] James E. Kelley,et al. Critical-path planning and scheduling , 1899, IRE-AIEE-ACM '59 (Eastern).
[2] R. Bellman. Dynamic Programming , 1957, Science.
[3] W. K. Hastings,et al. Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .
[4] Bruce M. Maggs,et al. Universal packet routing algorithms , 1988, [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science.
[5] David B. Shmoys,et al. Improved approximation algorithms for shop scheduling problems , 1991, SODA '91.
[6] Chandrasekharan Rajendran,et al. A No-Wait Flowshop Scheduling Heuristic to Minimize Makespan , 1994 .
[7] Building on intuition , 1994 .
[8] John N. Tsitsiklis,et al. Neuro-dynamic programming: an overview , 1995, Proceedings of 1995 34th IEEE Conference on Decision and Control.
[9] Martin T. Hagan,et al. Neural network design , 1995 .
[10] C. Leiserson,et al. Scheduling multithreaded computations by work stealing , 1999, Proceedings 35th Annual Symposium on Foundations of Computer Science.
[11] Michael I. Jordan,et al. PEGASUS: A policy search method for large MDPs and POMDPs , 2000, UAI.
[12] Lex Weaver,et al. The Optimal Reward Baseline for Gradient-Based Reinforcement Learning , 2001, UAI.
[13] Peter L. Bartlett,et al. Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning , 2001, J. Mach. Learn. Res..
[14] Ashish Goel,et al. Multi-processor scheduling to minimize flow time with ε resource augmentation , 2004, STOC '04.
[15] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[16] Shie Mannor,et al. Basis Function Adaptation in Temporal Difference Reinforcement Learning , 2005, Ann. Oper. Res..
[17] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[18] Peter Geibel,et al. Reinforcement Learning for MDPs with Constraints , 2006, ECML.
[19] Nasser M. Nasrabadi,et al. Pattern Recognition and Machine Learning , 2006, Technometrics.
[20] Yuan Yu,et al. Dryad: distributed data-parallel programs from sequential building blocks , 2007, EuroSys '07.
[21] Mark D. Hill,et al. Amdahl's Law in the Multicore Era , 2008, Computer.
[22] Ola Svensson,et al. (Acyclic) Job Shops are Hard to Approximate , 2008, 2008 49th Annual IEEE Symposium on Foundations of Computer Science.
[23] Timothy J. Lowe,et al. Building intuition insights from basic operations management models and principles , 2008 .
[24] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[25] Luiz André Barroso,et al. The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.
[26] Andrew V. Goldberg,et al. Quincy: fair scheduling for distributed computing clusters , 2009, SOSP '09.
[27] Craig Chambers,et al. FlumeJava: easy, efficient data-parallel pipelines , 2010, PLDI '10.
[28] Scott Shenker,et al. Spark: Cluster Computing with Working Sets , 2010, HotCloud.
[29] Benjamin Hindman,et al. Dominant Resource Fairness: Fair Allocation of Multiple Resource Types , 2011, NSDI.
[30] Randy H. Katz,et al. Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center , 2011, NSDI.
[31] JIT compilation policy for modern machines , 2011, OOPSLA.
[32] Srikanth Kandula,et al. Jockey: guaranteed job latency in data parallel clusters , 2012, EuroSys '12.
[33] Michael J. Franklin,et al. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing , 2012, NSDI.
[34] Srikanth Kandula,et al. Reoptimizing Data Parallel Computing , 2012, NSDI.
[35] Michael Abd-El-Malek,et al. Omega: flexible, scalable schedulers for large compute clusters , 2013, EuroSys '13.
[36] Luiz André Barroso,et al. The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second Edition , 2013, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second Edition.
[37] Carlo Curino,et al. Apache Hadoop YARN: yet another resource negotiator , 2013, SoCC.
[38] Christina Delimitrou,et al. Paragon: QoS-aware scheduling for heterogeneous datacenters , 2013, ASPLOS '13.
[39] David E. Culler,et al. Hierarchical scheduling for diverse datacenter workloads , 2013, SoCC.
[40] Scott Shenker,et al. Choosy: max-min fair sharing for datacenter jobs with constraints , 2013, EuroSys '13.
[41] Christina Delimitrou,et al. Quasar: resource-efficient and QoS-aware cluster management , 2014, ASPLOS.
[42] Srikanth Kandula,et al. Multi-resource packing for cluster schedulers , 2014, SIGCOMM.
[43] Abhishek Verma,et al. Evaluating job packing in warehouse-scale computing , 2014, 2014 IEEE International Conference on Cluster Computing (CLUSTER).
[44] Abhishek Verma,et al. Large-scale cluster management at Google with Borg , 2015, EuroSys.
[45] Scott Shenker,et al. Making Sense of Performance in Data Analytics Frameworks , 2015, NSDI.
[46] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[47] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[48] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[49] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[50] Srikanth Kandula,et al. Resource Management with Deep Reinforcement Learning , 2016, HotNets.
[51] Benjamin Moseley,et al. Scheduling Parallel DAG Jobs Online to Minimize Average Flow Time , 2016, SODA.
[52] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.
[53] Srikanth Kandula,et al. Efficient queue management for cluster scheduling , 2016, EuroSys.
[54] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[55] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[56] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.
[57] Srikanth Kandula,et al. This Paper Is Included in the Proceedings of the 12th Usenix Symposium on Operating Systems Design and Implementation (osdi '16). Graphene: Packing and Dependency-aware Scheduling for Data-parallel Clusters G: Packing and Dependency-aware Scheduling for Data-parallel Clusters , 2022 .
[58] Mor Harchol-Balter,et al. TetriSched: global rescheduling with adaptive plan-ahead in dynamic heterogeneous clusters , 2016, EuroSys.
[59] Aditya Akella,et al. Altruistic Scheduling in Multi-Resource Clusters , 2016, OSDI.
[60] Glen Berseth,et al. Terrain-adaptive locomotion skills using deep reinforcement learning , 2016, ACM Trans. Graph..
[61] Xavier Bresson,et al. Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.
[62] Robert N. M. Watson,et al. Firmament: Fast, Centralized Cluster Scheduling at Scale , 2016, OSDI.
[63] Hongzi Mao,et al. Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.
[64] Max Welling,et al. Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.
[65] Mykel J. Kochenderfer,et al. Cooperative Multi-agent Control Using Deep Reinforcement Learning , 2017, AAMAS Workshops.
[66] Pieter Abbeel,et al. Constrained Policy Optimization , 2017, ICML.
[67] Abhinav Gupta,et al. Robust Adversarial Reinforcement Learning , 2017, ICML.
[68] Samy Bengio,et al. Device Placement Optimization with Reinforcement Learning , 2017, ICML.
[69] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.
[70] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[71] Kejiang Ye,et al. Imbalance in the cloud: An analysis on Alibaba cluster trace , 2017, 2017 IEEE International Conference on Big Data (Big Data).
[72] Le Song,et al. 2 Common Formulation for Greedy Algorithms on Graphs , 2018 .
[73] Razvan Pascanu,et al. Relational inductive biases, deep learning, and graph networks , 2018, ArXiv.
[74] Quoc V. Le,et al. A Hierarchical Model for Device Placement , 2018, ICLR.
[75] Hongzi Mao,et al. Placeto: Efficient Progressive Device Placement Optimization , 2018 .
[76] Tamim Asfour,et al. Model-Based Reinforcement Learning via Meta-Policy Optimization , 2018, CoRL.
[77] Saeid Nahavandi,et al. Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications , 2018, ArXiv.
[78] Ion Stoica,et al. Learning to Optimize Join Queries With Deep Reinforcement Learning , 2018, ArXiv.
[79] Zhuwen Li,et al. Combinatorial Optimization with Graph Convolutional Networks and Guided Tree Search , 2018, NeurIPS.
[80] Hongzi Mao,et al. Placeto: Learning Generalizable Device Placement Algorithms for Distributed Machine Learning , 2019, NeurIPS.
[81] Tim Kraska,et al. Neo: A Learned Query Optimizer , 2019, Proc. VLDB Endow..
[82] Hongzi Mao,et al. Variance Reduction for Reinforcement Learning in Input-Driven Environments , 2018, ICLR.
[83] Saeid Nahavandi,et al. Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications , 2018, IEEE Transactions on Cybernetics.
[84] A. Gupta,et al. SCALING MULTI-AGENT REINFORCEMENT LEARNING , 2020 .
[85] Yuandong Tian,et al. Real-world Video Adaptation with Reinforcement Learning , 2019, ArXiv.