Chord: Checkpoint-based scheduling using hybrid waiting list in shared clusters