Computational Science – ICCS 2020: 20th International Conference, Amsterdam, The Netherlands, June 3–5, 2020, Proceedings, Part I

Heterogeneous architectures that use Graphics Processing Units (GPUs) for general computations, in addition to multicore CPUs, are increasingly common in high-performance computing. However many of the existing methods for scheduling precedence-constrained tasks on such platforms were intended for more diversely heterogeneous clusters, such as the classic Heterogeneous Earliest Finish Time (HEFT) heuristic. We propose a new static scheduling heuristic called Heterogeneous Optimistic Finish Time (HOFT) which exploits the binary heterogeneity of accelerated platforms. Through extensive experimentation with custom software for simulating task scheduling problems on user-defined CPU-GPU platforms, we show that HOFT can obtain schedules at least 5% shorter than HEFT’s for medium-to-large numerical linear algebra application task graphs and around 3% shorter on average for a large collection of randomly-generated graphs.

[1]  D. Mulder,et al.  Effect of cardiopulmonary bypass on circulating lymphocyte function. , 1992, The Annals of thoracic surgery.

[2]  Ryan E. Grant,et al.  Enabling HPC Workloads on Cloud Infrastructure Using Kubernetes Container Orchestration Mechanisms , 2019, 2019 IEEE/ACM International Workshop on Containers and New Orchestration Paradigms for Isolated Environments in HPC (CANOPIE-HPC).

[3]  Kai Liu,et al.  Flexible Container-Based Computing Platform on Cloud for Scientific Workflows , 2016, 2016 International Conference on Cloud Computing Research and Innovations (ICCCRI).

[4]  J. Athens,et al.  Blood: leukocytes. , 1963, Annual Review of Physiology.

[5]  Idafen Santana-Perez,et al.  Towards Reproducibility in Scientific Workflows: An Infrastructure-Based Approach , 2015, Sci. Program..

[6]  Daniel Crawl,et al.  Towards an Integrated Cyberinfrastructure for Scalable Data-driven Monitoring, Dynamic Prediction and Resilience of Wildfires , 2015, ICCS.

[7]  V. Dakos,et al.  Detecting dynamical changes in nonlinear time series using locally linear state‐space models , 2012 .

[8]  Ewa Deelman,et al.  Introducing PRECIP: An API for Managing Repeatable Experiments in the Cloud , 2013, 2013 IEEE 5th International Conference on Cloud Computing Technology and Science.

[9]  Rion Dooley,et al.  Endofday: A Container Workflow Engine for Scalable, Reproducible Computation , 2016, IWSG.

[10]  Ian J. Taylor,et al.  Workflows and e-Science: An overview of workflow system features and capabilities , 2009, Future Gener. Comput. Syst..

[11]  Miron Livny,et al.  Pegasus, a workflow management system for science automation , 2015, Future Gener. Comput. Syst..

[12]  Paul Watson,et al.  A framework for scientific workflow reproducibility in the cloud , 2016, 2016 IEEE 12th International Conference on e-Science (e-Science).

[13]  María S. Pérez-Hernández,et al.  Reproducibility of execution environments in computational science using Semantics and Clouds , 2017, Future Gener. Comput. Syst..

[14]  Volkmar Falk,et al.  Humoral Immune Response During Coronary Artery Bypass Grafting: A Comparison of Limited Approach, “Off-Pump” Technique, and Conventional Cardiopulmonary Bypass , 2000, Circulation.

[15]  Shubhabrata Sengupta,et al.  Efficient Parallel Scan Algorithms for GPUs , 2011 .

[16]  Samuel Williams,et al.  Optimization of sparse matrix-vector multiplication on emerging multicore platforms , 2007, Proceedings of the 2007 ACM/IEEE Conference on Supercomputing (SC '07).

[17]  Takayuki Honda,et al.  Neutrophil left shift and white blood cell count as markers of bacterial infection. , 2016, Clinica chimica acta; international journal of clinical chemistry.

[18]  Michal Wrzeszcz,et al.  Onedata - a Step Forward towards Globalization of Data Access for Computing Infrastructures , 2015, ICCS.

[19]  T. Kleinen,et al.  Detection of climate system bifurcations by degenerate fingerprinting , 2004 .

[20]  Alban Gaignard,et al.  Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities , 2017, Future Gener. Comput. Syst..

[21]  Jens Krüger,et al.  Reproducible Scientific Workflows for High Performance and Cloud Computing , 2019, 2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[22]  Stanimire Tomov,et al.  FFT-ECP Implementation Optimizations and Features Phase , 2019 .

[23]  Jack Dongarra,et al.  GPUDirect MPI Communications and Optimizations to Accelerate FFTs on Exascale Systems , 2019 .

[24]  Valeria V. Krzhizhanovskaya,et al.  Evolutionary Game Theory Can Explain the Choice Between Apoptotic and Necrotic Pathways in Neutrophils , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[25]  Daniel Crawl,et al.  Modular Resource Centric Learning for Workflow Performance Prediction , 2017, ArXiv.

[26]  Jan Sijbers,et al.  Fast and flexible X-ray tomography using the ASTRA toolbox. , 2016, Optics express.

[27]  Emiliano Mancini,et al.  Game of neutrophils: modeling the balance between apoptosis and necrosis , 2019, BMC Bioinformatics.

[28]  Ladislav Hluchý,et al.  Abstraction Layer for Development and Deployment of Cloud Services , 2012, Comput. Sci..

[29]  Bartosz Balis,et al.  HyperFlow: A model of computation, programming approach and enactment engine for complex distributed workflows , 2016, Future Gener. Comput. Syst..

[30]  Daisuke Takahashi Implementation of Parallel 3-D Real FFT with 2-D Decomposition on Intel Xeon Phi Clusters , 2019, PPAM.

[31]  Seetharami R. Seelam,et al.  Resource Management for Running HPC Applications in Container Clouds , 2016, ISC.

[32]  Daniel Crawl,et al.  A scalable approach for location-specific detection of Santa Ana conditions , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[33]  S. Rankin,et al.  Neutrophil kinetics in health and disease , 2010, Trends in immunology.

[34]  Domenico Cianflone,et al.  Postoperative Arrhythmias after Cardiac Surgery: Incidence, Risk Factors, and Therapeutic Management , 2014, Cardiology research and practice.

[35]  Bartosz Balis,et al.  Transparent Deployment of Scientific Workflows across Clouds - Kubernetes Approach , 2018, 2018 IEEE/ACM International Conference on Utility and Cloud Computing Companion (UCC Companion).

[36]  Khosrow Hashemzadeh,et al.  Postoperative Atrial Fibrillation following Open Cardiac Surgery: Predisposing Factors and Complications. , 2013, Journal of cardiovascular and thoracic research.