Data-Oriented Runtime Scheduling Framework on Multi-GPUs
Tao Li | Qiankun Dong | Yulu Yang | Wenjing Ma | Kezhao Zhao | Jiabing Ling