MDR: performance model driven runtime for heterogeneous parallel platforms
Anand Raghunathan | Srimat T. Chakradhar | Jacques A. Pienaar
[1] John E. Stone, et al. OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems, 2010, Computing in Science & Engineering.
[2] Teresa H. Y. Meng, et al. Merge: a programming model for heterogeneous multi-core systems, 2008, ASPLOS.
[3] Rajkumar Buyya, et al. A taxonomy of scientific workflow systems for grid computing, 2005, SGMD.
[4] Jeffrey S. Vetter, et al. Maestro: Data Orchestration and Tuning for OpenCL Devices, 2010, Euro-Par.
[5] Kevin Skadron, et al. Scalable parallel programming, 2008, 2008 IEEE Hot Chips 20 Symposium (HCS).
[6] David Tarditi, et al. Accelerator: using data parallelism to program GPUs for general-purpose uses, 2006, ASPLOS XII.
[7] Cédric Augonnet, et al. StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, 2011, Concurr. Comput. Pract. Exp.
[8] Jeff S. Brantley, et al. Contention-Aware Scheduling of Parallel Code for Heterogeneous Systems, 2010.
[9] Matteo Frigo, et al. The implementation of the Cilk-5 multithreaded language, 1998, PLDI.
[10] Yuan Yu, et al. Dryad: distributed data-parallel programs from sequential building blocks, 2007, EuroSys '07.
[11] Gregory Diamos, et al. Harmony: an execution model and runtime for heterogeneous many core systems, 2008, HPDC '08.
[12] Matteo Frigo, et al. The implementation of the Cilk-5 multithreaded language, 1998.
[13] Thomas G. Dietterich. What is machine learning?, 2020, Archives of Disease in Childhood.
[14] John E. Stone, et al. An asymmetric distributed shared memory model for heterogeneous parallel systems, 2010, ASPLOS XV.
[15] Surendra Byna, et al. Data-aware scheduling of legacy kernels on heterogeneous platforms with distributed memory, 2010, SPAA '10.
[16] Mark D. Hill, et al. Amdahl's Law in the Multicore Era, 2008, Computer.
[17] Hyesoon Kim, et al. Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping, 2009, 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).
[18] Ishfaq Ahmad, et al. Dynamic Critical-Path Scheduling: An Effective Technique for Allocating Task Graphs to Multiprocessors, 1996, IEEE Trans. Parallel Distributed Syst.
[19] Anand Raghunathan, et al. A framework for efficient and scalable execution of domain-specific templates on GPUs, 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.
[20] Grigori Fursin, et al. Predictive Runtime Code Scheduling for Heterogeneous Architectures, 2008, HiPEAC.
[21] Kevin Skadron, et al. Rodinia: A benchmark suite for heterogeneous computing, 2009, 2009 IEEE International Symposium on Workload Characterization (IISWC).
[22] Wenguang Chen, et al. MapCG: Writing parallel program portable between CPU and GPU, 2010, 2010 19th International Conference on Parallel Architectures and Compilation Techniques (PACT).
[23] Surendra Byna, et al. Best-effort semantic document search on GPUs, 2010, GPGPU-3.
[24] Arch D. Robison, et al. Intel® Threading Building Blocks (TBB), 2011, Encyclopedia of Parallel Computing.
[25] Jesús Labarta, et al. CellSs: Scheduling techniques to better exploit memory hierarchy, 2009, Sci. Program.