StarPU: a unified platform for task scheduling on heterogeneous multicore architectures
暂无分享,去创建一个
[1] Alejandro Duran,et al. A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures , 2009, IWOMP.
[2] Julien Langou,et al. A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures , 2007, Parallel Comput..
[3] Cédric Augonnet,et al. Exploiting the Cell/BE Architecture with the StarPU Unified Runtime System , 2009, SAMOS.
[4] Rosa M. Badia,et al. CellSs: a Programming Model for the Cell BE Architecture , 2006, ACM/IEEE SC 2006 Conference (SC'06).
[5] Salim Hariri,et al. Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing , 2002, IEEE Trans. Parallel Distributed Syst..
[6] Eduard Ayguadé,et al. Hierarchical Task-Based Programming With StarSs , 2009, Int. J. High Perform. Comput. Appl..
[7] Metin Nafi Gürcan,et al. Coordinating the use of GPU and CPU for improving performance of compute intensive applications , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.
[8] Cédric Augonnet,et al. Mapping and Synchronizing Streaming Applications on Cell Processors , 2008, HiPEAC.
[9] Jesús Labarta,et al. Exploiting Locality on the Cell/B.E. through Bypassing , 2009, SAMOS.
[10] Rafael Mayo,et al. Solving Dense Linear Systems on Graphics Processors , 2008, Euro-Par.
[11] Jack J. Dongarra,et al. Towards dense linear algebra for hybrid GPU accelerated manycore systems , 2009, Parallel Comput..
[12] Eduard Ayguadé,et al. An Extension of the StarSs Programming Model for Platforms with Multiple GPUs , 2009, Euro-Par.
[13] Pascal Hénon,et al. PaStiX: A Parallel Sparse Direct Solver Based on a Static Scheduling for Mixed 1D/2D Block Distributions , 2000, IPDPS Workshops.
[14] Grigori Fursin,et al. Predictive Runtime Code Scheduling for Heterogeneous Architectures , 2008, HiPEAC.
[15] Cédric Augonnet,et al. A Unified Runtime System for Heterogeneous Multi-core Architectures , 2009, Euro-Par Workshops.
[16] Alejandro Duran,et al. Extending the OpenMP Tasking Model to Allow Dependent Tasks , 2008, IWOMP.
[17] James Demmel,et al. Benchmarking GPUs to tune dense linear algebra , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.
[18] R. Dolbeau,et al. HMPP TM : A Hybrid Multi-core Parallel Programming Environment , 2022 .
[19] Naga K. Govindaraju,et al. A Survey of General‐Purpose Computation on Graphics Hardware , 2007 .
[20] Michael Kistler,et al. Accelerating computing with the cell broadband engine processor , 2008, Conf. Computing Frontiers.
[21] Jack J. Dongarra,et al. Automatically Tuned Linear Algebra Software , 1998, Proceedings of the IEEE/ACM SC98 Conference.
[22] Toshio Nakatani,et al. MPI microtask for programming the Cell Broadband EngineTM processor , 2006, IBM Syst. J..
[23] Gregory Diamos,et al. Harmony: an execution model and runtime for heterogeneous many core systems , 2008, HPDC '08.
[24] Larry Carter,et al. Scheduling strategies for master-slave tasking on heterogeneous processor platforms , 2004, IEEE Transactions on Parallel and Distributed Systems.
[25] Gregory Diamos,et al. An Execution Model and Runtime for Heterogeneous Many Core Systems , 2011 .
[26] Cédric Augonnet,et al. Automatic Calibration of Performance Models on Heterogeneous Multicore Architectures , 2009, Euro-Par Workshops.
[27] Laxmikant V. Kalé,et al. Charm++ simplifies coding for the cell processor , 2006, SC.