The PEPPHER Approach to Programmability and Performance Portability for Heterogeneous many-core Architectures
Andrew Richards | Christoph W. Kessler | Jesper Larsson Träff | Peter Sanders | David Moloney | Siegfried Benkner | Raymond Namyst | Philippas Tsigas | Sabri Pllana | Beverly Bachmayer
[1] Jesper Larsson Träff, et al. Work-stealing for mixed-mode parallelism by deterministic team-building, 2010, SPAA '11.
[2] Emmanuel Agullo, et al. QR Factorization on a Multicore Node Enhanced with Multiple GPU Accelerators, 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.
[3] Siegfried Benkner, et al. Explicit Platform Descriptions for Heterogeneous Many-Core Architectures, 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.
[4] Teresa H. Y. Meng, et al. Merge: a programming model for heterogeneous multi-core systems, 2008, ASPLOS.
[5] Cédric Augonnet, et al. StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, 2011, Concurr. Comput. Pract. Exp.
[6] Alan Edelman, et al. PetaBricks: a language and compiler for algorithmic choice, 2009, PLDI '09.
[7] Greg Stitt, et al. Elastic computing: a framework for transparent, portable, and adaptive multi-core heterogeneous computing, 2010, LCTES '10.
[8] Fatos Xhafa, et al. Towards an Intelligent Environment for Programming Multi-core Computing Systems, 2009, Euro-Par Workshops.
[9] Andrew Richards, et al. Offload - Automating Code Migration to Heterogeneous Multicore Systems, 2010, HiPEAC.
[10] Jack J. Dongarra, et al. Dense linear algebra solvers for multicore with GPU accelerators, 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW).
[11] Vitaly Osipov, et al. GPU sample sort, 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).
[12] Sanjay Ghemawat, et al. MapReduce: Simplified Data Processing on Large Clusters, 2004, OSDI.
[13] Christoph W. Kessler, et al. A Framework for Performance-Aware Composition of Explicitly Parallel Components, 2007, PARCO.
[14] Philippas Tsigas, et al. NB-FEB: A Universal Scalable Easy-to-Use Synchronization Primitive for Manycore Architectures, 2009, OPODIS.
[15] Kunle Olukotun, et al. A domain-specific approach to heterogeneous parallelism, 2011, PPoPP '11.
[16] Yolanda Gil, et al. Self-Configuring Applications for Heterogeneous Systems: Program Composition and Optimization Using Cognitive Techniques, 2008, Proceedings of the IEEE.
[17] Julien Langou, et al. A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures, 2007, Parallel Comput.