Identifying Critical Code Sections in Dataflow Programming Models
暂无分享,去创建一个
Mateo Valero | Jesús Labarta | José Carlos Sancho | Vladimir Subotic | M. Valero | J. Labarta | V. Subotic | J. Sancho
[1] Jeroen Tromp,et al. High-frequency simulations of global seismic wave propagation using SPECFEM3D_GLOBE on 62K processors , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.
[2] Jack J. Dongarra,et al. The LINPACK Benchmark: past, present and future , 2003, Concurr. Comput. Pract. Exp..
[3] Jack Dongarra,et al. MPI: The Complete Reference , 1996 .
[4] Thomas E. Anderson,et al. Quartz: a tool for tuning parallel program performance , 1990, SIGMETRICS '90.
[5] Jesús Labarta,et al. Validation of Dimemas Communication Model for MPI Collective Operations , 2000, PVM/MPI.
[6] L. Dagum,et al. OpenMP: an industry standard API for shared-memory programming , 1998 .
[7] G. Amdhal,et al. Validity of the single processor approach to achieving large scale computing capabilities , 1967, AFIPS '67 (Spring).
[8] Eduard Ayguadé,et al. Overlapping communication and computation by using a hybrid MPI/SMPSs approach , 2010, ICS '10.
[9] P. Hanrahan,et al. Sequoia: Programming the Memory Hierarchy , 2006, ACM/IEEE SC 2006 Conference (SC'06).
[10] Michael A. Laurenzano,et al. High-frequency simulations of global seismic wave propagation using SPECFEM3D_GLOBE on 62K processors , 2008, HiPC 2008.
[11] Alejandro Duran,et al. A Proposal for Task Parallelism in OpenMP , 2007, IWOMP.
[12] Mayank Agarwal,et al. SPARTAN: A software tool for Parallelization Bottleneck Analysis , 2009, 2009 ICSE Workshop on Multicore Software Engineering.
[13] Mateo Valero,et al. Quantifying the Potential Task-Based Dataflow Parallelism in MPI Applications , 2011, Euro-Par.
[14] Susan L. Graham,et al. Gprof: A call graph execution profiler , 1982, SIGPLAN '82.
[15] Jesús Labarta,et al. A dependency-aware task-based programming environment for multi-core architectures , 2008, 2008 IEEE International Conference on Cluster Computing.
[16] Matteo Frigo,et al. The implementation of the Cilk-5 multithreaded language , 1998, PLDI.
[17] Nathan R. Tallent,et al. Analyzing lock contention in multithreaded applications , 2010, PPoPP '10.