MPI performance engineering with the MPI tool interface: The integration of MVAPICH and TAU
暂无分享,去创建一个
Dhabaleswar K. Panda | Allen D. Malony | Sameer Shende | Hari Subramoni | Amit Ruhela | Srinivasan Ramesh | Aurèle Mahéo
[1] Franck Cappello,et al. Distributed Monitoring and Management of Exascale Systems in the Argo Project , 2015, DAIS.
[2] Sandia Report,et al. Improving Performance via Mini-applications , 2009 .
[3] Edgar Gabriel,et al. A Tool for Optimizing Runtime Parameters of Open MPI , 2008, PVM/MPI.
[4] Bernd Mohr,et al. A Tool Framework for Static and Dynamic Analysis of Object-Oriented Software with Templates , 2000, ACM/IEEE SC 2000 Conference (SC'00).
[5] Raymond Namyst,et al. MPC: A Unified Parallel Runtime for Clusters of NUMA Machines , 2008, Euro-Par.
[6] George Bosilca,et al. Implementation and Usage of the PERUSE-Interface in Open MPI , 2006, PVM/MPI.
[7] Anna Sikora,et al. Autotuning of MPI Applications Using PTF , 2016, SEM4HPC@HPDC.
[8] Robert J. Fowler,et al. An early prototype of an autonomic performance environment for exascale , 2013, ROSS '13.
[9] Matthias S. Müller,et al. The Vampir Performance Analysis Tool-Set , 2008, Parallel Tools Workshop.
[10] Martin Schulz,et al. Exploring the Capabilities of the New MPI_T Interface , 2014, EuroMPI/ASIA.
[11] George Bosilca,et al. Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation , 2004, PVM/MPI.
[12] Allen D. Malony,et al. The Tau Parallel Performance System , 2006, Int. J. High Perform. Comput. Appl..
[13] Thomas Fahringer,et al. Automatic tuning of MPI runtime parameter settings by using machine learning , 2010, CF '10.
[14] Holger Gohlke,et al. The Amber biomolecular simulation programs , 2005, J. Comput. Chem..
[15] Mike Dubman,et al. Scalable Hierarchical Aggregation Protocol (SHArP): A Hardware Architecture for Efficient Data Reduction , 2016, 2016 First International Workshop on Communication Optimizations in HPC (COMHPC).
[16] Patricia J. Teller,et al. MPI Advisor: a Minimal Overhead Tool for MPI Library Performance Tuning , 2015, EuroMPI.
[17] Michael Gerndt,et al. Automatic performance analysis with periscope , 2010 .
[18] Anthony Skjellum,et al. A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard , 1996, Parallel Comput..
[19] Dhabaleswar K. Panda,et al. High performance RDMA-based MPI implementation over InfiniBand , 2003, ICS.