Bio-Inspired Call-Stack Reconstruction for Performance Analysis
暂无分享,去创建一个
Juan Gonzalez | Jesús Labarta | Harald Servat | Germán Llort | Judit Giménez | Jesús Labarta | Harald Servat | Germán Llort | Judit Giménez | Juan Gonzalez
[1] Jesús Labarta,et al. Unveiling Internal Evolution of Parallel Application Computation Phases , 2011, 2011 International Conference on Parallel Processing.
[2] Message Passing Interface Forum. MPI: A message - passing interface standard , 1994 .
[3] M S Waterman,et al. Identification of common molecular subsequences. , 1981, Journal of molecular biology.
[4] James R. Larus,et al. Efficient path profiling , 1996, Proceedings of the 29th Annual IEEE/ACM International Symposium on Microarchitecture. MICRO 29.
[5] Amer Diwan,et al. Inferred call path profiling , 2009, OOPSLA 2009.
[6] Arnaldo Carvalho de Melo,et al. The New Linux ’ perf ’ Tools , 2010 .
[7] Toni Cortes,et al. PARAVER: A Tool to Visualize and Analyze Parallel Code , 2007 .
[8] Dirk Schmidl,et al. Score-P: A Unified Performance Measurement System for Petascale Applications , 2010, CHPC.
[9] G. Madec. NEMO ocean engine , 2008 .
[10] C. Notredame,et al. Recent progress in multiple sequence alignment: a survey. , 2002, Pharmacogenomics.
[11] Wolfgang E. Nagel,et al. VAMPIR: Visualization and Analysis of MPI Resources , 2010 .
[12] Nathan Froyd,et al. Low-overhead call path profiling of unmodified, optimized code , 2005, ICS '05.
[13] Nathan R. Tallent,et al. HPCToolkit: performance tools for scientific computing , 2008 .
[14] John Whaley,et al. A portable sampling-based profiler for Java virtual machines , 2000, JAVA '00.
[15] J. Wiley. PRACTICAL EXPERIENCE OF THE LIMITATIONS OF GPROF , 1993 .
[16] Stephen A. Jarvis,et al. Exploiting spatiotemporal locality for fast call stack traversal , 2012 .
[17] Juan Gonzalez,et al. Automatic detection of parallel applications computation phases , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.
[18] Allen D. Malony,et al. The Tau Parallel Performance System , 2006, Int. J. High Perform. Comput. Appl..
[19] Bernd Mohr,et al. Usage of the SCALASCA toolset for scalable performance analysis of large-scale parallel applications , 2008, Parallel Tools Workshop.
[20] Martin Schulz,et al. Reconciling Sampling and Direct Instrumentation for Unintrusive Call-Path Profiling of MPI Programs , 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.
[21] Heinz Pitsch,et al. High order conservative finite difference scheme for variable density low Mach number turbulent flows , 2007, J. Comput. Phys..