Performance Analysis and Optimization of Parallel Scientific Applications on CMP Clusters
[1] F. Petrini, et al. The Case of the Missing Supercomputer Performance: Achieving Optimal Performance on the 8,192 Processors of ASCI Q, 2003, ACM/IEEE SC 2003 Conference (SC'03).
[2] Xingfu Wu, et al. Processor partitioning: an experimental performance analysis of parallel applications on SMP cluster systems, 2007.
[3] V. Taylor, et al. Design and Implementation of Prophesy Automatic Instrumentation and Data Entry System, 2001.
[4] Xingfu Wu, et al. Using kernel couplings to predict parallel application performance, 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.
[5] Rajeev Thakur, et al. Optimization of Collective Communication Operations in MPICH, 2005, Int. J. High Perform. Comput. Appl.
[6] Mark R. Fahey, et al. GYRO: A 5-D Gyrokinetic-Maxwell Solver, 2004, Proceedings of the ACM/IEEE SC2004 Conference.
[7] Xingfu Wu, et al. Performance Analysis, Modeling and Prediction of a Parallel Multiblock Lattice Boltzmann Application Using Prophesy System, 2006, 2006 IEEE International Conference on Cluster Computing.
[8] Laxmikant V. Kalé, et al. NAMD: Biomolecular Simulation on Thousands of Processors, 2002, ACM/IEEE SC 2002 Conference (SC'02).
[9] Xingfu Wu, et al. Prophesy: an infrastructure for performance analysis and modeling of parallel and grid applications, 2003, PERV.