Effective Implementation of DGEMM on Modern Multicore CPU
暂无分享,去创建一个
[1] Rupak Biswas,et al. Scientific application-based performance comparison of SGI Altix 4700, IBM POWER5+, and SGI ICE 8200 supercomputers , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.
[2] V. Strassen. Gaussian elimination is not optimal , 1969 .
[3] Pawel Gepner,et al. Evaluation of Executing DGEMM Algorithms on Modern Multicore CPU , 2011 .
[4] Jack J. Dongarra,et al. A set of level 3 basic linear algebra subprograms , 1990, TOMS.
[5] Pawel Gepner,et al. Parallel application benchmarks and performance evaluation of the Intel Xeon 7500 family processors , 2011, ICCS.
[6] Pawel Gepner,et al. Early performance evaluation of AVX for HPC , 2011, ICCS.
[7] Robert A. van de Geijn,et al. Anatomy of high-performance matrix multiplication , 2008, TOMS.
[8] Victor Eijkhout,et al. Self-Adapting Linear Algebra Algorithms and Software , 2005, Proceedings of the IEEE.
[9] D. Holdstock. Past, present--and future? , 2005, Medicine, conflict, and survival.