Generating Efficient Programs for Two-Level Memories from Tensor-products
暂无分享,去创建一个
[1] Harold S. Stone,et al. Parallel Processing with the Perfect Shuffle , 1971, IEEE Transactions on Computers.
[2] Jeffrey Scott Vitter,et al. Optimal disk I/O with parallel block transfer , 1990, STOC '90.
[3] Alok Aggarwal,et al. The input/output complexity of sorting and related problems , 1988, CACM.
[4] P. Sadayappan,et al. Efficient transposition algorithms for large matrices , 1993, Supercomputing '93.
[5] J. O. Eklundh,et al. A Fast Computer Method for Matrix Transposing , 1972, IEEE Transactions on Computers.
[6] Sandeep Kumar S. Gupta. Synthesizing communication-efficient distributed-memory parallel programs for block recursive algorithms , 1995 .
[7] P. Sadayappan,et al. A Methodology for Generating Efficient Disk-Based Algorithms from Tensor Product Formulas , 1993, LCPC.