Communication optimal parallel multiplication of sparse random matrices
James Demmel | Sivan Toledo | Laura Grigori | Oded Schwartz | Grey Ballard | Aydin Buluç | Benjamin Lipshitz
[1] Robert A. van de Geijn,et al. A flexible class of parallel matrix multiplication algorithms , 1998, Proceedings of the First Merged International Parallel Processing Symposium and Symposium on Parallel and Distributed Processing.
[2] Robert A. van de Geijn,et al. SUMMA: Scalable Universal Matrix Multiplication Algorithm , 1995 .
[3] James Demmel,et al. Communication-Optimal Parallel Recursive Rectangular Matrix Multiplication , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.
[4] James Demmel,et al. Communication-optimal parallel algorithm for Strassen's matrix multiplication , 2012, SPAA '12.
[5] Alan M. Frieze,et al. Random graphs , 2006, SODA '06.
[6] Lynn Elliot Cannon,et al. A cellular computer to implement the Kalman filter algorithm , 1969 .
[7] Martin D. Schatz,et al. Parallel Matrix Multiplication: 2D and 3D , 2012, FLAME Working Note #62.
[8] Eli Upfal,et al. Space-round tradeoffs for MapReduce computations , 2011, ICS '12.
[9] H. Whitney,et al. An inequality related to the isoperimetric inequality , 1949 .
[10] Fred G. Gustavson,et al. Two Fast Algorithms for Sparse Matrices: Multiplication and Permuted Transposition , 1978, TOMS.
[11] John R. Gilbert,et al. A Unified Framework for Numerical and Combinatorial Computing , 2008, Computing in Science & Engineering.
[12] Larry Rudolph,et al. Techniques for Parallel Manipulation of Sparse Matrices , 1989, Theor. Comput. Sci.
[13] John R. Gilbert,et al. Parallel Sparse Matrix-Matrix Multiplication and Indexing: Implementation and Experiments , 2011, SIAM J. Sci. Comput.
[14] Joost VandeVondele,et al. Linear Scaling Self-Consistent Field Calculations with Millions of Atoms in the Condensed Phase , 2012, Journal of Chemical Theory and Computation.
[15] Rasmus Pagh,et al. On parallelizing matrix multiplication by the column-row method , 2012, ALENEX.
[16] William L. Briggs,et al. A multigrid tutorial, Second Edition , 2000 .
[17] James Demmel,et al. Brief announcement: strong scaling of matrix multiplication algorithms and memory-independent communication lower bounds , 2012, SPAA '12.
[18] James Demmel,et al. Communication-Optimal Parallel 2.5D Matrix Multiplication and LU Factorization Algorithms , 2011, Euro-Par.
[19] James Demmel,et al. Improving communication performance in dense linear algebra via topology aware collectives , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).
[20] M. Challacombe. A general parallel sparse-blocked matrix multiply for linear scaling SCF theory , 2000 .
[21] Gerald Penn,et al. Efficient transitive closure of sparse matrices over closed semirings , 2006, Theor. Comput. Sci.
[22] James Demmel,et al. Minimizing Communication in Numerical Linear Algebra , 2009, SIAM J. Matrix Anal. Appl.
[23] Eli Upfal,et al. Efficient Algorithms for All-to-All Communications in Multiport Message-Passing Systems , 1997, IEEE Trans. Parallel Distributed Syst.
[24] Robert A. van de Geijn,et al. SUMMA: scalable universal matrix multiplication algorithm , 1995, Concurr. Pract. Exp.
[25] Alexander Tiskin,et al. Memory-Efficient Matrix Multiplication in the BSP Model , 1999, Algorithmica.
[26] James Demmel,et al. Brief announcement: Lower bounds on communication for sparse Cholesky factorization of a model problem , 2010, SPAA '10.
[27] Ramesh C. Agarwal,et al. A three-dimensional approach to parallel matrix multiplication , 1995, IBM J. Res. Dev.
[28] John R. Gilbert,et al. Sparse Matrices in MATLAB: Design and Implementation , 1992, SIAM J. Matrix Anal. Appl.
[29] John R. Gilbert,et al. The Combinatorial BLAS: design, implementation, and applications , 2011, Int. J. High Perform. Comput. Appl.
[30] Jehoshua Bruck,et al. Efficient algorithms for all-to-all communications in multi-port message-passing systems , 1994, SPAA '94.
[31] Stijn van Dongen,et al. Graph Clustering Via a Discrete Uncoupling Process , 2008, SIAM J. Matrix Anal. Appl.
[32] Raphael Yuster,et al. Fast sparse matrix multiplication , 2004, TALG.
[33] John R. Gilbert,et al. Challenges and Advances in Parallel Sparse Matrix-Matrix Multiplication , 2008, 2008 37th International Conference on Parallel Processing.