Hypergraph Models for Sparse Matrix Partitioning and Reordering

HYPERGRAPH MODELS FOR SPARSE MATRIX PARTITIONING AND REORDERING  Umit V. C ataly urek Ph.D. in Computer Engineering and Information Science Supervisor: Assoc. Prof. Cevdet Aykanat November, 1999 Graphs have been widely used to represent sparse matrices for various scienti c applications including one-dimensional (1D) decomposition of sparse matrices for parallel sparse-matrix vector multiplication (SpMxV) and sparse matrix reordering for low ll factorization. The standard graph-partitioning based 1D decomposition of sparse matrices does not re ect the actual communication volume requirement for parallel SpMxV.We propose two computational hypergraph models which avoid this crucial de ciency of the graph model on 1D decomposition. The proposed models reduce the 1D decomposition problem to the well-known hypergraph partitioning problem. In the literature, there is a lack of 2D decomposition heuristic which directly minimizes the communication requirements for parallel SpMxV computations. Three novel hypergraph models are introduced for 2D decomposition of sparse matrices for minimizing the communication volume requirement. The rst hypergraph model is proposed for ne-grain 2D decomposition of the sparse matrices for parallel SpMxV. The second hypergraph model for 2D decomposition is proposed to produce jagged-like decomposition of the sparse matrix. The checkerboard decomposition based parallel matrix-vector multiplication algorithms are widely encountered in the literature. However, only the load balancing problem is addressed in those works. Here, we propose a new hypergraph model which aims the minimization of communication volume while maintaining the load balance among the processors for checkerboard decomposition, as the third model for 2D decomposition. The proposed model reduces the decomposition problem to the multi-constraint hypergraph partitioning problem. The notion of multi-constraint partitioning has recently become popular in graph partitioning. We applied the multi-constraint partitioning to the hypergraph partitioning problem for solving checkerboard partitioning. Graph partitioning by vertex separator (GPVS) is widely used for nested dissection based low ll ordering of sparse matrices for direct solution of linear systems. In this work,

[1]  J. W. Walker,et al.  Direct solutions of sparse network equations by optimally ordered triangular factorization , 1967 .

[2]  K. Brian Haley,et al.  Linear Programming Problems , 1967 .

[3]  Brian W. Kernighan,et al.  A proper model for the partitioning of electrical circuits , 1972, DAC '72.

[4]  Alex Pothen,et al.  PARTITIONING SPARSE MATRICES WITH EIGENVECTORS OF GRAPHS* , 1990 .

[5]  John G. Lewis,et al.  Sparse matrix test problems , 1982, SGNM.

[6]  R. M. Mattheyses,et al.  A Linear-Time Heuristic for Improving Network Partitions , 1982, 19th Design Automation Conference.

[7]  J. Pasciak,et al.  Computer solution of large sparse positive definite systems , 1982 .

[8]  Alberto L. Sangiovanni-Vincentelli,et al.  An Algorithm for Optimal PLA Folding , 1982, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[9]  P. Sadayappan,et al.  Iterative Algorithms for Solution of Large Sparse Systems of Linear Equations on Hypercubes , 1988, IEEE Trans. Computers.

[10]  Joseph W. H. Liu,et al.  A graph partitioning algorithm by node separators , 1989, TOMS.

[11]  Joseph W. H. Liu The Minimum Degree Ordering with Constraints , 1989 .

[12]  Andrew B. Kahng Fast Hypergraph Partition , 1989, 26th ACM/IEEE Design Automation Conference.

[13]  Alex Pothen,et al.  Computing the block triangular form of a sparse matrix , 1990, TOMS.

[14]  Jeffery L. Kennington,et al.  An Empirical Evaluation of the KORBX® Algorithms for Military Airlift Applications , 1990, Oper. Res..

[15]  Thomas Lengauer,et al.  Combinatorial algorithms for integrated circuit layout , 1990, Applicable theory in computer science.

[16]  Chung-Kuan Cheng,et al.  An improved two-way partitioning algorithm with stable performance [VLSI] , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[17]  Chung-Kuan Cheng,et al.  Ratio cut partitioning for hierarchical designs , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[18]  Jason Cong,et al.  Net partitions yield better module partitions , 1992, [1992] Proceedings 29th ACM/IEEE Design Automation Conference.

[19]  Lie Wang,et al.  Towards a fast implementation of spectral nested dissection , 1992, Proceedings Supercomputing '92.

[20]  Curt Jones,et al.  Finding Good Approximate Vertex and Edge Partitions is NP-Hard , 1992, Inf. Process. Lett..

[21]  Wolfgang Fichtner,et al.  A Set of New Mapping and Coloring Heuristics for Distributed-Memory Parallel Processors , 1992, SIAM J. Sci. Comput..

[22]  Tevfik Bultan,et al.  A New Mapping Heuristic Based on Mean Field Annealing , 1992, J. Parallel Distributed Comput..

[23]  William Aiello,et al.  Sparse Matrix Computations on Parallel Processor Arrays , 1993, SIAM J. Sci. Comput..

[24]  Bruce Hendrickson,et al.  The Chaco user`s guide. Version 1.0 , 1993 .

[25]  Kesheng Wu,et al.  Sparse Matrix Computations on the CM-5 , 1993, SIAM Conference on Parallel Processing for Scientific Computing.

[26]  Curt Jones,et al.  A Heuristic for Reducing Fill-In in Sparse Matrix Factorization , 1993, PPSC.

[27]  J. G. Lewis,et al.  Distributed memory matrix-vector multiplication and conjugate gradient algorithms , 1993, Supercomputing '93.

[28]  Dorothea Wagner,et al.  Modeling Hypergraphs by Graphs with the Same Mincut Properties , 1993, Inf. Process. Lett..

[29]  Jason Cong,et al.  A Parallel Bottom-up Clustering Algorithm with Applications to Circuit Partitioning in VLSI Design , 1993, 30th ACM/IEEE Design Automation Conference.

[30]  J.G. Lewis,et al.  Matrix-vector multiplication and conjugate gradient algorithms on distributed memory computers , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.

[31]  Steven J. Plimpton,et al.  Massively parallel methods for engineering and science problems , 1994, CACM.

[32]  J. Cong,et al.  Multi-way VLSI Circuit Partitioning Based On Dual Net Representation , 1994, IEEE/ACM International Conference on Computer-Aided Design.

[33]  George Karypis,et al.  Introduction to Parallel Computing , 1994 .

[34]  Olivier C. Martin,et al.  Partitioning of unstructured meshes for load balancing , 1995, Concurr. Pract. Exp..

[35]  Bruce Hendrickson,et al.  A Multi-Level Algorithm For Partitioning Graphs , 1995, Proceedings of the IEEE/ACM SC95 Conference.

[36]  Laxmi N. Bhuyan,et al.  Mapping Molecular Dynamics Computations on to Hypercubes , 1995, Parallel Comput..

[37]  Sanjay Ranka,et al.  Partitioning unstructured computational graphs for nonunifor , 1995, IEEE Parallel & Distributed Technology: Systems & Applications.

[38]  Andrew B. Kahng,et al.  Recent directions in netlist partitioning: a survey , 1995, Integr..

[39]  Ümit V. Çatalyürek,et al.  Decomposing Linear Programs for Parallel Solution , 1995, PARA.

[40]  Cleve Ashcraft,et al.  Compressed Graphs and the Minimum Degree Algorithm , 1995, SIAM J. Sci. Comput..

[41]  Ümit V. Çatalyürek,et al.  Decomposing Irregularly Sparse Matrices for Parallel Matrix-Vector Multiplication , 1996, IRREGULAR.

[42]  Andrew B. Kahng,et al.  A hybrid multilevel/genetic approach for circuit partitioning , 1996, Proceedings of APCCAS'96 - Asia Pacific Conference on Circuits and Systems.

[43]  Patrick R. Amestoy,et al.  An Approximate Minimum Degree Ordering Algorithm , 1996, SIAM J. Matrix Anal. Appl..

[44]  Anshul Gupta,et al.  Fast and effective algorithms for graph partitioning and sparse-matrix ordering , 1997, IBM J. Res. Dev..

[45]  Bruce Hendrickson,et al.  Effective Sparse Matrix Ordering: Just Around the BEND , 1997, PPSC.

[46]  Sanjay Ranka,et al.  Parallel Incremental Graph Partitioning , 1997, IEEE Trans. Parallel Distributed Syst..

[47]  Tarek A. El-Ghazawi,et al.  Load-Balanced Sparse Matrix--Vector Multiplication on Parallel Computers , 1997, J. Parallel Distributed Comput..

[48]  Joseph W. H. Liu,et al.  Using domain decomposition to find graph bisectors , 1997 .

[49]  Joseph W. H. Liu,et al.  Applications of the Dulmage--Mendelsohn Decomposition and Network Flow to Graph Bisection Improvement , 1998 .

[50]  Joseph W. H. Liu,et al.  Robust Ordering of Sparse Matrices using Multisection , 1998 .

[51]  Bruce Hendrickson,et al.  Improving the Run Time and Quality of Nested Dissection Ordering , 1998, SIAM J. Sci. Comput..

[52]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[53]  T. Kolda Partitioning Sparse Rectangular Matrices for Parallel Processing ? , 1998 .

[54]  Vipin Kumar,et al.  Multilevel Algorithms for Multi-Constraint Graph Partitioning , 1998, Proceedings of the IEEE/ACM SC98 Conference.

[55]  Vipin Kumar,et al.  A New Algorithm for Multi-objective Graph Partitioning , 1999, Euro-Par.

[56]  Tamara G. Kolda,et al.  Partitioning Rectangular and Structurally Unsymmetric Sparse Matrices for Parallel Processing , 1999, SIAM J. Sci. Comput..