Pipeline implementation of cellular automata for structural design on message-passing multiprocessors

The inherent structure of cellular automata is trivially parallelizable and can directly benefit from massively parallel machines in computationally intensive problems. This paper presents both block synchronous and block pipeline (with asynchronous message passing) parallel implementations of cellular automata on distributed memory (message-passing) architectures. A structural design problem is considered to study the performance of the various cellular automata implementations. The synchronous parallel implementation is a mixture of Jacobi and Gauss-Seidel style iteration, where it becomes more Jacobi like as the number of processors increases. Therefore, it exhibits divergence because of the mathematical characteristics of Jacobi iteration matrix for the structural problem as the number of processors increases. The proposed pipeline implementation preserves convergence by simulating a pure Gauss-Seidel style row-wise iteration. Numerical results for analysis and design of a cantilever plate made of composite material show that the pipeline update scheme is convergent and successfully generates optimal designs.

[1]  Loyce Adams,et al.  New Parallel SOR Method by Domain Partitioning , 1999, SIAM J. Sci. Comput..

[2]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[3]  John von Neumann,et al.  Theory Of Self Reproducing Automata , 1967 .

[4]  Michael Schreckenberg,et al.  A cellular automaton traffic flow model for online simulation of traffic , 2001, Parallel Comput..

[5]  Stanley C. Eisenstat,et al.  Comments on scheduling parallel iterative methods on multiprocessor systems II , 1989, Parallel Comput..

[6]  H. F. Jordan,et al.  Is SOR Color-Blind? , 1986 .

[7]  Yves Robert,et al.  Comments on scheduling parallel iterative methods on multiprocessor systems , 1988, Parallel Comput..

[8]  A ROSENBLUETH,et al.  The mathematical formulation of the problem of conduction of impulses in a network of connected excitable elements, specifically in cardiac muscle. , 1946, Archivos del Instituto de Cardiologia de Mexico.

[9]  Zafer Gürdal,et al.  Cellular Automata for design of truss structures with linear and nonlinear response , 2000 .

[10]  J. Z. Zhu,et al.  The finite element method , 1977 .

[11]  J. Douglas Faires,et al.  Numerical Analysis , 1981 .

[12]  Nikolaos M. Missirlis,et al.  Scheduling parallel iterative methods on multiprocessor systems , 1987, Parallel Comput..

[13]  G. Steven,et al.  A performance-based optimization method for topology design of continuum structures with mean compliance constraints , 2002 .

[14]  Wilhelm Niethammer,et al.  The SOR method on parallel computers , 1989 .

[15]  Layne T. Watson,et al.  Convergence analysis for cellular automata applied to truss design , 2002 .

[16]  Layne T. Watson,et al.  The Cellular Automata Paradigm for the Parallel Solution of Heat Transfer Problems , 1996, Parallel Algorithms Appl..