A Cache-Aware Algorithm for PDEs on Hierarchical Data Structures

A big challenge in implementing up to date simulation software for various applications is to bring together highly efficient mathematical methods on the one hand side and an efficient usage of modern computer archtitectures on the other hand. We concentrate on the solution of PDEs and demonstrate how to overcome the hereby occuring quandary between cache-efficiency and modern multilevel methods on adaptive grids. Our algorithm is based on stacks, the simplest possible and thus very cache-efficient data structures.

[1]  Stephan Knapek,et al.  Numerische Simulation in der Moleküldynamik : Numerik, Algorithmen, Parallelisierung, Anwendungen , 2004 .

[2]  Richard A. Brualdi,et al.  On Sign-Nonsingular Matrices and the Conversion of the Permanent into the Determinant , 1990, Applied Geometry And Discrete Mathematics.

[3]  D. Braess Finite Elements: Theory, Fast Solvers, and Applications in Solid Mechanics , 1995 .

[4]  Folkmar A. Bornemann,et al.  An adaptive multilevel approach to parabolic equations : II. Variable-order time discretization based on a multiplicative error correction , 1991, IMPACT Comput. Sci. Eng..

[5]  Josef Weidendorfer,et al.  A Tool Suite for Simulation Based Analysis of Memory Access Behavior , 2004, International Conference on Computational Science.

[6]  R. J. Stevens,et al.  Manipulation and Presentation of Multidimensional Image Data Using the Peano Scan , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Gediminas Adomavicius,et al.  A Parallel Multilevel Method for Adaptively Refined Cartesian Grids with Embedded Boundaries , 2000 .

[8]  Michael Griebel,et al.  Parallel multigrid in an adaptive PDE solver based on hashing and space-filling curves , 1999, Parallel Comput..

[9]  Erik D. Demaine,et al.  Cache-Oblivious Algorithms and Data Structures , 2003 .

[10]  H. Sagan Space-filling curves , 1994 .

[11]  Luiz Velho,et al.  Digital halftoning with space filling curves , 1991, SIGGRAPH.

[12]  Abani K. Patra,et al.  Efficient Parallel Adaptive Finite Element Methods Using Self-Scheduling Data and Computations , 1999, HiPC.

[13]  Frank Günther,et al.  Eine cache-optimale Implementierung der Finite-Elemente-Methode , 2004 .

[14]  Matteo Frigo,et al.  Cache-oblivious algorithms , 1999, 40th Annual Symposium on Foundations of Computer Science (Cat. No.99CB37039).

[15]  Mithuna Thottethodi,et al.  Recursive array layouts and fast parallel matrix multiplication , 1999, SPAA '99.

[16]  Michael Griebel,et al.  Hash based adaptive parallel multilevel methods with space-filling curves , 2002 .

[17]  Siddhartha Chatterjee,et al.  Cache-efficient matrix transposition , 2000, Proceedings Sixth International Symposium on High-Performance Computer Architecture. HPCA-6 (Cat. No.PR00550).

[18]  Michael Griebel,et al.  Numerische Simulation in der Moleküldynamik , 2004, Numerische Simulation in der Moleküldynamik.

[19]  J. Tinsley Oden,et al.  Problem decomposition for adaptive hp finite element methods , 1995 .