Tradeoff to minimize extra-computations and stopping criterion tests for parallel iterative schemes

Parallel synchronous iterative algorithms are often penalized by global synchro- nization, due to the cost of stopping tests that are achieved. It is well known that such global synchronizations are expensive for parallel implementations on distrib- uted systems, especially on clusters of processors or computational grids, where the heterogeneity and the number of processors imply a large overhead for this global operation. The aim of this work is to propose a new control technique for the stopping tests, original to the best of our knowledge, which enables to reduce the number of global synchronization near to the optimum, while keeping the number of iterations close to the number performed by standard synchronous algorithm. The main advantage of the proposed technique is that the semantic of the sequential al- gorithm is not modified, so that convergence is preserved and identical outputs are guaranteed. Our method is based on an amortized technique inspired by Floyd's and Brent's algorithms to detect periodicity in a sequence.

[1]  Denis Trystram,et al.  Large scale simulation of parallel molecular dynamics , 1999, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999.

[2]  David Thomas,et al.  The Art in Computer Programming , 2001 .

[3]  Donald Ervin Knuth,et al.  The Art of Computer Programming , 1968 .

[4]  Didier El Baz,et al.  Asynchronous Iterative Algorithms with Flexible Communication for Nonlinear Network Flow Problems , 1996, J. Parallel Distributed Comput..

[5]  Ronald L. Rivest,et al.  Introduction to Algorithms, Second Edition , 2001 .

[6]  Jacques M. Bahi,et al.  Asynchronous multisplitting methods for nonlinear fixed point problems , 1997, Numerical Algorithms.

[7]  Achim Basermann,et al.  Preconditioned CG Methods for Sparse Matrices on Massively Parallel Machines , 1997, Parallel Comput..