Universal Execution of Parallel Processes: Penetrating NATs over the Grid

Clusters are important computing resources today, and many computing centers operate them in private networks. Parallel programs, however, may not run across private clusters: hosts in a private cluster are not globally reachable, so hosts in different private clusters cannot communicate with one another directly. Excluding private clusters from the computation for this reason would waste substantial resources. Much research has addressed this issue, but most of it concentrates on user-level relaying because it is a general and easily implemented solution. Even when well implemented, however, user-level solutions incur much longer communication latency than kernel-level solutions. This paper adopts a novel kernel-level solution and applies it to MPICH-G2. Our scheme is generally applicable, simple, and efficient. Experimental results show that it incurs very little overhead except when small messages are transmitted. It thus supports a more universal computing environment by including private clusters with remarkably little overhead.
