Grid computing provides a very rich environment for scientific calculations. In addition to the challenges it provides, it also offers new opportunities for optimization. In this paper we have utilized DFS (distributed file streaming) framework to speed up NAS grid benchmark workflows. By studying I/O patterns of NGB codes we have identified program locations where it is possible to overlap computation and data workflow phases. By integrating DFS into NGB, we demonstrate a useful method of improving overall workflow efficiency by streaming the output of the current process to make an input of the following stage, reducing a workflow to a series of distributed producer consumer stages. DFS framework eliminates file transfers and in the process makes process scheduling more efficient, leading to overall performance improvements in the turnaround time for HC (helical chain) data flow graph under Globus grid environment with the embedded DFS over the original version of the benchmark
[1]
W. Chen,et al.
Distributed File Streamer: A Framework for Distributed Application Data Coupling
,
2006,
2006 7th IEEE/ACM International Conference on Grid Computing.
[2]
Simon See,et al.
Benchmark performance on cluster grid with NGB
,
2004,
18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..
[3]
Michael A. Frumkin,et al.
NAS Grid Benchmarks: A Tool for Grid Space Exploration
,
2001,
Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.
[4]
Ian Foster,et al.
The Globus toolkit
,
1998
.
[5]
Frumkin,et al.
NAS Grid Benchmarks Version 1.0
,
2002
.