Many large scale applications, have significant I/O requirements as well as computational and memory requirements. Unfortunately, limited number of I/O nodes provided by the contemporary message-passing distributed-memory architectures such as Intel Paragon and IBM SP-2 limits the I/O performance of these applications severely. In this paper, we examine some software optimization techniques and architectural scalability and evaluate the effect of them in five I/O intensive applications from both small and large application domains. Our goals in this study are twofold: First, we want to understand the behavior of large-scale data intensive applications and the impact of I/O subsystem on their performance and vice-versa. Second, and more importantly, we strive to determine the solutions for improving the applications' performance by a mix of architectural and software solutions. Our results reveal that the different applications can benefit from different optimizations. For example, we found that some applications benefit from file layout optimizations whereas some others benefit from collective I/O. A combination of architectural and software solutions is normally needed to obtain good I/O performance. For example, we show that with limited number of I/O resources, it is possible to obtain good performance by using appropriate software optimizations. We also show that beyond a certain level, imbalance in the architecture results in performance degradation even when using optimized software, thereby indicating the necessity of increase in I/O resources.
[1]
Alok Choudhary,et al.
Design and evaluation of optimizations in i/o-intensive applications
,
1998
.
[2]
Ian T. Foster,et al.
Remote I/O: fast access to distant storage
,
1997,
IOPADS '97.
[3]
Rajeev Thakur,et al.
An Experimental Evaluation of the Parallel I/O Systems of the IBM SP and Intel Paragon Using a Production Application
,
1996,
ACPC.
[4]
Kai Li,et al.
CLIP: A Checkpointing Tool for Message Passing Parallel Programs
,
1997,
ACM/IEEE SC 1997 Conference (SC'97).
[5]
Rajeev Thakur,et al.
Passion: Optimized I/O for Parallel Applications
,
1996,
Computer.
[6]
Mahmut T. Kandemir,et al.
Improving the performance of out-of-core computations
,
1997,
Proceedings of the 1997 International Conference on Parallel Processing (Cat. No.97TB100162).
[7]
Dror G. Feitelson,et al.
Parallel File Systems for the IBM SP Computers
,
1995,
IBM Syst. J..
[8]
D.A. Reed,et al.
Input/Output Characteristics of Scalable Parallel Applications
,
1995,
Proceedings of the IEEE/ACM SC95 Conference.
[9]
Samuel A. Fineberg.
Implementing the NHT-1 application I/O benchmark
,
1993,
CARN.