Scheduling parallel I/O operations

The I/O bottleneck in parallel computer systems has recently begun to receive increasing attention. Most work has focused on improving the performance of I/O devices through fairly low-level parallelism, in techniques such as disk striping and interleaving. Widely applicable solutions, however, will require an integrated approach that addresses the problem at multiple system levels, including applications, systems software, and architecture. We propose that, within the context of such an integrated approach, scheduling parallel I/O operations will become increasingly attractive and can potentially provide substantial performance benefits.

We describe a simple I/O scheduling problem and present approximate algorithms for its solution. The costs of using these algorithms, in terms of execution time, and the benefits, in terms of reduced time to complete a batch of I/O operations, are compared with two baselines: no scheduling and an optimal scheduling algorithm. The comparison is performed both theoretically and experimentally. We have found that, in exchange for a small execution time overhead, the approximate scheduling algorithms can provide substantial improvements in I/O completion times.
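The abstract does not spell out the scheduling model, so the following is only an illustrative sketch under assumptions of our own, not the algorithm from the paper. It assumes that each I/O operation is a unit-time transfer between one processor and one device, and that each processor and each device can take part in at most one transfer per time slot. Under that assumed model, a simple greedy heuristic places each transfer in the earliest slot in which both endpoints are idle. The names `greedy_schedule` and `max_degree` are hypothetical.

```python
from collections import defaultdict


def greedy_schedule(transfers):
    """Assign each (processor, device) transfer to the earliest time slot
    in which both of its endpoints are idle (assumed unit-time transfers)."""
    busy = defaultdict(set)  # endpoint -> set of slots already occupied
    slots = []
    for proc, dev in transfers:
        p_key, d_key = ("proc", proc), ("dev", dev)
        t = 0
        while t in busy[p_key] or t in busy[d_key]:
            t += 1
        busy[p_key].add(t)
        busy[d_key].add(t)
        slots.append(t)
    return slots


def max_degree(transfers):
    """Largest number of transfers touching any single endpoint.
    No schedule can finish in fewer slots than this under the assumed model."""
    degree = defaultdict(int)
    for proc, dev in transfers:
        degree[("proc", proc)] += 1
        degree[("dev", dev)] += 1
    return max(degree.values(), default=0)


# Hypothetical batch of five transfers among three processors and two devices.
batch = [("P0", "D0"), ("P0", "D1"), ("P1", "D0"), ("P1", "D1"), ("P2", "D1")]
slots = greedy_schedule(batch)
makespan = max(slots) + 1
print(slots, makespan, max_degree(batch))  # [0, 1, 1, 0, 2] 3 3
```

In this small batch the greedy schedule finishes in 3 slots, which matches the lower bound set by device D1 (three transfers). In general, a greedy assignment of this kind can use up to 2Δ − 1 slots, where Δ is the maximum endpoint degree, whereas an optimal schedule under this bipartite model needs exactly Δ.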
