A project on high performance I/0 subsystems

A Project on High Performance I/0 Subsystems Randy H. Katz, John K. Ousterhout, David A. Patterson, Peter Chen, Ann Chervenak, Rich Drewes, Garth Gibson, Ed Lee, Ken Lutz, Ethan Miller, Mendel Rosenblum Computer Science Division Department of Electrical Engineering and Computer Sciences University of California Berkeley, California 94720 1. Introduction and Overview Computing is seeing an unprecedented improvement in performance; over the last five years there has been an order-of-magnitude improvement in the speeds of workstation CPUs. At least another order of magnitude seems likely in the next five years, to machines with 100 MIPs or more. DARPA has already launched a program to develop even larger, more powerful machines, executing as many as 1012 operations per second. Unfortunately, we have seen no comparable break-throughs in I/O performance; the speeds of I/O devices and the hardware and software architectures for managing them have not changed substantially in many years [Katz 89]. What will unbalanced improvements in performance mean? Twenty years ago, Gene Amdahl was asked to comment about the Illiac-IV. He noted that while the vector portion of programs might run much faster, a major portion of the programs would run essentially at the same speed. In what has come to be known as Amdahl's Law, he observed that no matter how much faster one piece of the program would go over tradi- tional computers, overall performance improvement is limited by the part of the program that is not improved. Without major increases in I/O performance and reliability we think that transaction processing systems, supercomputers, and high-performance workstations will be unable to achieve their true potential. Our research group is pursuing a program of research to develop hardware and software I/O architectures capable of supporting the kinds of internetworked work- stations and super compute servers that will appear in the early 1990s. The project has three overall goals: High Performance. We are developing new I/O architectures and a prototype system that can scale to achieve significant factors of improvement in I/O performance, relative to today's commercially available I/O systems, for the same cost. We believe this speedup can be achieved using a combination of arrays of inexpensive personal computer disks coupled with a file system that can accommodate both striped and partitioned file organizations. High Reliability. To support the high-performance computing of the mid-1990's, I/O 24

[1]  R. MacGregor,et al.  Mermaid—A front-end to distributed heterogeneous databases , 1987, Proceedings of the IEEE.

[2]  James P. Gray,et al.  Advanced Program-to-Program Communication in SNA , 1983, IBM Syst. J..

[3]  Vijay Ahuja Common Communications Support in Systems Application Architecture , 1988, IBM Syst. J..

[4]  H KatzRandy,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988 .

[5]  Martin E. Schulze Considerations in the Design of a RAID Prototype , 1988 .

[6]  W. Morven Gentleman,et al.  Message passing between sequential processes: The reply primitive and the administrator concept , 1981, Softw. Pract. Exp..

[7]  D. Godes,et al.  Use of heterogeneous data sources : three case studies , 1989 .

[8]  Stuart E. Madnick,et al.  A Polygen Data Model for data source tagging in Composite Information Systems , 1989 .

[9]  Stuart E. Madnick,et al.  A Polygen Model for Heterogeneous Database Systems: The Source Tagging Perspective , 1990, VLDB.

[10]  Michelle Y. Kim,et al.  Synchronized Disk Interleaving , 1986, IEEE Transactions on Computers.

[11]  Jim Gray,et al.  An approach to decentralized computer systems , 1986, IEEE Transactions on Software Engineering.

[12]  Stuart E. Madnick,et al.  Motivating Strategic Alliance for Composite Information Systems: The Case of a Major Regional Hospital , 1989, J. Manag. Inf. Syst..

[13]  Stavros Christodoulakis,et al.  The multimedia object presentation manager of MINOS: a symmetric approach , 1986, SIGMOD '86.

[14]  Stuart E. Madnick,et al.  Facilitating connectivity in composite information systems , 1989, DATB.

[15]  Richard M. Karp,et al.  Failure correction techniques for large disk arrays , 1989, ASPLOS 1989.

[16]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[17]  Michael N. Nelson,et al.  Caching in the Sprite network file system , 1988, TOCS.

[18]  Miron Livny,et al.  Multi-disk management algorithms , 1987, SIGMETRICS '87.

[19]  Stuart E. Madnick,et al.  An architectural comparison of contemporary apporaches and products for integrating heterogeneous information systems , 1989 .

[20]  Randy H. Katz,et al.  How reliable is a RAID? , 1989, Digest of Papers. COMPCON Spring 89. Thirty-Fourth IEEE Computer Society International Conference: Intellectual Leverage.

[21]  Alan E. Bell Critical Issues In High Density Magnetic And Optical Data Storage , 1983, Other Conferences.

[22]  Randy H. Katz,et al.  Disk system architectures for high performance computing , 1989, Proc. IEEE.

[23]  Larry Fujitani Laser optical disk: the coming revolution in on-line storage , 1984, CACM.

[24]  George P. Copeland What if Mass Storage Were Free? , 1980, Computer.

[25]  Stavros Christodoulakis Issues in the architecture of a document archiver using optical disk technology , 1985, SIGMOD '85.

[26]  Christos Faloutsos,et al.  Design and Performance Considerations for an Optical Disk-Based, Multimedia Object Server , 1986, Computer.

[27]  Stavros Christodoulakis Query Processing in Optical Disk Based Multimedia Information Systems. , 1986 .

[28]  Stuart E. Madnick,et al.  Evolution towards Strategic Applications of Databases through Composite Information Systems , 1988, J. Manag. Inf. Syst..

[29]  Toon King Wong,et al.  Data connectivity for the composite information system/tool kit , 1989 .

[30]  Michael Stonebraker,et al.  Implementation techniques for main memory database systems , 1984, SIGMOD '84.

[31]  Stavros Christodoulakis,et al.  Performance analysis and fundamental performance tradeoffs for CLV optical disks , 1988, SIGMOD '88.

[32]  George Paul,et al.  Disk Interleaving and Very Large Fast Fourier Transforms , 1987 .

[33]  Eugene Wong,et al.  Multibase: integrating heterogeneous distributed database systems , 1981, AFIPS '81.

[34]  Yuri Breitbart,et al.  Multidatabase Interoperability , 1990, SGMD.

[35]  Hector Garcia-Molina,et al.  Disk striping , 1986, 1986 IEEE Second International Conference on Data Engineering.

[36]  Stuart E. Madnick,et al.  Identification and reconciliation of semantic conflicts using metadata , 1989 .

[37]  Maria Linn Paget,et al.  A knowledge-based approach toward integrating international on-line databases , 1989 .

[38]  Stuart E. Madnick,et al.  The inter-database instance identification problem in integrating autonomous systems , 1989, [1989] Proceedings. Fifth International Conference on Data Engineering.

[39]  Michael Stonebraker,et al.  The design of POSTGRES , 1986, SIGMOD '86.