Tertiary Storage: An Evaluation of New Applications

This thesis focuses on an often-neglected area of computer system design: tertiary storage. In the last decade, several advances in tertiary storage have made it of increasing interest, including increased tape capacities, less expensive tape drives and optical disk drives, and the proliferation of robots for loading tertiary devices automatically. Concurrently, faster processor speeds have enabled a growing number of applications that would benefit from fast access to massive storage. We evaluate the usefulness of current tertiary storage systems for some of these new applications. First, we describe the design and performance of tertiary storage products. Next, we evaluate the technique of data striping in tape arrays. We find that tape striping improves the performance of sequential workloads. However, striped tape systems perform poorly for applications in which there are several non-sequential, concurrent requests active in the tape library because of contention for a small number of tape drives. We characterize two new workloads: video-on-demand servers and digital libraries. For the former, we evaluate design alternatives for providing storage in a movies-on-demand system. First, we study disk farms in which one movie is stored per disk. This is a simple scheme, but it wastes substantial disk bandwidth, since disks holding less popular movies are under-utilized; also, good performance requires that movies be replicated to reflect the user request pattern. Next, we examine disk farms in which movies are striped across disks, and find that striped video servers offer close to full utilization of the disks by achieving better load balancing. Finally, we evaluate the use of storage hierarchies for video service that include a tertiary library along with a disk farm. Unfortunately, we show that the performance of neither magnetic tape libraries nor optical disk jukeboxes as part of a storage hierarchy is adequate to service the predicted distribution of movie accesses. Throughout the dissertation, we identify several desirable changes in tertiary storage systems. To support new applications with higher concurrencies, tertiary libraries should be redesigned with a higher ratio of drives to media, higher bandwidth per drive and faster access times.

[1]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[2]  J.C. Mallinson,et al.  Tutorial review of magnetic recording , 1976, Proceedings of the IEEE.

[3]  H. Sugaya Recent advances in video tape recording , 1978 .

[4]  Alan Jay Smith Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms , 1981, IEEE Transactions on Software Engineering.

[5]  Alan Jay Smith,et al.  Long term file migration: development and evaluation of algorithms , 1981, CACM.

[6]  John P. Harris,et al.  Innovations in the Design of Magnetic Tape Subsystems , 1981, IBM J. Res. Dev..

[7]  Duncan H. Lawrie,et al.  Experiments with Automatic File Migration , 1982, Computer.

[8]  R.A. Bartolini Optical recording: High-density information storage and retrieval , 1982, Proceedings of the IEEE.

[9]  Paul H. Siegel,et al.  Recording codes for digital magnetic storage , 1985 .

[10]  Richard A. Baugh,et al.  Extremely low error rate digital recording with a helical scan recorder , 1986 .

[11]  Akihiko Hitomi,et al.  Servo Technology of R-DAT , 1986, IEEE Transactions on Consumer Electronics.

[12]  Hector Garcia-Molina,et al.  Disk striping , 1986, 1986 IEEE Second International Conference on Data Engineering.

[13]  Ray R. Larson,et al.  Workload characteristics and computer system utilization in online library catalogs , 1986 .

[14]  W.H. Meiklejohn,et al.  Magnetooptics: A thermomagnetic recording technology , 1986, Proceedings of the IEEE.

[15]  Michelle Y. Kim,et al.  Synchronized Disk Interleaving , 1986, IEEE Transactions on Computers.

[16]  R.W. Wood,et al.  Magnetic recording systems , 1986, Proceedings of the IEEE.

[17]  Fukuzo Itoh,et al.  Magnetic Tape and Cartridge of R-DAT , 1986, IEEE Transactions on Consumer Electronics.

[18]  Miron Livny,et al.  Multi-disk management algorithms , 1987, SIGMETRICS '87.

[19]  Tatsuya Adachi,et al.  A Fast Random Accessing Scheme for R-DAT , 1987, IEEE Transactions on Consumer Electronics.

[20]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[21]  Ken Spencer Terabyte optical tape recorder , 1988, Digest of Papers Ninth IEEE Symposium on Mass Storage Systems, 1988. 'Storage Systems: Perspectives'.

[22]  Tracy G. Wood D-1 through DAT , 1988, Digest of Papers Ninth IEEE Symposium on Mass Storage Systems, 1988. 'Storage Systems: Perspectives'.

[23]  C. Mee,et al.  Magnetic recording; Vol. II: computer data storage , 1988 .

[24]  Eric D. Daniel,et al.  Magnetic recording. Vol. III: video, audio, and instrumentation recording , 1988 .

[25]  Erich Thanhardt,et al.  File migration in the NCAR mass storage system , 1988, Digest of Papers Ninth IEEE Symposium on Mass Storage Systems, 1988. 'Storage Systems: Perspectives'.

[26]  Randy H. Katz,et al.  How reliable is a RAID? , 1989, Digest of Papers. COMPCON Spring 89. Thirty-Fourth IEEE Computer Society International Conference: Intellectual Leverage.

[27]  Gary M. King,et al.  Storage Hierarchies , 1989, IBM Syst. J..

[28]  Don Petravick,et al.  Exabyte helical scan devices at Fermilab , 1989 .

[29]  P. Hansen,et al.  Media for erasable magnetooptic recording , 1989 .

[30]  M. Sharrock Particulate magnetic recording media: a review , 1989 .

[31]  Takaharu Noguchi,et al.  A new concept of DATA/DAT system , 1989 .

[32]  E. Tan,et al.  Digital audio tape for data storage , 1989, IEEE Spectrum.

[33]  Mark H. Kryder,et al.  Data storage in 2000-trends in data storage technologies , 1989 .

[34]  David A. Patterson,et al.  Maximizing performance in a striped disk array , 1990, ISCA '90.

[35]  David Tweten Hiding mass storage under Unix: NASA's MSS-II architecture , 1990, [1990] Digest of papers. Tenth IEEE Symposium on Mass Storage Systems@m_Crisis in Mass Storage.

[36]  Bharat Bhushan,et al.  Tribology and Mechanics of Magnetic Storage Devices , 1990 .

[37]  Fred McClain DataTree and UniTree: software for file and storage management , 1990, [1990] Digest of papers. Tenth IEEE Symposium on Mass Storage Systems@m_Crisis in Mass Storage.

[38]  Carole B. Hogan,et al.  The Livermore distributed storage system: implementation and experiences , 1990, [1990] Digest of papers. Tenth IEEE Symposium on Mass Storage Systems@m_Crisis in Mass Storage.

[39]  John C. Mallinson Magnetic tape recording: archival considerations , 1990, [1990] Digest of papers. Tenth IEEE Symposium on Mass Storage Systems@m_Crisis in Mass Storage.

[40]  Jim Gray,et al.  Parity Striping of Disk Arrays: Low-Cost Reliable Storage with Acceptable Throughput , 1990, VLDB.

[41]  J. C. Mallinson Achievements in rotary head magnetic recording , 1990 .

[42]  Thomas W. Lanzatella,et al.  Storage management issues for Cray Research , 1990, [1990] Digest of papers. Tenth IEEE Symposium on Mass Storage Systems@m_Crisis in Mass Storage.

[43]  Harry C. Hinz Magnetic tape technology in the 1990s , 1990, [1990] Digest of papers. Tenth IEEE Symposium on Mass Storage Systems@m_Crisis in Mass Storage.

[44]  Margo I. Seltzer,et al.  Disk Scheduling Revisited , 1990 .

[45]  Steven Broeils Luitjens,et al.  Magnetic recording trends: media developments and future (video) recording systems , 1990 .

[46]  Carole B. Hogan,et al.  The Livermore distributed storage system: requirements and overview , 1990, [1990] Digest of papers. Tenth IEEE Symposium on Mass Storage Systems@m_Crisis in Mass Storage.

[47]  Randy H. Katz,et al.  An evaluation of redundant arrays of disks using an Amdahl 5890 , 1990, SIGMETRICS '90.

[48]  Alan Poston,et al.  Distributed NAStore as the next step , 1991, [1991] Digest of Papers Eleventh IEEE Symposium on Mass Storage Systems.

[49]  W. David Sincoskie,et al.  System Architecture for a Large Scale Video on Demand Service , 1991, Comput. Networks ISDN Syst..

[50]  Ben Kobler,et al.  NASA Earth Observing System Data Information System (EOSDIS) , 1991, [1991] Digest of Papers Eleventh IEEE Symposium on Mass Storage Systems.

[51]  B. Collins High-performance data systems , 1991, [1991] Digest of Papers Eleventh IEEE Symposium on Mass Storage Systems.

[52]  P. Venkat Rangan,et al.  Designing file systems for digital video and audio , 1991, SOSP '91.

[53]  Randy H. Katz,et al.  Input/output behavior of supercomputing applications , 1991, Proceedings of the 1991 ACM/IEEE Conference on Supercomputing (Supercomputing '91).

[54]  David P. Anderson,et al.  A continuous media I/O server and its synchronization mechanism , 1991, Computer.

[55]  Kiyoshi Itao,et al.  Automated optical mass storage systems with 3-beam magneto-optical disk drives , 1991, [1991] Digest of Papers Eleventh IEEE Symposium on Mass Storage Systems.

[56]  Stephen W. Miller MSS requirements for data acquisition systems , 1991, [1991] Digest of Papers Eleventh IEEE Symposium on Mass Storage Systems.

[57]  S. Redfield,et al.  Holostore technology for higher levels of memory hierarchy , 1991, [1991] Digest of Papers Eleventh IEEE Symposium on Mass Storage Systems.

[58]  Julian Richards,et al.  Volume management by the book: the NAStore Volume Manager , 1991, [1991] Digest of Papers Eleventh IEEE Symposium on Mass Storage Systems.

[59]  Andrew J. G. Strandjord,et al.  Flexible storage medium for write-once optical tape , 1992, Other Conferences.

[60]  Glenn T. Sincerbox,et al.  Selected papers on optical storage , 1992 .

[61]  Garth A. Gibson Redundant disk arrays: Reliable, parallel secondary storage. Ph.D. Thesis , 1990 .

[62]  H.M. Vin,et al.  Designing an on-demand multimedia service , 1992, IEEE Communications Magazine.

[63]  Fouad A. Tobagi,et al.  Streaming RAID: a disk array management system for video files , 1993, MULTIMEDIA '93.

[64]  Randy H. Katz,et al.  An Analysis of File Migration in a UNIX Supercomputing Environment , 1993, USENIX Winter.

[65]  Don L. Bosseau Anatomy of a small step forward , 1993 .

[66]  Gerry Schadegg Optimizing digital 8mm drive performance , 1993 .

[67]  Randy H. Katz,et al.  Performance modeling and analysis of disk arrays , 1993 .

[68]  Andrew J. Ruddick ICI optical data storage tape: An archival mass storage media , 1993 .

[69]  Steve Miller,et al.  Mass storage system reference model, Version 4 , 1993 .

[70]  K. Peter,et al.  Recording and wear characteristics of 4 and 8 mm helical scan tapes , 1993 .

[71]  Randy H. Katz,et al.  Striped tape arrays , 1993, [1993] Proceedings Twelfth IEEE Symposium on Mass Storage systems.

[72]  R. L. Haskin,et al.  The Shark continuous-media file server , 1993, Digest of Papers. Compcon Spring.

[73]  William M. Callicott Data management in NOAA , 1993 .

[74]  Garth A. Gibson,et al.  RAID: high-performance, reliable secondary storage , 1994, CSUR.

[75]  Dinkar Sitaram,et al.  A methodology for evaluating storage systems in distributed and hierarchical video servers , 1994, Proceedings of COMPCON '94.

[76]  Randy H. Katz,et al.  Storage alternatives for video service , 1994, Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Toward Distributed Storage and Data Management Systems.

[77]  D. Skellern,et al.  An open-systems approach to video on demand , 1994, IEEE Communications Magazine.

[78]  Lawrence A. Rowe,et al.  Distributed hierarchical storage manager for a video-on-demand system , 1994, Electronic Imaging.