Design of the Rio (randomized I/o) Storage Server

We describe the design and implementation of the RIO (Randomized I/O) multimedia object server which manages a set of parallel disks and supports real-time throughput and delay guarantees. This storage subsystem was implemented as part of a multimedia information server which supports multiple concurrent applications such as video on demand and 3D interactive virtual worlds. We discuss the principal issues and innovations involved in the design and implementation of the RIO storage system, and present experimental performance results measured on our prototype. A multimedia data server must be ready to handle a variety of realtime object types (video, audio, 3D interactive virtual worlds, etc.) as well as non realtime workload. Achieving simultaneously (1) high utilization and (2) low latency with a high degree of certainty is the challenge. Our prototype system provides a statistical guarantee of quality of service. Our experimental results shows that it is possible to achieve a very small probability of missing a deadline (less than 10 ?6), with a relatively high disk utilization (70% to 99%, in terms of fraction of the maximum disk capacity) , together with relatively small deadlines (on the order of 0.5 sec. to 1.5 sec), using contemporary disks. To achieve guaranteed low delay with high utilization requires good short term load balancing to keep disk queue length distributions from having long tails. For workloads with predictable access patterns it is possible to try to exploit this predictability to gain the desired eeect. However this is an approach with limited applicability and quickly becomes untenable as the workload becomes more diverse. Our approach is to randomize the physical allocation of disk blocks across the system's multiple disks. At the level of physical disk block access this turns all workloads into the same uniformly random access pattern and thus gives us one problem to deal with. Since each disk is equally likely to be the target of each disk access, this approach provides long range load balance. However short term statistical variations can result in short term imbalances in disk queues which in turn imply either (1) the guaranteed maximum latency must be increased or (2) a higher probability of missing the deadline. Our approach to counteracting this problem is to introduce limited redundancy by replicating (at random) some fraction of the data blocks. This data redundancy gives the disk scheduler some exibility in scheduling disk block reads, allowing the system to reduce …

[1]  Randy H. Katz,et al.  RAMA: Easy Access to a High-Bandwidth Massively Parallel File System , 1995, USENIX.

[2]  Shahram Ghandeharizadeh,et al.  Staggered striping in multimedia information systems , 1994, SIGMOD '94.

[3]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[4]  Philip S. Yu Mon-Song Chen, Dilip D. Kandlur: Design and Analysis of a Grouped Sweeping Scheme for Multimedia Storage Management , 1992, NOSSDAV.

[5]  Carlo H. Séquin,et al.  Management of large amounts of data in interactive building walkthroughs , 1992, I3D '92.

[6]  P. Venkat Rangan,et al.  Multimedia Storage Servers: A Tutorial , 1995, Computer.

[7]  James W. Layland,et al.  Scheduling Algorithms for Multiprogramming in a Hard-Real-Time Environment , 1989, JACM.

[8]  A. L. Narasimha Reddy,et al.  Disk scheduling in a multimedia I/O system , 1993, MULTIMEDIA '93.

[9]  Jonathan C. L. Liu,et al.  Performance of a Mass-Storage System for Video-on-Demand , 1995, J. Parallel Distributed Comput..

[10]  Richard R. Muntz,et al.  Randomized data allocation for real-time disk I/O , 1996, COMPCON '96. Technologies for the Information Superhighway Digest of Papers.

[11]  David J. DeWitt,et al.  Chained declustering: a new availability strategy for multiprocessor database machines , 1990, [1990] Proceedings. Sixth International Conference on Data Engineering.

[12]  Yitzhak Birk,et al.  A novel video layout strategy for near-video-on-demand servers , 1997, Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[13]  Doron Rotem,et al.  Declustering Objects for Visualization , 1993, VLDB.

[14]  Charles U. Martel,et al.  On non-preemptive scheduling of period and sporadic tasks , 1991, [1991] Proceedings Twelfth Real-Time Systems Symposium.

[15]  Philip S. Yu,et al.  DASD dancing: a disk load balancing optimization scheme for video-on-demand computer systems , 1995, SIGMETRICS '95/PERFORMANCE '95.

[16]  Yale N. Patt,et al.  On-line extraction of SCSI disk drive parameters , 1995, SIGMETRICS '95/PERFORMANCE '95.

[17]  Randy H. Katz,et al.  Multi-resolution video representation for parallel disk arrays , 1993, MULTIMEDIA '93.

[18]  D. Leneman,et al.  Experimental observation of Alfvén wave cones , 1994 .

[19]  Randy H. Katz,et al.  Choosing the best storage system for video service , 1995, MULTIMEDIA '95.