Disk write caching with an optical network

In this paper we propose a simple extension to the optical network of a scalable multiprocessor that optimizes page swap-outs significantly. More specifically, we propose to extend the network with an optical ring that not only transfers swapped-out pages between the local memories and the disks of the multiprocessor but also acts as a system-wide write cache for these pages. This extended optical network has several performance benefits: it provides a staging area where swapped-out pages can reside until the disk is free; it increases the possibility of combining several writes to disk; and it acts as a victim cache for pages that are swapped out and subsequently accessed by the same or a different processor. In order to evaluate the extent to which these benefits affect performance, we use detailed execution-driven simulations of several out-of-core parallel applications running on an 8-node scalable multiprocessor. Our results demonstrate that our optical ring provides consistent performance improvements, coming mostly from faster page swap-outs and victim caching. Based on these results and on our parameter space study, our main conclusion is that our optical ring is highly efficient under several architectural assumptions and for most out-of-core parallel applications.

[1]  R. Langenhorst,et al.  Fiber loop optical buffer , 1996 .

[2]  H. F. Jordan,et al.  Fiber optic delay line memory. , 1990, Applied optics.

[3]  Brian N. Bershad,et al.  A trace-driven comparison of algorithms for parallel prefetching and caching , 1996, OSDI '96.

[4]  Qing Yang,et al.  DCD --- Disk Caching Disk: A New Approach for Boosting I/O Performance , 1996, 23rd Annual International Symposium on Computer Architecture (ISCA'96).

[5]  Ricardo Bianchini,et al.  OPTNET: a cost-effective optical network for multiprocessors , 1998, ICS '98.

[6]  Harry F. Jordan,et al.  Optoelectronic time-of-flight design and the demonstration of an all-optical, stored program, digital computer , 1994 .

[7]  R. S. Fabry,et al.  A fast file system for UNIX , 1984, TOCS.

[8]  John Wilkes,et al.  UNIX Disk Access Patterns , 1993, USENIX Winter.

[9]  Allan Porterfield,et al.  The Tera computer system , 1990 .

[10]  Ricardo Bianchini,et al.  NetCache: A Network/Cache Hybrid for Multiprocessors , 1999, IPPS/SPDP Workshops.

[11]  Garth A. Gibson,et al.  Parity logging disk arrays , 1994, TOCS.

[12]  Yiming Hu,et al.  DCD—disk caching disk: a new approach for boosting I/O performance , 1996, ISCA '96.

[13]  Todd C. Mowry,et al.  Automatic compiler-inserted I/O prefetching for out-of-core applications , 1996, OSDI '96.

[14]  Ricardo Bianchini,et al.  The MIT Alewife machine: architecture and performance , 1995, Proceedings 22nd Annual International Symposium on Computer Architecture.

[15]  Mendel Rosenblum,et al.  The design and implementation of a log-structured file system , 1991, SOSP '91.

[16]  Ricardo Bianchini,et al.  NWCache: Optimizing Disk Accesses via an Optical Network/Write Cache Hybrid , 1999, IPPS/SPDP Workshops.

[17]  Robert J. Fowler,et al.  MINT: a front end for efficient simulation of shared-memory multiprocessors , 1994, Proceedings of International Workshop on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.