Improving cluster IO performance with remote efficient access to distant device

Several pieces of grand challenge application software in biology or physics process large datasets which are stored on disks. These applications require high performance IO systems. Cluster computing is a good approach to build an IO intensive platform. In this paper we investigate a technique we called READ/sup 2/ which stands for Remote Efficient Access to Distant Device. The aim of this technique is to reduce the stress which is put on the buses of the cluster's node during the execution of IO streaming applications using parallel IO. In READ/sup 2/ any cluster's node directly accesses a remote disk of a distant node: the distant processor and the distant memory are removed from the control and data path. With this technique, a cluster can be considered as a shared disk architecture instead of a shared nothing one, and may inherit works from the SAN community. This paper presents what the architectural benefit of READ/sup 2/ are, i.e. a better use of IO and memory buses which eventually improve the IO scalability of a cluster and the performance of streaming applications.

[1]  Rodney Van Meter,et al.  Network attached storage architecture , 2000, CACM.

[2]  Patrick Geoffray OPIOM: Off-Processor I/O with Myrinet , 2002, Future Gener. Comput. Syst..

[3]  David A. Patterson,et al.  The Art of Massive Storage: A Web Image Archive , 2000, Computer.

[4]  Andrea C. Arpaci-Dusseau,et al.  High-performance sorting on networks of workstations , 1997, SIGMOD '97.

[5]  Grant Erickson,et al.  A 64-bit, shared disk file system for Linux , 1999, 16th IEEE Symposium on Mass Storage Systems in cooperation with the 7th NASA Goddard Conference on Mass Storage Systems and Technologies (Cat. No.99CB37098).

[6]  Simon Walton,et al.  Efficient High-Speed Data Paths for IP Forwarding using Host Based Routers , 2003 .

[7]  Andrew A. Chien,et al.  PPFS: a high performance portable parallel file system , 1995, ICS '95.