READ2: Put disks at network level

Grand challenge applications have to process large amounts of data and therefore require high-performance IO systems. Cluster computing is a good alternative to proprietary systems for building cost-effective, IO-intensive platforms: some cluster architectures have won sorting benchmarks (MinuteSort, Datamation). Recent advances in IO component technologies (disk, controller and network) let us expect higher IO performance for data-intensive applications on clusters. The counterpart of this evolution is that much stress is put on the different buses (memory, IO) of each node, which cannot be scaled. In this paper we investigate a strategy we call READ2 (Remote Efficient Access to Distant Device) to reduce this stress. With READ2, any cluster node directly accesses a remote disk: the remote processor and the remote memory are removed from the control and data paths, so inputs/outputs do not interfere with the host processor and host memory activity. With the READ2 strategy, a cluster can be considered a shared-disk architecture instead of a shared-nothing one. This paper describes an implementation of READ2 on Myrinet networks. First experimental results show an IO performance improvement.
