Parallel out-of-core sorting and fast accesses to disks

The paper addresses two problems. We investigate the problem of parallel external sorting in the context of a form of heterogeneous clusters then we investigate the impact of efficient disk remote accesses on the performance of external sorting. We explore three techniques to show how they can be deployed for clusters with proportional processor performances. We also validate the READ2 library, an efficient implementation of remote SCSI disk accesses. We derive a new parallel sorting algorithm that is adapted to the READ2 interface. The expected gain of using READ2 is compared to the measured gain for one external sorting implementation.

[1]  Rodney Van Meter,et al.  Network attached storage architecture , 2000, CACM.

[2]  Christophe Cérin An out-of-core sorting algorithm for clusters with processors at different speed , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[3]  Ali R. Hurson,et al.  Scheduling and Load Balancing in Parallel and Distributed Systems , 1995 .

[4]  Erich Schikuta,et al.  Analysis and evaluation of sorting on hypercube-based systems , 1996, Proceedings of 4th Euromicro Workshop on Parallel and Distributed Processing.

[5]  Jonathan Schaeffer,et al.  Parallel Sorting by Regular Sampling , 1992, J. Parallel Distributed Comput..

[6]  Guy E. Blelloch,et al.  A comparison of sorting algorithms for the connection machine CM-2 , 1991, SPAA '91.

[7]  Simon Walton,et al.  Efficient High-Speed Data Paths for IP Forwarding using Host Based Routers , 2003 .

[8]  Gil Utard,et al.  Improving MPI-I/O Performance on PVFS , 2001, Euro-Par.

[9]  Gil Utard,et al.  READ2: Put disks at network level , 2003 .

[10]  Henri E. Bal,et al.  User-Level Network Interface Protocols , 1998, Computer.

[11]  Matthew D. Pearson Fast Out-of-Core Sorting on Parallel Disk Systems , 1999 .

[12]  S. Lakshmivarahan,et al.  Parallel Sorting Algorithms , 1984, Adv. Comput..

[13]  Andrew A. Chien,et al.  PPFS: a high performance portable parallel file system , 1995, ICS '95.

[14]  Patrick Geoffray,et al.  OPIOM: off-processor IO with Myrinet , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[15]  Gil Utard,et al.  Improving cluster IO performance with remote efficient access to distant device , 2002, 27th Annual IEEE Conference on Local Computer Networks, 2002. Proceedings. LCN 2002..

[16]  Jeffrey Scott Vitter,et al.  Greed sort: optimal deterministic sorting on parallel disks , 1995, JACM.

[17]  Mohamed Jemni,et al.  A synthesis of parallel out-of-core sorting programs on heterogeneous clusters , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[18]  Alok Aggarwal,et al.  The input/output complexity of sorting and related problems , 1988, CACM.

[19]  Alice Bonhomme,et al.  High Performance Video Server using Myrinet , 2000 .

[20]  Javier Jaén Martínez,et al.  Data Management in an International Data Grid Project , 2000, GRID.

[21]  Hui Li,et al.  Parallel sorting by over partitioning , 1994, SPAA '94.

[22]  Hector Garcia-Molina,et al.  Disk striping , 1986, 1986 IEEE Second International Conference on Data Engineering.

[23]  Dieter Kranzlmüller Scheduling and load balancing , 2003 .

[24]  Grant Erickson,et al.  A 64-bit, shared disk file system for Linux , 1999, 16th IEEE Symposium on Mass Storage Systems in cooperation with the 7th NASA Goddard Conference on Mass Storage Systems and Technologies (Cat. No.99CB37098).

[25]  Jeffrey Scott Vitter,et al.  Algorithms for parallel memory, II: Hierarchical multilevel memories , 1992, Algorithmica.

[26]  Donald E. Knuth,et al.  The art of computer programming, volume 3: (2nd ed.) sorting and searching , 1998 .

[27]  Sanguthevar Rajasekaran,et al.  A Framework for Simple Sorting Algorithms on Parallel Disk Systems (Extended Abstract) , 1998, SPAA.

[28]  Thomas H. Cormen,et al.  Early Experiences in Evaluating the Parallel Disk Model with the ViC* Implementation , 1996, Parallel Comput..

[29]  Jeffrey Scott Vitter,et al.  Algorithms for parallel memory, I: Two-level memories , 2005, Algorithmica.

[30]  Michelle Y. Kim,et al.  Synchronized Disk Interleaving , 1986, IEEE Transactions on Computers.

[31]  David A. Patterson,et al.  The Art of Massive Storage: A Web Image Archive , 2000, Computer.

[32]  Donald E. Knuth,et al.  Sorting and Searching , 1973 .

[33]  J. S. Huang,et al.  Parallel sorting and data partitioning by sampling , 1983 .

[34]  David J. DeWitt,et al.  Parallel sorting on a shared-nothing architecture using probabilistic splitting , 1991, [1991] Proceedings of the First International Conference on Parallel and Distributed Information Systems.

[35]  Andrea C. Arpaci-Dusseau,et al.  The architectural costs of streaming I/O: A comparison of workstations, clusters, and SMPs , 1998, Proceedings 1998 Fourth International Symposium on High-Performance Computer Architecture.

[36]  Walter B. Ligon,et al.  An Overview of the Parallel Virtual File System , 1999 .