Active disks: programming model, algorithms and evaluation

Several application and technology trends indicate that it might be both profitable and feasible to move computation closer to the data that it processes. In this paper, we evaluate Active Disk architectures, which integrate significant processing power and memory into a disk drive and allow application-specific code to be downloaded and executed on the data that is being read from (or written to) disk. The key idea is to offload the bulk of the processing to the disk-resident processors and to use the host processor primarily for coordination, scheduling and combination of results from individual disks. To program Active Disks, we propose a stream-based programming model which allows disklets (the code downloaded to the disks) to be executed efficiently and safely. Simulation results for a suite of six algorithms from three application domains (commercial data warehouses, image processing and satellite data processing) indicate that, for these algorithms, Active Disks outperform conventional-disk architectures.
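The division of labor described above can be illustrated with a minimal Python sketch. It is not the paper's disklet interface: the names `filter_sum_disklet`, `run_on_disk` and `host_combine`, the even-value filter, and the in-memory "disks" are all hypothetical, chosen only to show a disklet consuming an input stream and the host combining per-disk partial results.

```python
from typing import Callable, Iterable, Iterator, List

Record = int
Block = List[Record]
# Assumed shape of a disklet: it reads a stream of records and emits results.
Disklet = Callable[[Iterable[Record]], Iterator[int]]


def filter_sum_disklet(stream: Iterable[Record]) -> Iterator[int]:
    """Hypothetical disklet: filter records, emit a single partial sum."""
    total = 0
    for record in stream:
        if record % 2 == 0:  # illustrative predicate, not from the paper
            total += record
    yield total


def run_on_disk(disklet: Disklet, blocks: List[Block]) -> List[int]:
    """Simulate one disk streaming its blocks through a disklet."""
    def records() -> Iterator[Record]:
        for block in blocks:
            yield from block
    return list(disklet(records()))


def host_combine(partials: List[List[int]]) -> int:
    """Host-side step: combine the small per-disk results."""
    return sum(value for per_disk in partials for value in per_disk)


# Two simulated disks, each holding two blocks of records.
disks = [
    [[1, 2, 3], [4, 5, 6]],
    [[7, 8], [9, 10]],
]
partials = [run_on_disk(filter_sum_disklet, blocks) for blocks in disks]
result = host_combine(partials)
print(partials, result)
```

In this sketch only one integer per disk travels to the host, which is the point of the approach: the bulk of the data is processed where it is stored, and the host sees only the reduced results.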
