Active network file system for data mining and multimedia

Storage Area Networks (SAN) exploit advances in networking to make storage remote. SANs enable large scale storage and sharing which in turn facilitate data mining and multimedia applications. These applications are also compute intensive. Active disks make use of the computational power available in the disk to reduce storage traffic. Many of the file system proposals for active disks work at the block level. In this paper we argue for the necessity of filtering at application level. The proposed active network file system (ANFS) is based on the familiar network file system (NFS). We outline the implementation of ANFS in Linux and demonstrate its applicability to data mining and multimedia applications.

[1]  Spencer Shepler NFS Version 4 Design Considerations , 1999, RFC.

[2]  Matthew T. O'Keefe,et al.  The Global File System , 1996 .

[3]  A. L. Narasimha Reddy,et al.  MVSS: Multi-View Storage System , 2001, Proceedings 21st International Conference on Distributed Computing Systems.

[4]  David Clark,et al.  Architectural considerations for a new generation of protocols , 1990, SIGCOMM 1990.

[5]  Matthew T. O'Keefe,et al.  Device Locks: mutual exclusion for storage area networks , 1999, 16th IEEE Symposium on Mass Storage Systems in cooperation with the 7th NASA Goddard Conference on Mass Storage Systems and Technologies (Cat. No.99CB37098).

[6]  Carl Smith,et al.  NFS Version 3: Design and Implementation , 1994, USENIX Summer.

[7]  Grant Erickson,et al.  The Design and Performance of a Shared Disk File System for IRIX , 1998 .

[8]  Joel H. Saltz,et al.  Active disks: programming model, algorithms and evaluation , 1998, ASPLOS VIII.

[9]  Uresh K. Vahalia UNIX Internals: The New Frontiers , 1995 .

[10]  Christos Faloutsos,et al.  Active Storage for Large-Scale Data Mining and Multimedia , 1998, VLDB.

[11]  David Hung-Chang Du,et al.  Active Disk File System : A Distributed, Scalable File System , 2001, 2001 Eighteenth IEEE Symposium on Mass Storage Systems and Technologies.

[12]  Christian Böhm,et al.  Improving the Query Performance of High-Dimensional Index Structures by Bulk-Load Operations , 1998, EDBT.

[13]  Christian Böhm,et al.  Fast parallel similarity search in multimedia databases , 1997, SIGMOD '97.

[14]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[15]  Jim Zelenka,et al.  File server scaling with network-attached secure disks , 1997, SIGMETRICS '97.