A Versatile Searchable File System for HPC Analytics

Emerging HPC analytics applications urgently demand filesearch services to drastically reduce the scale of the input data in real-time, so that the speed of computation and data analytics can be greatly accelerated. Unfortunately, the existing file-search solutions are either poorly scalable for large-scale systems, or lack a well-integrated interface to allow applications to easily use them for critical tasks. We believe that the time is ripe for the design of a searchable file system capable of accurate and scalable system-level filesearch functionality. In this paper, we propose a Versatile Searchable File System, VSFS, which provides a transparent, accurate and real-time file-search service through a POSIX-compatible file system namespace that can be integrated into any HPC/Big Data legacy code without modifications. Additionally, to support real-time file search, VSFS uses a DRAM-based distributed architecture to perform real-time file indexing. Moreover, a versatile index scheme is designed to adapt to the various forms of HPC datasets. The results of our VSFS prototype evaluation show that VSFS is scalable in a typical HPC environment. It achieves significantly better file-indexing and file-search performance than the popular SQL/NoSQL solutions, while it only introduces negligible I/O overhead. Finally, we integrate VSFS to a scientific analytics application to show its benefits in terms of performance and ease of use.

[1]  David H. Small,et al.  Nowcasting: the real time informational content of macroeconomic data releases , 2008 .

[2]  Amar Phanishayee,et al.  FAWN: a fast array of wimpy nodes , 2009, SOSP '09.

[3]  Detlef D. Nauck,et al.  Real Time Business Intelligence for the Adaptive Enterprise , 2006, The 8th IEEE International Conference on E-Commerce Technology and The 3rd IEEE International Conference on Enterprise Computing, E-Commerce, and E-Services (CEC/EEE'06).

[4]  Yu Hua,et al.  Propeller: A Scalable Metadata Organization for A Versatile Searchable File System , 2011 .

[5]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[6]  E. Mardis Next-generation DNA sequencing methods. , 2008, Annual review of genomics and human genetics.

[7]  Pierre Jouvelot,et al.  Semantic file systems , 1991, SOSP '91.

[8]  George Bosilca,et al.  Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation , 2004, PVM/MPI.

[9]  Cici 将搜索延伸到桌面——Google Desktop Search , 2004 .

[10]  F. Tödtling,et al.  One size fits all?: Towards a differentiated regional innovation policy approach , 2005 .

[11]  Yawei Li,et al.  Megastore: Providing Scalable, Highly Available Storage for Interactive Services , 2011, CIDR.

[12]  Shankar Pasupathy,et al.  Spyglass: Fast, Scalable Metadata Search for Large-Scale Storage Systems , 2009, FAST.

[13]  Mike J. Mason,et al.  Induced pluripotent stem cells and embryonic stem cells are distinguished by gene expression signatures. , 2009, Cell stem cell.

[14]  Jacob R. Lorch,et al.  A five-year study of file-system metadata , 2007, TOS.

[15]  Lu Zhang,et al.  Large scale real-time PCR validation on gene expression measurements from two commercial long-oligonucleotide microarrays , 2006, BMC Genomics.

[16]  Komal Shringare,et al.  Apache Hadoop Goes Realtime at Facebook , 2015 .

[17]  Udi Manber,et al.  Integrating content-based access mechanisms with hierarchical file systems , 1999, OSDI '99.

[18]  Krzysztof Janowicz,et al.  spatial@linkedscience - Exploring the Research Field of GIScience with Linked Data , 2012, GIScience.

[19]  Bradley M. Hemminger,et al.  Scientific data repositories on the Web: An initial survey , 2010, J. Assoc. Inf. Sci. Technol..

[20]  Robert B. Hagmann,et al.  Reimplementing the Cedar file system using logging and group commit , 1987, SOSP '87.

[21]  Karthik Ranganathan,et al.  Apache hadoop goes realtime at Facebook , 2011, SIGMOD '11.

[22]  Hong Jiang,et al.  SmartStore: a new metadata organization paradigm with semantic-awareness for next-generation file systems , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[23]  René Thomsen,et al.  MolDock: a new technique for high-accuracy molecular docking. , 2006, Journal of medicinal chemistry.

[24]  Andrea C. Arpaci-Dusseau,et al.  Analysis and Evolution of Journaling File Systems , 2005, USENIX Annual Technical Conference, General Track.

[25]  Craig A. N. Soules,et al.  Connections: using context to enhance file search , 2005, SOSP '05.

[26]  Andrey Gubarev,et al.  Dremel : Interactive Analysis of Web-Scale Datasets , 2011 .

[27]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[28]  Alexander S. Szalay,et al.  Just-in-Time Analytics on Large File Systems , 2011, IEEE Transactions on Computers.

[29]  Bin Zhou,et al.  Scalable Performance of the Panasas Parallel File System , 2008, FAST.

[30]  Werner Vogels,et al.  Dynamo: amazon's highly available key-value store , 2007, SOSP.

[31]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[32]  Carlos Maltzahn,et al.  Ceph: a scalable, high-performance distributed file system , 2006, OSDI '06.

[33]  P. G. Neumann,et al.  A general-purpose file system for secondary storage , 1965, Published in AFIPS '65 (Fall, part I).

[34]  Henrik Loeser,et al.  "One Size Fits All": An Idea Whose Time Has Come and Gone? , 2011, BTW.

[35]  Adam Silberstein,et al.  Benchmarking cloud serving systems with YCSB , 2010, SoCC '10.

[36]  Mendel Rosenblum,et al.  Fast crash recovery in RAMCloud , 2011, SOSP.

[37]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[38]  David R. Karger,et al.  Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web , 1997, STOC '97.

[39]  Scott Michael,et al.  Demonstrating Lustre over a 100Gbps wide area network of 3,500km , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[40]  Margo I. Seltzer,et al.  Hierarchical File Systems Are Dead , 2009, HotOS.

[41]  Kyle Banker,et al.  MongoDB in Action , 2011 .

[42]  Jim Gray,et al.  To BLOB or Not To BLOB: Large Object Storage in a Database or a Filesystem? , 2007, ArXiv.

[43]  David R. Karger,et al.  Web Caching with Consistent Hashing , 1999, Comput. Networks.

[44]  Rick Cattell,et al.  Scalable SQL and NoSQL data stores , 2011, SGMD.

[45]  Christopher Frost,et al.  Spanner: Google's Globally-Distributed Database , 2012, OSDI.

[46]  Garth A. Gibson,et al.  Scale and Concurrency of GIGA+: File System Directories with Millions of Files , 2011, FAST.