Towards Implementing Virtual Data Infrastructures - a Case Study with iRODS

Scientists demand easy-to-use, scalable and flexible infrastructures for sharing, managing and processing their data spread over multiple resources accessible via different technologies and interfaces. In our previous work, we developed the conceptual framework VISPA for addressing these requirements. This paper provides a case study assessing the integrated Rule-Oriented Data System (iRODS) for implementing the key concepts of VISPA. We found that iRODS is already well suited for handling metadata and sharing data. Although it does not directly support provenance information of data and the temporal provisioning of data, basic forms of these capabilities may be provided through its customization mechanisms, ie rules and micro-services.

[1]  Dan Walsh,et al.  Design and implementation of the Sun network filesystem , 1985, USENIX Conference Proceedings.

[2]  Ian Foster,et al.  The Globus toolkit , 1998 .

[3]  Andreas Schreiber,et al.  A Data Management System for UNICORE 6 , 2009, Euro-Par Workshops.

[4]  Brian Matthews,et al.  Enhancing the Core Scientific Metadata Model to Incorporate Derived Data , 2010, 2010 IEEE Sixth International Conference on e-Science.

[5]  Sandra Payette,et al.  Fedora: an architecture for complex objects and their relationships , 2005, International Journal on Digital Libraries.

[6]  Andreas Aschenbrenner,et al.  Reference framework for distributed repositories: towards an open repository environment , 2009 .

[7]  Reagan Moore,et al.  Enabling Inter-Repository Access Management between iRODS and Fedora , 2009 .

[8]  Carlos Maltzahn,et al.  Ceph: a scalable, high-performance distributed file system , 2006, OSDI '06.

[9]  A. Rajasekar,et al.  Integration of Cloud Storage with Data Grids , 2009 .

[10]  Robert Wilensky,et al.  A framework for distributed digital object services , 2006, International Journal on Digital Libraries.

[11]  Vincent Y. Lum,et al.  EXPRESS: a data EXtraction, Processing, and Restructuring System , 1977, TODS.

[12]  Andrew J. Hutton,et al.  Lustre: Building a File System for 1,000-node Clusters , 2003 .

[13]  Michael Hoppe,et al.  eSciDoc Infrastructure: A Fedora-Based e-Research Framework , 2009, ECDL.

[14]  Dennis Gannon,et al.  Active management of scientific data , 2005, IEEE Internet Computing.

[15]  Eugenio Cesario,et al.  The XtreemFS architecture—a case for object‐based file systems in Grids , 2008, Concurr. Comput. Pract. Exp..

[16]  Marian Bubak,et al.  Development and execution of collaborative application on the ViroLab virtual laboratory , 2008 .