The Back-End of a Two-Layer Model for a Federated National Datastore for Academic Research VOs that Integrates EGEE Data Management

This paper proposes an architecture for the back-end of a federated national datastore for academic research communities, developed by the e-INIS (Irish National e-InfraStructure) project, and describes in detail one member of the federation: the regional datastore at Trinity College Dublin. The architecture builds upon existing infrastructure and services, including Grid-Ireland, the National Grid Initiative and EGEE, Europe’s leading Grid infrastructure. It assumes that users belong to distinct research communities and that their data access patterns can be described by two properties, mutability and frequency of access. The architecture covers only the back-end, because individual academic communities are best qualified to define their own front-end services and user interfaces. It is designed to facilitate front-end development by placing minimal restrictions on how the front-end is implemented and on each community’s internal security policies. It also aims to insulate the communities from the back-end and from each other, both to ensure quality of service and to decouple their front-end implementations from site-specific back-end implementations.
