A Semantic Search Engine for the Storage Resource Broker

Information discovery is becoming a major challenge as terabyte-scale data grids grow. To manage their distributed data collections, many scientific organizations are adopting the San Diego Supercomputer Center's Storage Resource Broker (SRB). Data stored in SRB is indexed and retrieved through SRB's Metadata Catalogue (MCAT). MCAT focuses primarily on system or administrative metadata but supports domain-specific metadata through user-defined extensions. Although this approach provides maximum flexibility, it leads to interoperability problems when searching across distributed collections described using different user-defined metadata schemas. The aim of the work described in this paper is to semantically augment SRB through an ontology and Resource Description Framework (RDF) descriptions in order to support arbitrary metadata schemas and to enhance the system's search capabilities. In particular, we describe a semantic search engine and interface, built on top of an OWL ontology, RDF instance data and a Jena reasoning engine, that enables easier and more sophisticated searching of heterogeneous data stored using SRB.
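The interoperability idea can be illustrated with a minimal sketch, which is not the paper's implementation. It assumes two user-defined metadata schemas whose properties are mapped to a shared ontology property, and it mimics in plain Python the kind of sub-property inference a reasoner such as Jena could perform over RDF data. Every name in it (the `chem:`, `ocean:` and `core:` prefixes, the property names, and the `srb:` object paths) is hypothetical.

```python
# Hypothetical sketch: the prefixes, property names and object paths below
# are illustrative assumptions, not taken from SRB, MCAT or the paper's ontology.

# Ontology fragment: each user-defined property is declared a
# sub-property of a shared property.
SUB_PROPERTY_OF = {
    "chem:compoundName": "core:subject",
    "ocean:surveyTopic": "core:subject",
}

# RDF-style instance data as (subject, predicate, object) triples.
TRIPLES = [
    ("srb:/zoneA/run42.dat", "chem:compoundName", "benzene"),
    ("srb:/zoneB/cruise7.nc", "ocean:surveyTopic", "benzene"),
    ("srb:/zoneB/cruise8.nc", "ocean:surveyTopic", "salinity"),
]


def super_properties(prop):
    """Yield prop followed by every ancestor reachable via SUB_PROPERTY_OF."""
    seen = set()
    while prop is not None and prop not in seen:
        seen.add(prop)
        yield prop
        prop = SUB_PROPERTY_OF.get(prop)


def infer(triples):
    """Apply sub-property entailment: (s p o) implies (s q o) for each ancestor q of p."""
    return {(s, q, o) for s, p, o in triples for q in super_properties(p)}


def search(triples, predicate, value):
    """Return the sorted subjects whose asserted or inferred predicate has the given value."""
    return sorted(s for s, p, o in infer(triples) if p == predicate and o == value)


if __name__ == "__main__":
    # One query on the shared property reaches records from both schemas.
    print(search(TRIPLES, "core:subject", "benzene"))
```

A query on the shared property `core:subject` finds objects described under either schema, while a query on a schema-specific property still returns only that schema's records.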
