An Ontology Mapping Approach to Integrating Earth Science Metadata

One of the main barriers to exploiting the wealth of global earth science data available today is that researchers cannot quickly find the data relevant to their studies. This data is spread across many archives maintained by different institutions, which describe their holdings in a wide variety of metadata languages. In this paper, we describe a metadata federation approach that supports queries across multiple earth science data archives without requiring them to adopt a unified metadata standard. Our ontology-based approach employs a central metadata transformation facility that integrates heterogeneous metadata using a set of ‘translators’ and ‘wrappers’. This shifts the burden of federation from the data provider to the central facility, acknowledging that not all data providers have the motivation or resources to comply with externally imposed metadata standards. We demonstrate the feasibility of the approach with a proof-of-concept prototype that federates metadata across two earth science data archives (one containing NASA data and the other USDA data) despite the differences in their metadata languages.
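
The division of labor between wrappers, translators, and the central facility can be illustrated with a minimal sketch. It is not the prototype's implementation: the class names, the archive names (`archive_a`, `archive_b`), the native field names, and the shared concept names (`observedProperty`, `title`) are all hypothetical, and a plain dictionary stands in for the ontology mapping. The sketch assumes a wrapper only retrieves records in an archive's native form, while a translator renames native fields to shared concepts.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Record = Dict[str, str]


@dataclass
class Wrapper:
    """Retrieves records from one archive, still in its native field names."""
    name: str
    fetch: Callable[[], List[Record]]


@dataclass
class Translator:
    """Renames native fields to shared concept names (a stand-in for an ontology mapping)."""
    mapping: Dict[str, str]

    def translate(self, record: Record) -> Record:
        # Fields with no mapping are dropped rather than guessed at.
        return {self.mapping[k]: v for k, v in record.items() if k in self.mapping}


class FederationFacility:
    """Central facility: providers register once; queries use shared concepts only."""

    def __init__(self) -> None:
        self._sources: List[tuple] = []

    def register(self, wrapper: Wrapper, translator: Translator) -> None:
        self._sources.append((wrapper, translator))

    def query(self, concept: str, value: str) -> List[Record]:
        hits = []
        for wrapper, translator in self._sources:
            for native in wrapper.fetch():
                record = translator.translate(native)
                if record.get(concept, "").lower() == value.lower():
                    hits.append({"archive": wrapper.name, **record})
        return hits


def build_demo_facility() -> FederationFacility:
    """Two hypothetical archives that describe the same variable differently."""
    facility = FederationFacility()
    facility.register(
        Wrapper("archive_a", lambda: [{"Parameter": "Soil Moisture", "DatasetTitle": "A1"}]),
        Translator({"Parameter": "observedProperty", "DatasetTitle": "title"}),
    )
    facility.register(
        Wrapper("archive_b", lambda: [{"measured_variable": "soil moisture", "name": "B7"}]),
        Translator({"measured_variable": "observedProperty", "name": "title"}),
    )
    return facility


if __name__ == "__main__":
    for hit in build_demo_facility().query("observedProperty", "soil moisture"):
        print(hit)
```

In this sketch neither archive changes its own metadata; adding a third archive would mean registering one more wrapper and translator pair with the facility.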
