A semantically enabled metadata repository for scientific data

The LASP Extended Metadata Repository (LEMR) is a semantically enabled repository of information (metadata) about the scientific datasets that LASP offers to the public. The repository lets us provide consistent, current, verified metadata to our users. It serves as a Single Source of Truth for this information, enabling more rigorous metadata management and addressing problems caused by duplicated information. The linked open data aspect of the repository allows concepts to be interlinked both within and across organizations and web sites. Associated interfaces allow users to browse and search the metadata. This information can be dynamically incorporated into web pages, so web page content stays up to date and consistent across the lab. From this information we can generate metadata records in a variety of schemas, such as ISO or SPASE, allowing federation with other organizations interested in our data. We leveraged open source technologies to build the repository and the dynamic web pages that read from it. VIVO, an open source semantic web application, provided key capabilities such as ontology and triple store management interfaces. AngularJS, an open source JavaScript framework for building dynamic web applications, was also invaluable in developing web pages that provide semantically enabled public interfaces to the metadata. In this paper we discuss our use of these tools and what we had to craft in order to meet our lab-specific needs.
