Mapping Geospatial Metadata to Open Provenance Model

This paper maps the data lineage entities in ISO19115 and ISO19115-2, the metadata standards of the International Organization for Standardization for geographic information and for imagery and gridded data [ISO geospatial metadata (GMD)], to the entities in open provenance model (OPM). The term “map” refers to establishing a correspondence between the said entities in ISO GMD and OPM. Presently, many geospatial data available in spatial data infrastructures (SDI) are described using ISO GMD. Its structure, however, makes tracing the provenance of these data a challenging task. OPM prioritizes causal relationships between things for capturing the workflow applied to particular data, making it easier to trace the data provenance. The mapping in this paper provides a convenient means to trace the provenance of data through the OPM causal relations and evaluate the fitness for use of these data, a necessary step toward data integration. This paper uses the notion of process to identify various data processing activities encoded in ISO GMD, the resource and the agent types involved in these activities, and state changes. A software prototype to carry out the mapping is developed. The mapping result is encoded in the resource description framework format to permit integral use of geospatial data in SDI and the data from the open data world. An exemplar metadata in ISO GMD from the National Oceanographic Data Center of the National Oceanic and Atmospheric Administration is used to demonstrate the feasibility to convert from the ISO GMD data lineage entities to the OPM entities.

[1]  Liping Di,et al.  Augmenting geospatial data provenance through metadata tracking in geospatial service chaining , 2010, Comput. Geosci..

[2]  Yogesh L. Simmhan,et al.  Special Section: The third provenance challenge on using the open provenance model for interoperability , 2011, Future Gener. Comput. Syst..

[3]  Bruce R. Barkstrom,et al.  A mathematical framework for earth science data provenance tracing , 2010, Earth Sci. Informatics.

[4]  X. Yang,et al.  An integrated view of data quality in Earth observation , 2013, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[5]  Wang Chiew Tan Provenance in Databases: Past, Current, and Future , 2007, IEEE Data Eng. Bull..

[6]  Emden R. Gansner,et al.  Graphviz - Open Source Graph Drawing Tools , 2001, GD.

[7]  Carole A. Goble,et al.  Taverna: a tool for building and running workflows of services , 2006, Nucleic Acids Res..

[8]  Jan Van den Bussche,et al.  Mapping the NRC Dataflow Model to the Open Provenance Model , 2008, IPAW.

[9]  Chathura Herath,et al.  Data provenance for preservation of digital geoscience data , 2011 .

[10]  Deborah L. McGuinness,et al.  Ontology-supported scientific data frameworks: The Virtual Solar-Terrestrial Observatory experience , 2009, Comput. Geosci..

[11]  David A. Bennett,et al.  Toward an understanding of provenance in complex land use dynamics , 2011 .

[12]  Eric G. Stephan,et al.  Leveraging the Open Provenance Model as a Multi-tier Model for Global Climate Research , 2010, IPAW.

[13]  Liping Di A Framework for Developing Web-Service-Based Intelligent Geospatial Knowledge Systems , 2005, Ann. GIS.

[14]  Yogesh L. Simmhan,et al.  The Open Provenance Model core specification (v1.1) , 2011, Future Gener. Comput. Syst..

[15]  Tomás Knap,et al.  W3P: Building an OPM based provenance model for the Web , 2011, Future Gener. Comput. Syst..

[16]  Yelena Yesha,et al.  Tracking provenance of earth science data , 2010, Earth Sci. Informatics.

[17]  Simon Miles,et al.  Mapping attribution metadata to the Open Provenance Model , 2011, Future Gener. Comput. Syst..

[18]  Sudha Ram,et al.  A Semantic Foundation for Provenance Management , 2012, Journal on Data Semantics.

[19]  Alan E. Strong,et al.  Remote sensing of sea surface temperatures during 2002 Barrier Reef coral bleaching , 2003 .

[20]  Daniel Crawl,et al.  Workflows and extensions to the Kepler scientific workflow system to support environmental sensor data access and analysis , 2010, Ecol. Informatics.

[21]  James D. Myers,et al.  A provenance-aware virtual sensor system using the Open Provenance Model , 2010, 2010 International Symposium on Collaborative Technologies and Systems.

[22]  Robert Stevens,et al.  Annotating, Linking and Browsing Provenance Logs for {e-Science} , 2003 .

[23]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[24]  강문설 [서평]「The Unified Modeling Language User Guide」 , 1999 .

[25]  Yogesh L. Simmhan,et al.  A survey of data provenance in e-science , 2005, SGMD.