Ontology engineering in provenance enablement for the National Climate Assessment

The National Climate Assessment of the U.S. Global Change Research Program (USGCRP) analyzes and presents the impacts of climate change on the United States. The provenance information in the assessment is important because the assessment findings are of great public and academic concern and are used in policy and decision-making. By applying a use case-driven iterative methodology, we developed information models and ontology to represent the content structure of the recent National Climate Assessment draft report and its associated provenance information. We tested the ontology by using it in pilot systems serving information about instances of chapters, scientific findings, figures, tables, images, datasets, references, people, and organizations, etc. in the draft report, as well as interrelationships among those instances. The results successfully help users trace provenance in the draft report, such as finding all the journal articles from which a figure in the report was derived. The provenance information in our work was maintained in the context of the "Web of Data". In addition to the pilot systems we developed, other tools and services are also able to retrieve and utilize the provenance information. Our work is part of a Global Change Information System coordinated by the USGCRP that will eventually cover provenance information for the entire scope of global change research. Such a system will greatly increase understanding, credibility and trust in the global change research and foster reproducibility of scientific results and conclusions. Reviewed the use case-driven iterative method and compared it with other methods.Introduced the method's feature in bridging up people for knowledge engineering.Built a provenance-explicit ontology for the U.S. National Climate Assessment.Built provenance-tracing services to support internal and external applications.

[1]  Andrew R. Maffei,et al.  A Framework for Integrating Oceanographic Data Repositories , 2010 .

[2]  Chris Hope,et al.  Environmental information for all : The need of a monthly index , 1990 .

[3]  Paul T. Groth,et al.  Automatic Metadata Annotation through Reconstructing Provenance , 2012, SWPM@ESWC.

[4]  Yolanda Gil,et al.  PROV-DM: The PROV Data Model , 2013 .

[5]  James A. Hendler,et al.  Semantic escience: encoding meaning in next-generation digitally enhanced science , 2009, The Fourth Paradigm.

[6]  Paul T. Groth,et al.  PrIMe: A methodology for developing provenance-aware applications , 2011, TSEM.

[7]  Aldo Gangemi,et al.  Ontology Design Patterns , 2005 .

[8]  Chen-Chieh Feng Mapping Geospatial Metadata to Open Provenance Model , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[9]  Mikael Lindvall,et al.  An introduction to agile methods , 2004, Adv. Comput..

[10]  Sanjeev Khanna,et al.  Data Provenance: Some Basic Issues , 2000, FSTTCS.

[11]  Deborah L. McGuinness,et al.  A proof markup language for Semantic Web services , 2006, Inf. Syst..

[12]  G Stix,et al.  The mice that warred. , 2001, Scientific American.

[13]  Deborah L. McGuinness,et al.  Ontology-supported scientific data frameworks: The Virtual Solar-Terrestrial Observatory experience , 2009, Comput. Geosci..

[14]  Matthew B. Jones,et al.  Challenges and Opportunities of Open Data in Ecology , 2011, Science.

[15]  Luc Moreau,et al.  PROV-Overview. An Overview of the PROV Family of Documents , 2013 .

[16]  Yogesh L. Simmhan,et al.  A survey of data provenance in e-science , 2005, SGMD.

[17]  Alexey Voinov,et al.  Systems Science and Modeling for Ecological Economics , 2008 .

[18]  Jeff Dozier,et al.  Environmental Informatics , 2012 .

[19]  Cecelia DeLuca,et al.  Toward self-describing and workflow integrated Earth system models: A coupled atmosphere-ocean modeling system application , 2013, Environ. Model. Softw..

[20]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[21]  Joep Crompvoets,et al.  Geographic Information Science at the Heart of Europe , 2013, AGILE Conf..

[22]  Andrea Emilio Rizzoli,et al.  Modelling with knowledge: A review of emerging semantic approaches to environmental modelling , 2009, Environ. Model. Softw..

[23]  James Hendler,et al.  Science and the Semantic Web , 2003, Science.

[24]  Kurt Bittner,et al.  Use Case Modeling , 2002 .

[25]  Andreas Harth,et al.  Weaving the Pedantic Web , 2010, LDOW.

[26]  Alexey Voinov Systems science and modeling for ecological , 2015 .

[27]  Liping Di,et al.  Augmenting geospatial data provenance through metadata tracking in geospatial service chaining , 2010, Comput. Geosci..

[28]  James A. Edmonds,et al.  Climate Change Impacts for the Conterminous USA: An Integrated Assessment: From Mink to the ‘Lower 48’ , 2005 .

[29]  Yuanzheng Shao,et al.  Implementation of Geospatial Data Provenance in a Web Service Workflow Environment With ISO 19115 and ISO 19115-2 Lineage Model , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[30]  G. Yohe,et al.  Climate Change Impacts in the United States: The Third National Climate Assessment , 2014 .

[31]  Stephan Mäs,et al.  Provenance Information in Geodata Infrastructures , 2013, AGILE Conf..

[32]  Robert Meersman,et al.  Data modelling versus ontology engineering , 2002, SGMD.

[33]  Deborah L. McGuinness,et al.  PROV-O: The PROV Ontology , 2013 .

[34]  Deborah L. McGuinness,et al.  Provenance Representation for the National Climate Assessment in the Global Change Information System , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[35]  Xiaogang Ma,et al.  Capturing provenance of global change information , 2014 .

[36]  Deborah L. McGuinness,et al.  Provenance Aware Faceted Search , 2010 .

[37]  Rik Van de Walle,et al.  Automatic Discovery of High-Level Provenance Using Semantic Similarity , 2012, IPAW.

[38]  Yelena Yesha,et al.  Tracking provenance of earth science data , 2010, Earth Sci. Informatics.

[39]  François Bousquet,et al.  Modelling with stakeholders , 2010, Environ. Model. Softw..

[40]  Yogesh L. Simmhan,et al.  The Open Provenance Model core specification (v1.1) , 2011, Future Gener. Comput. Syst..

[41]  María Martín,et al.  The Gene Ontology: enhancements for 2011 , 2011, Nucleic Acids Res..

[42]  Vassilios Peristeras,et al.  Linked Open Government Data [Guest editors' introduction] , 2012, IEEE Intell. Syst..

[43]  Sibel Adali,et al.  Semantically Enabling Knowledge Representation of Metamorphic Petrology Data , 2012 .

[44]  Paul T. Groth Transparency and Reliability in the Data Supply Chain , 2013, IEEE Internet Computing.

[45]  José Maria Parente de Oliveira,et al.  Concept maps as the first step in an ontology construction method , 2013, Inf. Syst..

[46]  Boris Kompare,et al.  Environmental Modelling & Software , 2014 .

[47]  Mary C. Hill,et al.  Integrated environmental modeling: A vision and roadmap for the future , 2013, Environ. Model. Softw..

[48]  Elena Paslaru Bontas Simperl,et al.  Reusing ontologies on the Semantic Web: A feasibility study , 2009, Data Knowl. Eng..

[49]  Sven Apel,et al.  Types and modularity for implicit invocation with implicit announcement , 2010, TSEM.