Modeling heterogeneous data resources for social-ecological research: a data-centric perspective

Digital repositories are grappling with an influx of scientific data brought about by the well publicized "data deluge" in science, business, and society. One particularly perplexing problem is the long-term archival and reuse of complex data sets. This paper presents an integrated approach to data discovery over heterogeneous data resources in social-ecological systems research. Social-ecological systems data is complex because the research draws from both social and natural sciences. Using a sample set of data resources from the domain, we explore an approach to discovery and representation of this data. Specifically, we develop an ontology-based process of organization and visualization from a data-centric perspective. We define data resources broadly and identify six key categories of resources that include data collected from site visits to shared ecological resources, the structure of research instruments, domain concepts, research designs, publications, theories and models. We identify the underlying relationships and construct an ontology that captures these relationships using semantic web languages. The ontology and a NoSQL data store at the back end store the data resource instances. These are integrated into a portal architecture we refer to as the Integrated Visualization of Social-Ecological Resources (IViSER) that allows users to both browse the relationships captured in the ontology and easily visualize the granular details of data resources.

[1]  Sean Bechhofer,et al.  Research Objects: Towards Exchange and Reuse of Digital Knowledge , 2010 .

[2]  Matthew Jones,et al.  Maximizing the Value of Ecological Data with Structured Metadata: An Introduction to Ecological Metadata Language (EML) and Principles for Metadata Creation , 2005 .

[3]  Pascal Heus,et al.  Data Documentation Initiative: Toward a Standard for the Social Sciences , 2008, Int. J. Digit. Curation.

[4]  Jun Zhao,et al.  Describing Linked Datasets On the Design and Usage of voiD, the "Vocabulary Of Interlinked Datasets" , 2009 .

[5]  E. Ostrom A General Framework for Analyzing Sustainability of Social-Ecological Systems , 2009, Science.

[6]  Scott Jensen,et al.  Generalized representation and mapping for social-ecological data: Freeing data from the database , 2012, 2012 IEEE 8th International Conference on E-Science.

[7]  Herbert Van de Sompel,et al.  Adding eScience Assets to the Data Web , 2009, ArXiv.

[8]  Matthew S. Mayernik,et al.  From artifacts to aggregations: Modeling scientific life cycles on the semantic Web , 2010, J. Assoc. Inf. Sci. Technol..

[9]  Jane Hunter,et al.  LORE: A Compound Object Authoring and Publishing Tool for the Australian Literature Studies Community , 2008, ICADL.

[10]  Joachim Wackerow,et al.  Leveraging the DDI Model for Linked Statistical Data in the Social, Behavioural, and Economic Sciences , 2012, Dublin Core Conference.

[11]  Arun Agrawal,et al.  Trade-offs and synergies between carbon storage and livelihood benefits from forest commons , 2009, Proceedings of the National Academy of Sciences.

[12]  E. Ostrom A diagnostic approach for going beyond panaceas , 2007, Proceedings of the National Academy of Sciences.