Generalized representation and mapping for social-ecological data: Freeing data from the database

Scientific discovery increasingly requires collaboration between scientific sub-domains that often have different representations for their data. To bridge gaps between varying domain representations, researchers are developing metadata and semantic representations meaningful to broader communities. Through exploiting these representations we propose a logical model and architecture by which cross-domain researchers can more easily discover, use, and eventually archive, data. In this paper we present an architecture, intermediate data model, and methodology for mapping diverse social-ecological data sources stored in relational databases to a common representation, and for classifying textual data using machine learning. The results are visualized through client views that are built against the general logical model, and applied against a longitudinal database from social-ecological research.

[1]  C. Bizer,et al.  D2R MAP - A Database to RDF Mapping Language , 2003, WWW.

[2]  E. Ostrom,et al.  Coping with Asymmetries in the Commons: Self-Governing Irrigation Systems Can Work , 1993 .

[3]  E. Ostrom A diagnostic approach for going beyond panaceas , 2007, Proceedings of the National Academy of Sciences.

[4]  Amy R. Poteete,et al.  Working Together: Collective Action, the Commons, and Multiple Methods in Practice , 2010 .

[5]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[6]  Christian Bizer,et al.  D2R Server - Publishing Relational Databases on the Semantic Web , 2004 .

[7]  E. Ostrom,et al.  Fourteen Years of Monitoring Community-Managed Forests: Learning from IFRI's Experience , 2007 .

[8]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[9]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[10]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[11]  A. Agrawal,et al.  Forest commons and local enforcement , 2008, Proceedings of the National Academy of Sciences.

[12]  E. Ostrom A General Framework for Analyzing Sustainability of Social-Ecological Systems , 2009, Science.

[13]  Devarshi Ghoshal,et al.  Evaluation of Two XML Storage Approaches for Scientific Metadata , 2011 .

[14]  Keishi Tajima,et al.  Archiving scientific data , 2002, SIGMOD '02.

[15]  Susan T. Dumais,et al.  Hierarchical classification of Web content , 2000, SIGIR '00.

[16]  Arvind Malhotra,et al.  XML Schema Part 2: Datatypes Second Edition , 2004 .

[17]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.