An integrated system for publishing environmental observations data

Over the next decade, it is likely that science and engineering research will produce more scientific data than has been created over the whole of human history. The successful use of these data to achieve new scientific breakthroughs will depend on the ability to access, integrate, and analyze these large datasets. Robust data organization and publication methods are needed within the research community to enable data discovery and scientific analysis by researchers other than those who collected the data. We present a new method for publishing research datasets consisting of point observations. The method employs a standard observations data model, populated using controlled vocabularies for environmental and water resources data, along with web services for transmitting data to consumers. We describe how these components have reduced the syntactic and semantic heterogeneity in the data assembled within a national network of environmental observatory test beds, and how this data publication system has been used to create a federated network of consistent research data out of a set of geographically decentralized and autonomous test bed databases.
