Sustainable long term scientific data publication: Lessons learned from a prototype Observatory Information System for the Illinois River Basin

In 2005 a prototype Observatory Information System (OIS) was developed for the Illinois River Basin Observatory (IRBO), connected to a federated scientific data network, populated with a representative collection of legacy datasets, and linked to external data streams. The perspective of seven years' time and the disestablishment of the system provide an opportunity to study the system life cycle. We detail best practices for multi-level OIS design for long-term performance, based on a publication-mandatory metadata implementation standard using ISO-19115. These principles balance general users' requirements against the requirements of specific scientific applications, and maximize the system's capacity to deal with legacy and heterogeneous data sources, enhancing long-term sustainability and flexibility for diverse multi-level user groups. These findings are relevant to ongoing developments of networked Scientific Information Systems that are increasingly critical to support and sustain the long-term benefits of modeling and observatory science.

[1]  John Kunze,et al.  DataONE: Data Observation Network for Earth - Preserving Data and Enabling Innovation in the Biological and Environmental Sciences , 2011, D Lib Mag..

[2]  Helena Karasti,et al.  Enriching the Notion of Data Curation in E-Science: Data Managing and Information Infrastructuring in the Long Term Ecological Research (LTER) Network , 2006, Computer Supported Cooperative Work (CSCW).

[3]  Silvana Castano,et al.  Semantic integration of heterogeneous information sources , 2001, Data Knowl. Eng..

[4]  William K. Michener,et al.  Living in an increasingly connected world: a framework for continental-scale environmental science , 2008 .

[5]  F. Braudel,et al.  French historical method. The `Annales' paradigm , 1978, Medical History.

[6]  Daniel Atkins,et al.  Revolutionizing Science and Engineering Through Cyberinfrastructure: Report of the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure , 2003 .

[7]  Kristin Vanderbilt,et al.  Long term ecological research and information management , 2011, Ecol. Informatics.

[8]  Stefano Nativi,et al.  Environmental model access and interoperability: The GEO Model Web initiative , 2013, Environ. Model. Softw..

[9]  G. Randy Keller,et al.  GEON (GEOscience Network): A First Step in Creating Cyberinfrastructure for the Geosciences , 2003 .

[10]  Jeffery S. Horsburgh,et al.  Hydrologic data access using web services , 2006 .

[11]  William K. Michener,et al.  Ecological Data: Design, Management and Processing , 2000 .

[12]  Andrea Emilio Rizzoli,et al.  Model and data integration and re-use in environmental decision support systems , 1998, Decis. Support Syst..

[13]  P. Bryan Heidorn,et al.  National Biological Information Infrastructure (NBII) , 2010 .

[14]  William K. Michener,et al.  NONGEOSPATIAL METADATA FOR THE ECOLOGICAL SCIENCES , 1997 .

[15]  Noel Enyedy,et al.  Little science confronts the data deluge: habitat ecology, embedded sensor networks, and digital libraries , 2007, International Journal on Digital Libraries.

[16]  Anthony Aufdenkampe,et al.  Data Infrastructure for the Critical Zone Observatories ( CZOData ) : an EarthCube Design Prototype , 2011 .

[17]  Michael Piasecki,et al.  A semantic annotation tool for hydrologic sciences , 2009, Earth Sci. Informatics.

[18]  Michael Piasecki,et al.  Engineering new paths to water data , 2009, Comput. Geosci..

[19]  William K. Michener,et al.  Meta-information concepts for ecological data management , 2006, Ecol. Informatics.

[20]  Jiří Horák,et al.  Web services for distributed and interoperable hydro-information systems , 2007 .

[21]  Jeffery S. Horsburgh,et al.  An integrated system for publishing environmental observations data , 2009, Environ. Model. Softw..

[22]  David R. Maidment,et al.  User Needs Assessment, Chapter 4 , 2005 .

[23]  Yasmin B. Kafai,et al.  Social aspects of digital libraries , 1995 .

[24]  Ilya Zaslavsky 1 Archiving Spatial Data : Research Issues , 2001 .

[25]  Robert M. Colomb Impact of Semantic Heterogeneity and Federating Databases , 1997, Comput. J..

[26]  Peter Bajcsy,et al.  Hydroinformatics: Data Integrative Approaches in Computation, Analysis, and Modeling , 2005 .

[27]  Clemente Izurieta,et al.  A centralized tool for managing, archiving, and serving point-in-time data in ecological research laboratories , 2014, Environ. Model. Softw..

[28]  Stuart E. Madnick,et al.  Improving data quality through effective use of data semantics , 2006, Data Knowl. Eng..

[29]  Benjamin L. Ruddell,et al.  Hydrologic Data Models , 2005 .

[30]  Yasmin B. Kafai,et al.  Social Aspects of Digital Libraries. Final Report to the National Science Foundation , 1996 .

[31]  John Helly Digital Library Technology for Hydrology , 2005 .

[32]  Jeffery S. Horsburgh,et al.  Components of an environmental observatory information system , 2011, Comput. Geosci..

[33]  Jeffery S. Horsburgh,et al.  A relational model for environmental and water resources data , 2008 .

[34]  Scott D. Peckham,et al.  A component-based approach to integrated modeling in the geosciences: The design of CSDMS , 2013, Comput. Geosci..

[35]  Jane Hunter,et al.  Providing online access to hydrological model simulations through interactive geospatial animations , 2013, Environ. Model. Softw..

[36]  Stefano Nativi,et al.  The Brokering Approach for Multidisciplinary Interoperability: A Position Paper , 2012, Int. J. Spatial Data Infrastructures Res..

[37]  Jeffery S. Horsburgh,et al.  A first approach to web services for the National Water Information System , 2008, Environ. Model. Softw..

[38]  T. Todd Elvins,et al.  A method for interoperable digital libraries and data repositories , 1999, Future Gener. Comput. Syst..

[39]  Chaitanya K. Baru,et al.  The GEON portal: accelerating knowledge discovery in the geosciences , 2006, WIDM '06.

[40]  Jeffery S. Horsburgh,et al.  Hydroserver: A Platform for Publishing Space-Time Hydrologic Datasets , 2010 .

[41]  David R. Maidment,et al.  Accessing and sharing data using CUAHSI Water Data Services. , 2009 .

[42]  Gail Hodge,et al.  Digital Preservation and Permanent Access to Scientific Information: The State of the Practice , 2004 .