Documenting, storing, and executing models in Ecology: A conceptual framework and real implementation in a global change monitoring program

Many of the best practices for developing ecological models and analytical techniques that are published in the scientific literature are not fully available to modelers; instead, they are stored in scientists' digital or biological memories. We argue that it is time to address the problem of storing, documenting, and executing ecological models and analytical procedures. In this paper, we propose a conceptual framework for designing and implementing a web application that helps to meet this challenge. Such a tool will foster cooperation among scientists and enhance the creation of relevant knowledge that can be transferred to environmental managers. We have implemented this conceptual framework in a tool called ModeleR. ModeleR is being used to document, share, and execute more than 200 models and analytical processes associated with a global change monitoring program under way in the Sierra Nevada mountains (southern Spain). ModeleR uses the concept of the scientific workflow to connect and execute different types of models and analytical processes. Finally, we envision the creation of a federation of model repositories in which models documented within a local repository could be linked to, and even executed by, other researchers.
