Toward Open Science at the European Scale: Geospatial Semantic Array Programming for Integrated Environmental Modelling

Excerpt: Interfacing science and policy raises challenging issues when environmental problems at large spatial scales (regional, continental, global) need transdisciplinary integration in a context of modelling complexity and multiple sources of uncertainty. This is characteristic of science-based support for environmental policy at the European scale, and key aspects have long been investigated by European Commission transnational research. Approaches (whether of computational science or of policy-making) that are suitable at a given domain-specific scale may not be appropriate for wide-scale transdisciplinary modelling for environment (WSTMe) and the corresponding policy-making. In WSTMe, the characteristic heterogeneity of the available spatial information and the complexity of the required data-transformation modelling (D-TM) call for a paradigm shift in how computational science supports such extensive integration processes. In particular, wide-scale integration may require currently available domain-specific modelling strategies to offer increased robustness and scalability, along with enhanced transparency and reproducibility. This challenging shift toward open data and reproducible research (open science) is also strongly suggested by the potentially huge, and sometimes neglected, impact of errors cascading through the rapidly growing interconnection among domain-specific computational models and frameworks. Concise array-based mathematical formulation and implementation (with array programming tools) have proved helpful in mitigating the complexity of WSTMe when complemented with generalized modularization and terse array-oriented semantic constraints. This defines the paradigm of Semantic Array Programming (SemAP), in which semantic transparency also implies the use of free software (although black boxes, e.g. legacy code, might easily be semantically interfaced).
A new approach for WSTMe has emerged by formalizing unorganized best practices and experience-driven informal patterns. The approach introduces a lightweight (non-intrusive) integration of SemAP and geospatial tools, called Geospatial Semantic Array Programming (GeoSemAP). GeoSemAP exploits the joint semantics provided by SemAP and geospatial tools to split a complex D-TM into logical blocks that are easier to check by means of mathematical array-based and geospatial constraints. These constraints take the form of precondition, invariant and postcondition semantic checks. In this way, even complex WSTMe may be described as the composition of simpler GeoSemAP blocks. GeoSemAP allows intermediate data and information layers to be semantically described more easily and formally, so as to increase the fault-tolerance, transparency and reproducibility of WSTMe. This might also help to better communicate part of the policy-relevant knowledge, which is often difficult to transfer from technical WSTMe to the science-policy interface. [...]
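
As an illustration only, the idea of splitting a D-TM into blocks guarded by precondition, invariant and postcondition checks might be sketched as follows. This is not the Mastrave or GeoSemAP implementation: the names `SemanticError`, `check`, `normalise_block` and `mask_block` are hypothetical, plain Python lists stand in for arrays, and "same grid shape" is assumed here as a simple proxy for a geospatial constraint such as matching extent and resolution.

```python
from typing import List

Grid = List[List[float]]  # a raster layer stored as rows of cell values


class SemanticError(ValueError):
    """Raised when a semantic constraint on a data layer is violated."""


def check(condition: bool, message: str) -> None:
    """Fail early, with an explicit message, when a constraint does not hold."""
    if not condition:
        raise SemanticError(message)


def is_rectangular(grid: Grid) -> bool:
    """True if the grid is non-empty and all rows have the same length."""
    return (len(grid) > 0 and len(grid[0]) > 0
            and all(len(row) == len(grid[0]) for row in grid))


def same_shape(a: Grid, b: Grid) -> bool:
    """True if both grids have the same number of rows and columns."""
    return len(a) == len(b) and all(len(ra) == len(rb) for ra, rb in zip(a, b))


def normalise_block(layer: Grid) -> Grid:
    """Hypothetical D-TM block: rescale a raster layer to the range [0, 1]."""
    # Preconditions: well-formed, non-constant input.
    check(is_rectangular(layer), "pre: layer must be a non-empty rectangular grid")
    cells = [v for row in layer for v in row]
    lo, hi = min(cells), max(cells)
    check(hi > lo, "pre: layer must not be constant")

    out = [[(v - lo) / (hi - lo) for v in row] for row in layer]

    # Invariant: the block must not change the grid shape.
    check(same_shape(layer, out), "inv: grid shape must be preserved")
    # Postcondition: every output cell lies in [0, 1].
    check(all(0.0 <= v <= 1.0 for row in out for v in row),
          "post: values must lie in [0, 1]")
    return out


def mask_block(layer: Grid, mask: Grid) -> Grid:
    """Hypothetical D-TM block: keep cells where mask is 1, zero the others."""
    # Preconditions: both layers share one grid and the mask is binary.
    check(is_rectangular(layer) and same_shape(layer, mask),
          "pre: layer and mask must share the same grid")
    check(all(m in (0, 1) for row in mask for m in row),
          "pre: mask must contain only 0 and 1")

    out = [[v * m for v, m in zip(row, mask_row)]
           for row, mask_row in zip(layer, mask)]

    # Invariant: the block must not change the grid shape.
    check(same_shape(layer, out), "inv: grid shape must be preserved")
    return out


# A larger transformation expressed as the composition of two checked blocks.
elevation = [[100.0, 200.0], [300.0, 500.0]]
study_area = [[1, 0], [1, 1]]
print(mask_block(normalise_block(elevation), study_area))
```

Because each block validates its own inputs and outputs, a malformed intermediate layer (for example a ragged grid or a non-binary mask) is rejected at the block boundary with an explicit message rather than propagating silently to downstream blocks.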
