A decade of RAPID—Reflections on the development of an open source geoscience code

Earth science increasingly relies on computer‐based methods and many government agencies now require further sharing of the digital products they helped fund. Earth scientists, while often supportive of more transparency in the methods they develop, are concerned by this recent requirement and puzzled by its multiple implications. This paper therefore presents a reflection on the numerous aspects of sharing code and data in the general field of computer modeling of dynamic Earth processes. Our reflection is based on 10 years of development of an open source model called the Routing Application for Parallel Computation of Discharge (RAPID) that simulates the propagation of water flow waves in river networks. Three consecutive but distinct phases of the sharing process are highlighted here: opening, exposing, and consolidating. Each one of these phases is presented as an independent and tractable increment aligned with the various stages of code development and justified based on the size of the users community. Several aspects of digital scholarship are presented here including licenses, documentation, websites, citable code and data repositories, and testing. While the many existing services facilitate the sharing of digital research products, digital scholarship also raises community challenges related to technical training, self‐perceived inadequacy, community contribution, acknowledgment and performance assessment, and sustainable sharing.

[1]  David R. Maidment,et al.  Conceptual Framework for the National Flood Interoperability Experiment , 2017 .

[2]  Jeffery S. Horsburgh,et al.  HydroShare: Sharing Diverse Environmental Data Types and Models as Social Objects with Application to the Hydrology Domain , 2016 .

[3]  Zong‐Liang Yang,et al.  A GIS Framework for Regional Modeling of Riverine Nitrogen Transport: Case Study, San Antonio and Guadalupe Basins , 2016 .

[4]  Cédric H. David,et al.  Development and evaluation of a physically-based lake level model for water resource management: A case study for Lake Buchanan, Texas , 2015 .

[5]  V. Eijkhout,et al.  Enhanced fixed‐size parallel speedup with the Muskingum method using a trans‐boundary approach and a large subbasins approximation , 2015 .

[6]  E. Martin,et al.  Evaluation of Regional-Scale River Depth Simulations Using Various Routing Schemes within a Hydrometeorological Modeling Framework for the Preparation of the SWOT Mission , 2015 .

[7]  Tony Hey,et al.  Open science decoded , 2015, Nature Physics.

[8]  C. Strasser,et al.  Researcher Perspectives on Publication and Peer Review of Data , 2014, PloS one.

[9]  David G. Tarboton A Conceptual Framework for the National Flood Interoperability Experiment (NFIE) , 2015 .

[10]  Steve Easterbrook,et al.  Open code for open science , 2014 .

[11]  More bang for your byte , 2014, Scientific data.

[12]  B. Minsker,et al.  Real-Time Water Decision Support Services For Droughts , 2014 .

[13]  Code share , 2014, Nature.

[14]  Cédric H. David RAPID v1.4.0 , 2013 .

[15]  I Onwuekwe,et al.  Author's response , 2013, Annals of medical and health sciences research.

[16]  W. Collins,et al.  The Community Earth System Model: A Framework for Collaborative Research , 2013 .

[17]  Matthew G. Knepley,et al.  PETSc Users Manual (Rev. 3.3) , 2013 .

[18]  J. Famiglietti,et al.  Quantification of the upstream‐to‐downstream influence in the Muskingum method and implications for speedup in parallel computations of river flow , 2013 .

[19]  Cédric H. David,et al.  Regional-scale river flow modeling using off-the-shelf runoff products, thousands of mapped rivers and hundreds of stream flow gauges , 2013, Environ. Model. Softw..

[20]  Scott D. Peckham,et al.  A component-based approach to integrated modeling in the geosciences: The design of CSDMS , 2013, Comput. Geosci..

[21]  J. Holdren Memorandum for the Heads of Executive Departments and Agencies: Increasing Access to the Results of Federally Funded Scientific Research , 2013 .

[22]  Cédric H. David RAPID v1.3.0 , 2013 .

[23]  Zong-Liang Yang,et al.  Technical description of version 4.5 of the Community Land Model (CLM) , 2013 .

[24]  C. Fouquet,et al.  Hybrid fitting of a hydrosystem model: Long‐term insight into the Beauce aquifer functioning (France) , 2012 .

[25]  E. Martin,et al.  Assessing the water balance of the Upper Rhine Graben hydrosystem , 2012 .

[26]  Thom Dunning The Future of Scientific Computing , 2012 .

[27]  Darrel C. Ince,et al.  The case for open computer programs , 2012, Nature.

[28]  M. Ek,et al.  Continental‐scale water and energy flux analysis and validation for North American Land Data Assimilation System project phase 2 (NLDAS‐2): 2. Validation of model‐simulated streamflow , 2012 .

[29]  R. Peng Reproducible Research in Computational Science , 2011, Science.

[30]  F. Habets,et al.  RAPID input and output files corresponding to "RAPID Applied to the SIM-France Model" , 2011 .

[31]  F. Habets,et al.  RAPID applied to the SIM‐France model , 2011 .

[32]  V. Eijkhout,et al.  RAPID input and output files corresponding to "River Network Routing on the NHDPlus Dataset" , 2011 .

[33]  Victor Eijkhout,et al.  River Network Routing on the NHDPlus Dataset , 2011 .

[34]  Cédric H. David RAPID v1.2.0 , 2011 .

[35]  A. Ducharne,et al.  Modeling the impact of in-stream water level fluctuations on stream-aquifer interactions at the regional scale , 2011 .

[36]  S. Kanae,et al.  A physically based description of floodplain inundation dynamics in a global river routing model , 2011 .

[37]  Cédric H. David RAPID v1.1.0 , 2011 .

[38]  Cacm Staff Science has four legs , 2010 .

[39]  Nick Barnes Publish your computer code: it is good enough , 2010, Nature.

[40]  Cédric H. David RAPID v1.0.0 , 2010 .

[41]  Moshe Y. Vardi Science has only two legs , 2010, Commun. ACM.

[42]  Richard J. Lipton,et al.  Too Much Debate? Author's reply , 2010 .

[43]  Tony Hey Science Has Four Legs , 2010 .

[44]  Y. He,et al.  Simulating hydrologic and hydraulic processes throughout the Amazon River Basin , 2009 .

[45]  Gordon Bell,et al.  Beyond the Data Deluge , 2009, Science.

[46]  Anthony J. G. Hey,et al.  Jim Gray on eScience: a transformed scientific method , 2009, The Fourth Paradigm.

[47]  T. Wilbanks,et al.  Contribution of Working Group II to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change , 2007 .

[48]  Lawrence Rosen,et al.  Open Source Licensing: Software Freedom and Intellectual Property Law , 2004 .

[49]  Lifeng Luo,et al.  Streamflow and water balance intercomparisons of four land surface models in the North American Land Data Assimilation System Project , 2004 .

[50]  CLIMATE AND THE OCEAN CIRCULATION’ 1. THE ATMOSPHERIC CIRCULATION AND THE HYDROLOGY OF THE EARTH’S SURFACE , 2004 .

[51]  Development of a parallel river transport algorithm and applications to climate studies , 2001 .

[52]  P. Bates,et al.  A simple raster-based model for flood inundation simulation , 2000 .

[53]  P. D. Batesa,et al.  A simple raster-based model for flood inundation simulation , 2000 .

[54]  Arnold Robbins,et al.  Linux in a Nutshell , 1997 .

[55]  William Gropp,et al.  Efficient Management of Parallelism in Object-Oriented Numerical Software Libraries , 1997, SciTools.

[56]  Dag Lohmann,et al.  A large‐scale horizontal routing model to be coupled to land surface parametrization schemes , 1996 .

[57]  G. Russell,et al.  Continental-Scale River Flow in Climate Models , 1994 .

[58]  Message P Forum,et al.  MPI: A Message-Passing Interface Standard , 1994 .

[59]  Russ Rew,et al.  NetCDF: an interface for scientific data access , 1990, IEEE Computer Graphics and Applications.

[60]  David Bainbridge,et al.  The copyright act , 1989 .

[61]  A. Anthes Summary of a Workshop on the NCAR Community Climate Forecast Models 14-26 July 1985, Boulder, Colorado , 1986 .

[62]  L. Williamson,et al.  Description of NCAR Community Climate Model (CCM0B) , 1983 .

[63]  Jens Kattge,et al.  Carrots and sticks. , 2014, Newsweek.

[64]  S. Manabe CLIMATE AND THE OCEAN CIRCULATION1 , 1969 .

[65]  K. Bryan,et al.  A numerical investigation of the oceanic general circulation , 1967 .

[66]  Herman H. Goldstine The Future of Scientific Computing , 1966 .

[67]  Norman A. Phillips,et al.  The general circulation of the atmosphere: A numerical experiment , 1956 .