Software and the Scientist: Coding and Citation Practices in Geodynamics

In geodynamics as in other scientific areas, computation has become a core component of research, complementing field observation, laboratory analysis, experiment, and theory. Computational tools for data analysis, mapping, visualization, modeling, and simulation are essential for all aspects of the scientific workflow. Specialized scientific software is often developed by geodynamicists for their own use, and this effort represents a distinctive intellectual contribution. Drawing on a geodynamics community that focuses on developing and disseminating scientific software, we assess the current practices of software development and attribution, as well as attitudes about the need and best practices for software citation. We analyzed publications by participants in the Computational Infrastructure for Geodynamics and conducted mixed method surveys of the solid earth geophysics community. From this we learned that coding skills are typically learned informally. Participants considered good code as trusted, reusable, readable, and not overly complex and considered a good coder as one that participates in the community in an open and reasonable manor contributing to both long‐ and short‐term community projects. Participants strongly supported citing software reflected by the high rate a software package was named in the literature and the high rate of citations in the references. However, lacking are clear instructions from developers on how to cite and education of users on what to cite. In addition, citations did not always lead to discoverability of the resource. A unique identifier to the software package itself, community education, and citation tools would contribute to better attribution practices.

[1]  Bradford H. Hager,et al.  Conman: vectorizing a finite element code for incompressible two-dimensional convection in the Earth's mantle , 1990 .

[2]  Walter H. F. Smith,et al.  Free software helps map and display data , 1991 .

[3]  Accessing northern California earthquake data via Internet , 1994 .

[4]  Walter H. F. Smith,et al.  New version of the generic mapping tools , 1995 .

[5]  William Gropp,et al.  Efficient Management of Parallelism in Object-Oriented Numerical Software Libraries , 1997, SciTools.

[6]  Hans Petter Langtangen,et al.  Modern Software Tools for Scientific Computing , 1997, Birkhäuser Boston.

[7]  Walter H. F. Smith,et al.  New, improved version of generic mapping tools released , 1998 .

[8]  Kerstin Lehnert,et al.  A global geochemical database structure for rocks , 2000 .

[9]  William Hung Kan Lee,et al.  International handbook of earthquake and engineering seismology , 2002 .

[10]  Peter Goldstein,et al.  85.5 SAC2000: Signal processing and analysis tools for seismologists and engineers , 2003 .

[11]  Jack Dongarra,et al.  Computational Science: Ensuring America's Competitiveness , 2005 .

[12]  P Goldstein,et al.  SAC Availability for the IRIS Community , 2005 .

[13]  Greg Wilson,et al.  Software Carpentry: Getting Scientists to Write Better Code by Making Them More Productive , 2006, Computing in Science & Engineering.

[14]  W. Bangerth,et al.  deal.II—A general-purpose object-oriented finite element library , 2007, TOMS.

[15]  M. Hochberg,et al.  Author Sequence and Credit for Contributions in Multiauthored Publications , 2007, PLoS biology.

[16]  Jeffrey C. Carver,et al.  Understanding the High-Performance-Computing Community: A Software Engineer's Perspective , 2008, IEEE Software.

[17]  Diane Kelly,et al.  Five Recommended Practices for Computational Scientists Who Write Software , 2009, Computing in Science & Engineering.

[18]  Janice Singer,et al.  How do scientists develop and use scientific software? , 2009, 2009 ICSE Workshop on Software Engineering for Computational Science and Engineering.

[19]  Christopher J. Roy,et al.  Verification and Validation in Scientific Computing , 2010 .

[20]  Richard Van Noorden,et al.  Metrics: Do metrics matter? , 2010, Nature.

[21]  J. Orman Understanding the Building Blocks of the Planet , 2010 .

[22]  Z. Merali Computational science: ...Error , 2010, Nature.

[23]  J. Sahel Quality Versus Quantity: Assessing Individual Research Performance , 2011, Science Translational Medicine.

[24]  Feng Liu,et al.  A survey of the practice of computational science , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[25]  Martin Kronbichler,et al.  High accuracy mantle convection simulation through modern numerical methods , 2012 .

[26]  Achim Zeileis,et al.  Who Did What? The Roles of R Package Authors and How to Refer to Them , 2012 .

[27]  G. Marshall,et al.  Citation analysis as a measure of article quality, journal influence and individual researcher performance , 2012 .

[28]  Amber Dance,et al.  Authorship: Who's on first? , 2012, Nature.

[29]  Chad Trabant,et al.  Data Products at the IRIS DMC: Stepping Stones for Research and Other Applications , 2012 .

[30]  Remko Scharroo,et al.  Generic Mapping Tools: Improved Version Released , 2013 .

[31]  Wolfgang Bangerth,et al.  What makes computational open source software libraries successful , 2013 .

[32]  Ian M. Mitchell,et al.  Best Practices for Scientific Computing , 2012, PLoS biology.

[33]  Nancy Wilkins-Diehr,et al.  Summary of the First Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE1) , 2014, ArXiv.

[34]  Greg Wilson,et al.  Software Carpentry: lessons learned , 2013, F1000Research.

[35]  Yolanda Gil,et al.  OntoSoft: Capturing Scientific Software Metadata , 2015, K-CAP.

[36]  Nancy Wilkins-Diehr,et al.  Report on the Second Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE2) , 2015, ArXiv.

[37]  Daniel S. Katz,et al.  Report on the Third Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE3) , 2016, ArXiv.

[38]  William Gropp,et al.  PETSc Users Manual Revision 3.4 , 2016 .

[39]  Daniel S. Katz,et al.  Software citation principles , 2016, PeerJ Comput. Sci..

[40]  David Wells,et al.  The deal.II Library, Version 8.4 , 2016, J. Num. Math..

[41]  James Howison,et al.  Software in the scientific literature: Problems with seeing, finding, and using software mentioned in the biology literature , 2016, J. Assoc. Inf. Sci. Technol..

[42]  Diane H. Sonnenwald,et al.  Association for Information Science and Technology , 2017 .