Attributing and Referencing (Research) Software: Best Practices and Outlook From Inria

Software is a fundamental pillar of modern scientific research, across all fields and disciplines. However, there is a lack of adequate means to cite and reference software due to the complexity of the problem in terms of authorship, roles, and credits. This complexity is further increased when it is considered over the lifetime of a software that can span up to several decades. Building upon the internal experience of Inria, the French research institute for digital sciences, we provide in this article a contribution to the ongoing efforts in order to develop proper guidelines and recommendations for software citation and reference. Namely, we recommend: first, a richer taxonomy for software contributions with a qualitative scale; second, to put humans at the heart of the evaluation; and third, to distinguish citation from reference.

[1]  Ralph C. Merkle,et al.  A Digital Signature Based on a Conventional Encryption Function , 1987, CRYPTO.

[2]  Merijn de Jonge,et al.  Nix: A Safe and Policy-Free System for Software Deployment , 2004, LISA.

[3]  Leonard J. Shustek What Should We Collect to Preserve the History of Software? , 2006, IEEE Annals of the History of Computing.

[4]  Axel Funk,et al.  Die GNU General Public License, Version 3 , 2007 .

[5]  Xavier Leroy,et al.  Formal verification of a realistic compiler , 2009, CACM.

[6]  N. Paskin Digital Object Identifier (DOI) System , 2010 .

[7]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[8]  Fabrizio Valpreda,et al.  GNU General Public License , 2012 .

[9]  Ian M. Mitchell,et al.  Reproducible research for scientific computing: Tools and strategies for changing the culture , 2012, Computing in Science & Engineering.

[10]  Matthew S. Mayernik,et al.  Who’s Got the Data? Interdependencies in Science and Technology Collaborations , 2012, Computer Supported Cooperative Work (CSCW).

[11]  Konrad Hinsen,et al.  Software Development for Reproducible Research , 2013, Computing in Science & Engineering.

[12]  Shriram Krishnamurthi,et al.  Artifact evaluation for software conferences , 2013, SOEN.

[13]  Richard Van Noorden,et al.  The top 100 papers , 2014, Nature.

[14]  Alice Allen,et al.  Looking before leaping: Creating a software registry , 2014, ArXiv.

[15]  Christoph Treude,et al.  Assessing developer contribution with repository mining-based metrics , 2015, 2015 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[16]  Roger D. Peng,et al.  The reproducibility crisis in science: A statistical counterattack , 2015 .

[17]  Ludovic Courtès,et al.  Reproducible and User-Controlled Software Environments in HPC with Guix , 2015, Euro-Par Workshops.

[18]  Andreas Zeller,et al.  Artifact Evaluation for Publications (Dagstuhl Perspectives Workshop 15452) , 2015, Dagstuhl Reports.

[19]  Daniel S. Katz,et al.  Software citation principles , 2016, PeerJ Comput. Sci..

[20]  Suzanne A. Pierce,et al.  Toward the Geoscience Paper of the Future: Best practices for documenting and sharing research from data to software to provenance , 2016 .

[21]  M. Baker 1,500 scientists lift the lid on reproducibility , 2016, Nature.

[22]  Christian S. Collberg,et al.  Repeatability in computer systems research , 2016, Commun. ACM.

[23]  James Howison,et al.  Software in the scientific literature: Problems with seeing, finding, and using software mentioned in the biology literature , 2016, J. Assoc. Inf. Sci. Technol..

[24]  MacKenzie Smith,et al.  Software and the Scientist: Coding and Citation Practices in Geodynamics , 2017 .

[25]  Roberto Di Cosmo,et al.  Software Heritage: Why and How to Preserve Software Source Code , 2017, iPRES.

[26]  Gail Clement,et al.  Toward the Geoscience Paper of the Future: Best practices for documenting and sharing research from data to software to provenance , 2017 .

[27]  R. Sarpong,et al.  Bio-inspired synthesis of xishacorenes A, B, and C, and a new congener from fuscol† †Electronic supplementary information (ESI) available. See DOI: 10.1039/c9sc02572c , 2019, Chemical science.

[28]  Morane Gruenpeter,et al.  Create software deposit , 2018 .

[29]  Roberto Di Cosmo,et al.  Identifiers for Digital Objects: The case of software source code preservation , 2018, iPRES.

[30]  Roberto Di Cosmo,et al.  The creation of a new type of scientific deposit: Software , 2018 .

[31]  Roberto Di Cosmo,et al.  Building the universal archive of source code , 2018, Commun. ACM.

[32]  Morane Gruenpeter,et al.  La modération d'un dépôt logiciel , 2018 .

[33]  Veronique Kiermer,et al.  How can we ensure visibility and diversity in research contributions? How the Contributor Role Taxonomy (CRediT) is helping the shift from authorship to contributorship , 2019, Learn. Publ..

[34]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[35]  P. Alam ‘A’ , 2021, Composites Engineering: An A–Z Guide.