Understanding and Improving Artifact Sharing in Software Engineering Research

In recent years, many software engineering researchers have begun to include artifacts alongside their research papers. Ideally, artifacts, which include tools, benchmarks, data, and more, support the dissemination of ideas, provide evidence for research claims, and serve as a starting point for future research. This often takes the form of a link in the paper pointing to a website containing these additional materials. However, in practice, artifacts suffer from a variety of issues that prevent them from fully realising that potential. To help the software engineering community realise the potential of artifacts, we seek to understand the challenges involved in the creation, sharing, and use of artifacts. To that end, we perform a mixed-methods study including a publication analysis and online survey of 153 software engineering researchers. We apply the established theory of diffusion of innovation, and draw from the field of implementation science, to make evidence-based recommendations. By analysing the perspectives of artifact creators, users, and reviewers, we identify several high-level challenges that affect the quality of artifacts including mismatched expectations between these groups, and a lack of sufficient reward for both creators and reviewers. Using diffusion of innovation as a framework, we analyse how these challenges relate to one another, and build an understanding of the factors that affect the sharing and success of artifacts. Finally, using principles from implementation science, we make evidence-based recommendations for specific sub-communities (e.g., students and postdocs, artifact evaluation committees, funding bodies, and professional organisations) to improve the quality of artifacts.

[1]  Jeffrey C. Carver,et al.  Replicating software engineering experiments: addressing the tacit knowledge problem , 2002, Proceedings International Symposium on Empirical Software Engineering.

[2]  Victoria Stodden,et al.  Enabling Reproducible Research: Open Licensing for Scientific Innovation , 2009 .

[3]  Jeffrey C. Carver Towards Reporting Guidelines for Experimental Replications: A Proposal , 2010 .

[4]  K. Charmaz,et al.  Constructing Grounded Theory , 2014 .

[5]  Shriram Krishnamurthi,et al.  Artifact evaluation for software conferences , 2013, SOEN.

[6]  Grigori Fursin,et al.  Collective Knowledge: Towards R&D sustainability , 2016, 2016 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[7]  Andrea C. Arpaci-Dusseau,et al.  The Popper Convention: Making Reproducible Systems Evaluation Practical , 2017, 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

[8]  Vitaly Shmatikov,et al.  Robust De-anonymization of Large Sparse Datasets , 2008, 2008 IEEE Symposium on Security and Privacy (sp 2008).

[9]  Lauren Herckis Passing the Baton: Digital Literacy and Sustained Implementation of eLearning Technologies , 2018 .

[10]  Anne C. Grego-Nagel An exploratory study of the adoption of mobile telecommunications service in order to improve mobile health service development , 2016 .

[11]  Shouling Ji,et al.  Structural Data De-anonymization: Quantification, Practice, and Implications , 2014, CCS.

[12]  Panos K. Chrysanthis,et al.  Artifact Evaluation: FAD or Real News? , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[13]  Susan Stepney,et al.  BugZoo: a platform for studying software bugs , 2018, ICSE.

[14]  Johnny Saldaña,et al.  The Coding Manual for Qualitative Researchers , 2009 .

[15]  David Silverman,et al.  Doing Qualitative Research: A Comprehensive Guide , 2008 .

[16]  Panos K. Chrysanthis,et al.  Artifact Evaluation: Is It a Real Incentive? , 2017, 2017 IEEE 13th International Conference on e-Science (e-Science).

[17]  R. Glasgow,et al.  Evaluating the public health impact of health promotion interventions: the RE-AIM framework. , 1999, American journal of public health.

[18]  Paul Watson,et al.  CARMEN: Code analysis, Repository and Modeling for e-Neuroscience , 2011, ICCS.

[19]  Christian S. Collberg,et al.  Repeatability in computer systems research , 2016, Commun. ACM.

[20]  John N Lavis,et al.  Use of health systems and policy research evidence in the health policymaking in eastern Mediterranean countries: views and practices of researchers , 2012, Implementation Science.

[21]  Jeffrey C. Carver,et al.  The role of replications in Empirical Software Engineering , 2008, Empirical Software Engineering.

[22]  Inder M. Verma,et al.  The NIH Roadmap , 2004 .

[23]  Victoria Stodden,et al.  The Legal Framework for Reproducible Scientific Research: Licensing and Copyright , 2009, Computing in Science & Engineering.

[24]  R BasiliVictor,et al.  Protocols in the use of empirical software engineering artifacts , 2007 .

[25]  A. Detsky,et al.  Evidence-based medicine. A new approach to teaching the practice of medicine. , 1992, JAMA.

[26]  K. F. レンツ,et al.  the Creative Commons , 2011 .

[27]  Brigid Wilson,et al.  Implementing Reproducible Research , 2014 .

[28]  N. Schillewaert,et al.  Organizational innovation adoption: a multi-level framework of determinants and opportunities for future research , 2002 .

[29]  E. Zerhouni The NIH Roadmap , 2003, Science.

[30]  Douglas Thain,et al.  Conducting reproducible research with Umbrella: Tracking, creating, and preserving execution environments , 2016, 2016 IEEE 12th International Conference on e-Science (e-Science).

[31]  A. Brooks,et al.  Replication's Role in Software Engineering , 2008, Guide to Advanced Empirical Software Engineering.

[32]  Gary Johns,et al.  CONSTRAINTS ON THE ADOPTION OF PSYCHOLOGY‐BASED PERSONNEL PRACTICES: LESSONS FROM ORGANIZATIONAL INNOVATION , 1993 .

[33]  Hugh M. O'Neill,et al.  Patterns in the Diffusion of Strategies Across Organizations: Insights from the Innovation Diffusion Literature , 1998 .

[34]  Alex M. Warren Repeatability and Benefaction in Computer Systems Research — A Study and a Modest Proposal , 2015 .

[35]  Leif Singer,et al.  A study of innovation diffusion through link sharing on stack overflow , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[36]  Forrest Shull,et al.  An evolutionary testbed for software technology evaluation , 2005, Innovations in Systems and Software Engineering.

[37]  Sébastien Li-Thiao-Té,et al.  Literate Program Execution for Reproducible Research and Executable Papers , 2012, ICCS.

[38]  Suzanne J. Matthews,et al.  Paper Mâché: Creating Dynamic Reproducible Science , 2011, ICCS.

[39]  Janice M. Morse,et al.  Qualitative Nursing Research: A Free-for-All? , 1991 .

[40]  Martina Zitterbart,et al.  Taming the Complexity of Artifact Reproducibility , 2017, Reproducibility@SIGCOMM.

[41]  Forrest Shull,et al.  Building Knowledge through Families of Experiments , 1999, IEEE Trans. Software Eng..

[42]  G. Andrews,et al.  Mja Guidelines for Assessing Qualitative Research Quality in Qualitative Research Criteria for Authors and Assessors in the Submission and Assessment of Qualitative Research Articles for the Medical Journal of Australia , 2022 .

[43]  Leif Singer,et al.  An exploratory study of the adoption of mobile development platforms by software engineers , 2014, MOBILESoft 2014.

[44]  Jan Vitek,et al.  The real software crisis , 2015, Commun. ACM.

[45]  Natalia Juristo Juzgado,et al.  Replications of software engineering experiments , 2013, Empirical Software Engineering.

[46]  Sree Nilakanta,et al.  Implementation of Electronic Data Interchange: An Innovation Diffusion Perspective , 1994, J. Manag. Inf. Syst..

[47]  Barbara A. Kitchenham,et al.  The role of replications in empirical software engineering—a word of warning , 2008, Empirical Software Engineering.

[48]  E. Rogers,et al.  Diffusion of innovations , 1964, Encyclopedia of Sport Management.

[49]  Alan R. Hevner,et al.  The Successful Diffusion of Innovations: Guidance for Software Development Organizations , 2000, IEEE Softw..

[50]  Emerson R. Murphy-Hill,et al.  Do Developers Discover New Tools On The Toilet? , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).