From Personal Tool to Community Resource: What's the Extra Work and Who Will Do It?

Sharing scientific data, software, and instruments is becoming increasingly common as science moves toward large-scale, distributed collaborations. Sharing these resources requires extra work to make them generally useful. Although we know much about the extra work associated with sharing data, we know little about the work associated with sharing contributions to software, even though software is of vital importance to nearly every scientific result. This paper presents a qualitative, interview-based study of the extra work that developers and end users of scientific software undertake. Our findings indicate that they conduct a rich set of extra work around community management, code maintenance, education and training, developer-user interaction, and foreseeing user needs. We identify several conditions under which they are likely to do this work, as well as design principles that can facilitate it. Our results have important implications for future empirical studies as well as funding policy.

[1]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[2]  James D. Herbsleb,et al.  Impression formation in online peer production: activity traces and personal profiles in github , 2013, CSCW.

[3]  Kevin Crowston,et al.  Free/Libre open-source software development: What we know and what we do not know , 2012, CSUR.

[4]  James D. Herbsleb,et al.  Scientific software production: incentives and collaboration , 2011, CSCW.

[5]  Egon L. Willighagen,et al.  Bioclipse: an open source workbench for chemo- and bioinformatics , 2007, BMC Bioinformatics.

[6]  Sonali K. Shah Motivation, Governance, and the Viability of Hybrid Forms in Open Source Software Development , 2006, Manag. Sci..

[7]  Theresa Velden,et al.  Explaining field differences in openness and sharing in scientific communities , 2013, CSCW.

[8]  Mary Beth Rosson,et al.  Design Planning in End-User Web Development , 2007 .

[9]  Wanda J. Orlikowski,et al.  Learning from Notes: organizational issues in groupware implementation , 1992, CSCW '92.

[10]  Juliet M. Corbin,et al.  Basics of Qualitative Research (3rd ed.): Techniques and Procedures for Developing Grounded Theory , 2008 .

[11]  J. Tirole,et al.  Some Simple Economics of Open Source , 2002 .

[12]  Kouichi Kishida,et al.  Toward an understanding of the motivation of open source software developers , 2003, 25th International Conference on Software Engineering, 2003. Proceedings..

[13]  Eric A. von Hippel,et al.  How Open Source Software Works: 'Free' User-to-User Assistance? , 2000 .

[14]  Jonathan Grudin,et al.  Why CSCW applications fail: problems in the design and evaluationof organizational interfaces , 1988, CSCW '88.

[15]  Ixchel M. Faniel,et al.  Reusing Scientific Data: How Earthquake Engineering Researchers Assess the Reusability of Colleagues’ Data , 2010, Computer Supported Cooperative Work (CSCW).

[16]  Nancy A. Van House,et al.  Cooperative knowledge work and practices of trust: sharing environmental planning data sets , 1998, CSCW '98.

[17]  Judith Segal Some Problems of Professional End User Developers , 2007 .

[18]  James D. Herbsleb,et al.  Social coding in GitHub: transparency and collaboration in an open software repository , 2012, CSCW.

[19]  Charlotte P. Lee,et al.  Beyond trust and reliability: reusing data in collaborative cancer epidemiology research , 2013, CSCW.

[20]  Charlotte P. Lee,et al.  Synergizing in Cyberinfrastructure Development , 2010, Computer Supported Cooperative Work (CSCW).

[21]  Geoffrey C. Bowker,et al.  Making an Issue out of a Standard , 2013 .

[22]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[23]  Ann Zimmerman,et al.  Not by metadata alone: the use of diverse forms of knowledge to locate data for reuse , 2007, International Journal on Digital Libraries.

[24]  Premkumar T. Devanbu,et al.  How social Q&A sites are changing knowledge sharing in open source software communities , 2014, CSCW.

[25]  Ralph Katz,et al.  Shifting Innovation to Users via Toolkits , 2002, Manag. Sci..

[26]  Brian Fitzgerald,et al.  Why Hackers Do What They Do: Understanding Motivation and Effort in Free/Open Source Software Projects , 2007 .

[27]  Jeremy P. Birnholtz,et al.  Data at work: supporting sharing in science and engineering , 2003, GROUP.

[28]  Anselm L. Strauss,et al.  Basics of qualitative research : techniques and procedures for developing grounded theory , 1998 .

[29]  Jennifer Marlow,et al.  Activity traces and signals in software developer recruitment and hiring , 2013, CSCW.

[30]  James D. Herbsleb,et al.  Incentives and integration in scientific software production , 2013, CSCW.

[31]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[32]  Tun Lu,et al.  Meanings and boundaries of scientific software sharing , 2013, CSCW.

[33]  Sandra Slaughter,et al.  Understanding the Motivations, Participation, and Performance of Open Source Software Developers: A Longitudinal Study of the Apache Projects , 2006, Manag. Sci..

[34]  E. von Hippel,et al.  Sources of Innovation , 2016 .

[35]  Drew Paine,et al.  The work of developing cyberinfrastructure middleware projects , 2013, CSCW.

[36]  Egon L. Willighagen,et al.  Bioclipse 2: A scriptable integration platform for the life sciences , 2009, BMC Bioinformatics.

[37]  Steven J. Jackson,et al.  Why CSCW needs science policy (and vice versa) , 2013, CSCW.

[38]  Paul Dourish,et al.  The human infrastructure of cyberinfrastructure , 2006, CSCW '06.