Ship space to database: Motivations to manage research data for the deep subseafloor biosphere

Author(s): Darch, Peter; Borgman, Christine L. | Abstract: What motivates the building of databases by scientific collaborations? In this paper, we argue that not only are databases being built to support scientific work per se, but also with the intention of performing a variety of social functions. To explore this, we present findings from a longitudinal ethnographic case study of a large, multidisciplinary, distributed scientific project studying subseafloor microbial life. A critical element of this project’s Data Management Plan is the construction of a data portal. We found a range of factors motivating not only the very construction of this portal, but also the inclusion of particular features. In addition to scientific factors relating to improved curation and accessibility of diverse and scarce data, we argue that the building of the portal is also motivated by social factors. One such factor is the attempt to build a community of domain researchers to endure beyond the end of this project in 2020. Another motivation is the possibility of using the portal as a tool to demonstrate the productivity of the project’s scientific domain in negotiations about the allocation of scarce and valuable ocean drilling cruise resources amongst multiple, competing scientific domains. Considering the social factors, in addition to scientific factors, that motivate the construction of scientific databases enriches accounts of how these databases are built.

[1]  Katrina J. Edwards,et al.  Center for Dark Energy Biosphere Investigations (C-DEBI) , 2009 .

[2]  S. Ross :Scholarship in the Digital Age: Information, Infrastructure, and the Internet , 2009 .

[3]  Thomas A. Finholt,et al.  The Long Now of Technology Infrastructure: Articulating Tensions in Development , 2009, J. Assoc. Inf. Syst..

[4]  Matthew S. Mayernik,et al.  Digital libraries for scientific data discovery and reuse: from vision to practical reality , 2010, JCDL '10.

[5]  Sabina Leonelli,et al.  When humans are the exception: Cross-species databases at the interface of biological and clinical research , 2012, Social studies of science.

[6]  A. Strauss,et al.  The discovery of grounded theory: strategies for qualitative research aldine de gruyter , 1968 .

[7]  Noel Enyedy,et al.  Little science confronts the data deluge: habitat ecology, embedded sensor networks, and digital libraries , 2007, International Journal on Digital Libraries.

[8]  Christine Hine,et al.  Databases as Scientific Instruments and Their Role in the Ordering of Scientific Work , 2006 .

[9]  Gordon Bell,et al.  Beyond the Data Deluge , 2009, Science.

[10]  Christine L. Borgman,et al.  The conundrum of sharing research data , 2012, J. Assoc. Inf. Sci. Technol..

[11]  Helena Karasti,et al.  Enriching the Notion of Data Curation in E-Science: Data Managing and Information Infrastructuring in the Long Term Ecological Research (LTER) Network , 2006, Computer Supported Cooperative Work (CSCW).

[12]  Christine L. Borgman,et al.  Big Data, Little Data, No Data: Scholarship in the Networked World , 2014 .

[13]  Charlotte P. Lee,et al.  Boundary Negotiating Artifacts: Unbinding the Routine of Boundary Objects and Embracing Chaos in Collaborative Work , 2007, Computer Supported Cooperative Work (CSCW).

[14]  A. Strauss,et al.  The Discovery of Grounded Theory , 1967 .

[15]  Ixchel M. Faniel,et al.  Reusing Scientific Data: How Earthquake Engineering Researchers Assess the Reusability of Colleagues’ Data , 2010, Computer Supported Cooperative Work (CSCW).

[16]  Nithya Ramanathan,et al.  Know Thy Sensor: Trust, Data Quality, and Data Integrity in Scientific Digital Libraries , 2007, ECDL.

[17]  Ann Zimmerman,et al.  New Knowledge from Old Data , 2008 .

[18]  Christopher Kelty,et al.  This is not an article: Model organism newsletters and the question of ‘open science’ , 2012 .

[19]  Charlotte P. Lee,et al.  Collaboration in Metagenomics: Sequence Databases and the Organization of Scientific Work , 2009, ECSCW.

[20]  Sabina Leonelli,et al.  Global data for local science: Assessing the scale of data infrastructures in biological and biomedical research , 2013, BioSocieties.

[21]  Nancy A. Van House,et al.  Cooperative knowledge work and practices of trust: sharing environmental planning data sets , 1998, CSCW '98.

[22]  Geoffrey C. Bowker,et al.  Designing an Infrastructure for Heterogeneity of Ecosystem Data, Collaborators, Organizations , 2002, DG.O.

[23]  Geoffrey C. Bowker Biodiversity Datadiversity , 2000 .

[24]  Jeremy P. Birnholtz,et al.  Data at work: supporting sharing in science and engineering , 2003, GROUP.

[25]  Micah Altman Transformative Effects of NDIIPP, the Case of the Henry A. Murray Archive , 2009, Libr. Trends.

[26]  F. Berman,et al.  Who Will Pay for Public Access to Research Data? , 2013, Science.

[27]  Matthew S. Mayernik,et al.  Drowning in data: digital library architecture to support scientific use of embedded sensor networks , 2007, JCDL '07.

[28]  B. Latour,et al.  Laboratory Life: The Construction of Scientific Facts , 1979 .

[29]  P. N. Edwards,et al.  Knowledge Infrastructures: Intellectual Frameworks and Research Challenges , 2013 .

[30]  Thea P Atwood,et al.  NSF Data Management Plans , 2014 .

[31]  S. Goldman,et al.  A Vast Machine: Computer Models, Climate Data, and the Politics of Global Warming , 2011 .

[32]  Florence Millerand,et al.  Infrastructure Time: Long-term Matters in Collaborative Development , 2010, Computer Supported Cooperative Work (CSCW).

[33]  Ronald L. Larsen On the Threshold of Cyberscholarship , 2008 .