Scientific Data Collections and Distributed Collective Practice

As the basic sciences become increasingly information-intensive, the management and use of research data presents new challenges in the collective activities that constitute scholarly and scientific communication. This also presents new opportunities for understanding the role of informatics in scientific work practices, and for designing new kinds tools and resources needed to support them. These issues of data management, scientific communication and collective activity are brought together at once in scientific data collections (SDCs). What can the development and use of shared SDCs tell us about collective activity, dynamic infrastructures, and distributed scientific work? Using examples drawn from a nascent neuroscience data collection, we examine some unique features of SDCs to illustrate that they do more than act as infrastructures for scientific research. Instead, we argue that they are themselves instantiations of Distributed Collective Practice (DCP), and as such illustrate concepts of transition, emergence, and interdependency that may not be so apparent in other kinds of DCPs. We propose that research into SDCs can yield new insights into institutional arrangements, policymaking, and authority structures in other very large-scale socio-technical networks.

[1]  Stephen H. Koslow,et al.  Neuroinformatics: Opportunities across Disciplinary and National Borders , 1996, NeuroImage.

[2]  Neil R Smalheiser Linking investigators , 2003, EMBO reports.

[3]  John Leslie King Modern Information Infrastructure in the Support of Distributed Collective Practice in Transport , 2006, Computer Supported Cooperative Work (CSCW).

[4]  Langdon Winner Trust and terror: the vulnerability of complex socio‐technical systems , 2004 .

[5]  Jeremy P. Birnholtz,et al.  Data at work: supporting sharing in science and engineering , 2003, GROUP.

[6]  John L. Faundeen The challenge of archiving and preserving remotely sensed data , 2003, Data Sci. J..

[7]  Karen Ruhleder,et al.  Steps Toward an Ecology of Infrastructure: Design and Access for Large Information Spaces , 1996, Inf. Syst. Res..

[8]  B. Schutt Sorting things out. , 1970, The American journal of nursing.

[9]  Sarita Albagli,et al.  Memory Practices in the Sciences , 2008 .

[10]  Walid Ben-Ameur,et al.  Networks new economical virtual private , 2003, CACM.

[11]  Thomas A. Finholt,et al.  Collaboratories as a new form of scientific organization , 2003 .

[12]  M. Cragin Toward Integrative Science: Organizing Biodiversity and Neuroscience Data , 2005 .

[13]  Daniel E. Atkins A Report from the U.S. National Science Foundation Blue Ribbon Panel on Cyberinfrastructure , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[14]  Stephen H. Koslow,et al.  Should the neuroscience community make a paradigm shift to sharing primary data? , 2000, Nature Neuroscience.

[15]  Ann S. Zimmerman,et al.  DATA SHARING AND SECONDARY USE OF SCIENTIFIC DATA: EXPERIENCES OF ECOLOGISTS , 2003 .

[16]  Susan Leigh Star,et al.  Sorting Things Out: Classification and Its Consequences , 1999 .

[17]  Daniel Atkins,et al.  Revolutionizing Science and Engineering Through Cyberinfrastructure: Report of the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure , 2003 .

[18]  Carole L. Palmer,et al.  Information at the intersections of discovery: Case studies in neuroscience , 2004, ASIST.

[19]  Myriam Lewkowicz,et al.  Reflections on representations for infrastructural studies in the field of DCP , 2004 .

[20]  Rob Kling,et al.  Scientific Collaboratories as Socio-Technical Interaction Networks: A Theoretical Approach , 2000, ArXiv.

[21]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[22]  E. Gerson Scientific Work and Social Worlds , 1983 .

[23]  Nancy A. Van House Trust and epistemic communities in biodiversity data sharing , 2002, JCDL.

[24]  S E Fienberg,et al.  Sharing statistical data in the biomedical and health sciences: ethical, institutional, legal, and professional dimensions. , 1994, Annual review of public health.

[25]  Mark H. Ellisman,et al.  Data-intensive e-science frontier research , 2003, CACM.