Data, data use, and scientific inquiry: two case studies of data practices

Data are proliferating far faster than they can be captured, managed, or stored. What types of data are most likely to be used and reused, by whom, and for what purposes? Answers to these questions will inform information policy and the design of digital libraries. We report findings from semi-structured interviews and field observations to investigate characteristics of data use and reuse and how those characteristics vary within and between scientific communities. The two communities studied are researchers at the Center for Embedded Network Sensing (CENS) and users of the Sloan Digital Sky Survey (SDSS) data. The data practices of CENS and SDSS researchers have implications for data curation, system evaluation, and policy. Some data that are important to the conduct of research are not viewed as sufficiently valuable to keep. Other data of great value may not be mentioned or cited, because those data serve only as background to a given investigation. Metrics to assess the value of documents do not map well to data.

[1]  Reijo Savolainen,et al.  Epistemic work and knowing in practice as conceptualizations of information use , 2009, Inf. Res..

[2]  Paul Solomon,et al.  Looking for Information—A Survey of Research on Information Seeking, Needs, and Behavior , 2003, Information Retrieval.

[3]  Jonathan Furner,et al.  Little Book, Big Book , 2003, J. Libr. Inf. Sci..

[4]  S. Lele,et al.  The nature of scientific evidence : statistical, philosophical, and empirical considerations , 2004 .

[5]  Christine L. Borgman,et al.  When use cases are not useful: data practices, astronomy, and digital libraries , 2011, JCDL '11.

[6]  B. Dervin,et al.  Information needs and uses. , 1986 .

[7]  Christina Courtright,et al.  Context in information behavior research , 2007 .

[8]  Laura Wynholds,et al.  Linking to Scientific Data: Identity Problems of Unruly and Poorly Bounded Digital Objects , 2011, Int. J. Digit. Curation.

[9]  B. Latour,et al.  Laboratory Life: The Social Construction of Scientific Facts , 1983 .

[10]  C. Borgman,et al.  Scholarly Communication and Bibliometrics. , 1992 .

[11]  S. Shapin Laboratory life. The social construction of scientific facts , 1981, Medical History.

[12]  Christine L. Borgman,et al.  Who is responsible for data? An exploratory study of data authorship, ownership, and responsibility , 2011, ASIST.

[13]  Matthew S. Mayernik,et al.  Moving Archival Practices Upstream: An Exploration of the Life Cycle of Ecological Sensing Data in Collaborative Field Research , 2008, Int. J. Digit. Curation.

[14]  Brian A. Maurer Models of Scientific Inquiry and Statistical Practice: Implications for the Structure of Scientific Knowledge , 2004 .

[15]  Noel Enyedy,et al.  Building Digital Libraries for Scientific Data: An Exploratory Study of Data Practices in Habitat Ecology , 2006, ECDL.

[16]  Rosy Jan,et al.  Citation analysis of Library Trends , 2009, Webology.

[17]  Matthew S. Mayernik,et al.  Digital libraries for scientific data discovery and reuse: from vision to practical reality , 2010, JCDL '10.

[18]  S. Woolgar,et al.  Representation in Scientific Practice , 1990 .

[19]  Noel Enyedy,et al.  Little science confronts the data deluge: habitat ecology, embedded sensor networks, and digital libraries , 2007, International Journal on Digital Libraries.

[20]  Plergiorgio Strata,et al.  Citation analysis , 1995, Nature.

[21]  Wiebe E. Bijker,et al.  Science in action : how to follow scientists and engineers through society , 1989 .

[22]  Nithya Ramanathan,et al.  Know Thy Sensor: Trust, Data Quality, and Data Integrity in Scientific Digital Libraries , 2007, ECDL.