Know Thy Sensor: Trust, Data Quality, and Data Integrity in Scientific Digital Libraries

For users to trust and interpret the data in scientific digital libraries, they must be able to assess the integrity of those data. Criteria for data integrity vary by context, by scientific problem, by individual, and a variety of other factors. This paper compares technical approaches to data integrity with scientific practices, as a case study in the Center for Embedded Networked Sensing (CENS) in the use of wireless, in-situ sensing for the collection of large scientific data sets. The goal of this research is to identify functional requirements for digital libraries of scientific data that will serve to bridge the gap between current technical approaches to data integrity and existing scientific practices.

[1]  S. Ross :Scholarship in the Digital Age: Information, Infrastructure, and the Internet , 2009 .

[2]  Deborah Estrin,et al.  Sympathy for the sensor network debugger , 2005, SenSys '05.

[3]  A. Strauss,et al.  The discovery of grounded theory: strategies for qualitative research aldine de gruyter , 1968 .

[4]  Ann Zimmerman,et al.  New Knowledge from Old Data , 2008 .

[5]  Geoffrey C. Bowker Biodiversity Datadiversity , 2000 .

[6]  Anne E. Trefethen,et al.  The Data Deluge: An e-Science Perspective , 2003 .

[7]  Gaurav S. Sukhatme,et al.  Designing Wireless Sensor Networks as a Shared Resource for Sustainable Development , 2006, 2006 International Conference on Information and Communication Technologies and Development.

[8]  Sarita Albagli,et al.  Memory Practices in the Sciences , 2005 .

[9]  Noel Enyedy,et al.  Little science confronts the data deluge: habitat ecology, embedded sensor networks, and digital libraries , 2007, International Journal on Digital Libraries.

[10]  Noel Enyedy,et al.  Building Digital Libraries for Scientific Data: An Exploratory Study of Data Practices in Habitat Ecology , 2006, ECDL.

[11]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[12]  平澤 Social Studies of Science : 抄録雑誌の概要 , 1987 .

[13]  C. Brodsky The Discovery of Grounded Theory: Strategies for Qualitative Research , 1968 .

[14]  Nithya Ramanathan Fixing Faults in Wireless Sensing Systems with Confidence , 2008 .

[15]  Deborah Estrin,et al.  SensorBase.org: A Centralized Repository to Slog Sensor Network Data (KNO 2) , 2006 .

[16]  Matthew S. Mayernik,et al.  Drowning in data: digital library architecture to support scientific use of embedded sensor networks , 2007, JCDL '07.

[17]  Rolf Isermann,et al.  Fault diagnosis and fault tolerance of drive systems - Status and research , 2009, 2009 European Control Conference (ECC).

[18]  Gaurav S. Sukhatme,et al.  IDEA: Iterative experiment Design for Environmental Applications , 2006 .

[19]  Eddie Kohler,et al.  Investigation of Hydrologic and Biogeochemical Controls on Arsenic Mobilization Using Distributed Sensing at a Field Site in Munshiganj, Bangladesh , 2006 .

[20]  Wei Hong,et al.  A macroscope in the redwoods , 2005, SenSys '05.

[21]  Geoffrey C. Bowker,et al.  Mapping biodiversity , 2000, Int. J. Geogr. Inf. Sci..

[22]  S. Traweek,et al.  Beamtimes and Lifetimes: The World of High Energy Physicists , 1988 .

[23]  Geoffrey C. Bowker Work and Information Practices in the Sciences of Biodiversity , 2000, VLDB.

[24]  Deborah Estrin,et al.  Sharing Sensor Network Data , 2007 .

[25]  J. Berson Memory Practices in the Sciences (review) , 2009 .