From artifacts to aggregations: Modeling scientific life cycles on the semantic Web

In the process of scientific research, many information objects are generated, all of which may remain valuable indefinitely. However, artifacts such as instrument data and associated calibration information may have little value in isolation; their meaning is derived from their relationships to each other. Individual artifacts are best represented as components of a life cycle that is specific to a scientific research domain or project. Current cataloging practices do not describe objects at a sufficient level of granularity, nor do they offer the globally persistent identifiers necessary to discover and manage scholarly products with World Wide Web standards. The Open Archives Initiative's Object Reuse and Exchange data model (OAI-ORE) meets these requirements. We demonstrate a conceptual implementation of OAI-ORE to represent the scientific life cycles of embedded networked sensor applications in seismology and environmental sciences. By establishing relationships between publications, data, and contextual research information, we illustrate how to obtain a richer and more realistic view of scientific practices. That view can facilitate new forms of scientific research and learning. Our analysis is framed by studies of scientific practices in a large, multi-disciplinary, multi-university science and engineering research center, the Center for Embedded Networked Sensing (CENS).
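
As a rough illustration of relating artifacts through an aggregation, the sketch below expresses an OAI-ORE style resource map as plain subject-predicate-object triples. It is a minimal sketch and not the implementation described in this work: the `example.org` URIs, the artifact titles, and the helper names `build_aggregation` and `aggregated_resources` are all assumptions made for illustration. Only the vocabulary terms (`ore:ResourceMap`, `ore:Aggregation`, `ore:describes`, `ore:aggregates`, `dcterms:title`) come from the published ORE and Dublin Core vocabularies.

```python
# Minimal sketch of an OAI-ORE style aggregation as plain triples.
# All example.org URIs and titles below are hypothetical placeholders.

ORE = "http://www.openarchives.org/ore/terms/"
DCTERMS = "http://purl.org/dc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def build_aggregation(rem_uri, agg_uri, artifacts):
    """Return triples for a resource map describing one aggregation.

    `artifacts` maps an artifact URI to a human-readable title.
    """
    triples = [
        (rem_uri, RDF_TYPE, ORE + "ResourceMap"),
        (rem_uri, ORE + "describes", agg_uri),
        (agg_uri, RDF_TYPE, ORE + "Aggregation"),
    ]
    for uri, title in artifacts.items():
        # The aggregation groups each artifact with its siblings.
        triples.append((agg_uri, ORE + "aggregates", uri))
        # A title is stored here as a plain string for simplicity.
        triples.append((uri, DCTERMS + "title", title))
    return triples


def aggregated_resources(triples, agg_uri):
    """List the URIs that a given aggregation aggregates."""
    return [o for s, p, o in triples
            if s == agg_uri and p == ORE + "aggregates"]


# Hypothetical artifacts from one sensor deployment.
artifacts = {
    "http://example.org/deployment/sensor-data.csv": "Instrument data",
    "http://example.org/deployment/calibration.txt": "Calibration information",
    "http://example.org/papers/article.pdf": "Journal article",
}
rem_uri = "http://example.org/rem/1"
agg_uri = "http://example.org/aggregation/1"
triples = build_aggregation(rem_uri, agg_uri, artifacts)

for member in aggregated_resources(triples, agg_uri):
    print(member)
```

Because every artifact carries its own URI and is linked to the aggregation explicitly, data, calibration information, and publications can be discovered together rather than in isolation. A real deployment would likely use an RDF library and a standard serialization instead of bare tuples.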
