Scientific Publication Packages - A Selective Approach to the Communication and Archival of Scientific Output

The use of digital technologies within research has led to a proliferation of data, many new forms of research output and new modes of presentation and analysis. Many scientific communities are struggling with the challenge of how to manage the terabytes of data and new forms of output, they are producing. They are also under increasing pressure from funding organizations to publish their raw data, in addition to their traditional publications, in open archives. In this paper I describe an approach that involves the selective encapsulation of raw data, derived products, algorithms, software and textual publications within “scientific publication packages”. Such packages provide an ideal method for: encapsulating expert knowledge; for publishing and sharing scientific process and results; for teaching complex scientific concepts; and for the selective archival, curation and preservation of scientific data and output. They also provide a bridge between technological advances in the Digital Libraries and eScience domains. In particular, I describe the RDF-based architecture that we are adopting to enable scientists to construct, publish and manage “scientific publication packages” - compound digital objects that encapsulate and relate the raw data to its derived products, publications and the associated contextual, provenance and administrative metadata.

[1]  José G. Sanchez Marcano,et al.  On the Classification of Models , 2007 .

[2]  Matjaz B. Juric,et al.  Business process execution language for web services , 2004 .

[3]  H. Lan,et al.  SWRL : A semantic Web rule language combining OWL and ruleML , 2004 .

[4]  Frank van Harmelen,et al.  The Semantic Web – ISWC 2004 , 2004, Lecture Notes in Computer Science.

[5]  J. Houghton,et al.  Digital Broadband Content: Scientific Publishing , 2005 .

[6]  Richard P. Smiraglia The Nature of 'A Work': Implications for the Organization of Knowledge , 2001 .

[7]  Herbert Van de Sompel,et al.  Using MPEG-21 DIDL to Represent Complex Digital Objects in the Los Alamos National Laboratory Digital Library , 2003, D Lib Mag..

[8]  Carole A. Goble,et al.  Semantically Linking and Browsing Provenance Logs for E-science , 2004, ICSNW.

[9]  Tony Andrews Business Process Execution Language for Web Services Version 1.1 , 2003 .

[10]  Bertram Ludäscher,et al.  Kepler: an extensible system for design and execution of scientific workflows , 2004, Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004..

[11]  Sandra Payette,et al.  Fedora: an architecture for complex objects and their relationships , 2005, International Journal on Digital Libraries.

[12]  Anita Sundaram Coleman,et al.  Scientific Models as Works , 2002 .

[13]  Herbert Van de Sompel,et al.  aDORe: a modular, standards-based Digital Object Repository , 2005, Comput. J..

[14]  Gary L. Raines,et al.  Elements of spatial data quality , 1997 .

[15]  C. Rusbridge,et al.  The International Journal of Digital Curation , 2008 .

[16]  Jane Hunter,et al.  The ABC Ontology and Model , 2001, J. Digit. Inf..

[17]  Deborah L. McGuinness,et al.  OWL Web ontology language overview , 2004 .

[18]  Ict Access,et al.  Working Party on the Information Economy , 2007 .

[19]  Carole A. Goble,et al.  Using Semantic Web Technologies for Representing E-science Provenance , 2004, SEMWEB.

[20]  Jane Hunter,et al.  Semi-automated preservation and archival of scientific data using semantic grid services , 2005, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005..

[21]  Terence R. Smith,et al.  A Content Standard for Computational Models , 2001, D Lib Mag..

[22]  Marta Mattoso,et al.  Sharing scientific models in environmental applications , 2002, SAC '02.

[23]  B D Yallop,et al.  Council for the Central Laboratory of the Research Councils , 2005 .

[24]  James Frew,et al.  Lineage retrieval for scientific data processing: a survey , 2005, CSUR.

[25]  Wil M. P. van der Aalst,et al.  Design and Implementation of the YAWL System , 2004, CAiSE.

[26]  Bernard Rous,et al.  The ACM digital library , 2001, CACM.