Digital Research Data Curation: Overview of Issues, Current Activities, and Opportunities for the Cornell University Library

Formed in December of 2006, the Cornell University Library's Data Working Group's purpose is to exchange information about CUL activities related to data curation, to review and exchange information about developments and activities in data curation in general, and to consider and recommend strategic opportunities for CUL to engage in the area of data curation. The Data Working Group has discussed publications and activities related to data curation, and has hosted presentations by (or discussions with) DaWG members and Cornell faculty and staff. This white paper presents an overview of the current landscape and issues surrounding data curation, and includes recommendations for CUL in this area.

[1]  Helen Hockx-Yu Digital Curation Centre - Phase Two , 2007, Int. J. Digit. Curation.

[2]  T. Todd Elvins,et al.  Controlled publication of digital scientific data , 2002, CACM.

[3]  Glenn R. Flierl,et al.  The US JGOFS data management experience , 2006 .

[4]  Lee Dirks,et al.  Research-Output Repositories - An Overview of Microsoft Initiatives , 2008 .

[5]  A.M.G. Macdonald Codata (committee on data for science and technology), international compendium of numerical data projects , 1970 .

[6]  R. Frost,et al.  From data to wisdom: pathways to successful data management for Australian science. Report of the working group on Data for Science to the Prime Minister's Science, Engineering and Innovation Council (PMSEIC) , 2006 .

[7]  TIM M. BLACKBURN,et al.  Reproducibility and Repeatability in Ecology , 2006 .

[8]  Helena Karasti,et al.  Enriching the Notion of Data Curation in E-Science: Data Managing and Information Infrastructuring in the Long Term Ecological Research (LTER) Network , 2006, Computer Supported Cooperative Work (CSCW).

[9]  Daniel Atkins,et al.  Revolutionizing Science and Engineering Through Cyberinfrastructure: Report of the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure , 2003 .

[10]  Alma Swan,et al.  The business of digital repositories , 2008 .

[11]  Dan L. Burk,et al.  Intellectual Property in the Context of e-Science , 2007, J. Comput. Mediat. Commun..

[12]  Roger Clarke,et al.  Open Source Software And Open Content As Models For eBusiness , 2004, Bled eConference.

[13]  Jianting Zhang,et al.  Data Integration and Workflow Solutions for Ecology , 2005, DILS.

[14]  Peter Buneman,et al.  Report on the First International Workshop on Database Preservation (PresDB'07) , 2007, SGMD.

[15]  Christine L. Borgman,et al.  Data, disciplines, and scholarly publishing , 2008, Learn. Publ..

[16]  L. Lyon Dealing with Data: Roles, Rights, Responsibilities and Relationships. Consultancy Report. , 2007 .

[17]  Ross Harvey DCC Digital Curation Manual: Instalment on Appraisal and Selection , 2006 .

[18]  CYNTHIA SIMS PARR Open Sourcing Ecological Data , 2007 .

[19]  Norbert Lossau DRIVER : Networking European Scientific Repositories , 2006 .

[20]  Neil Beagrie,et al.  Digital Curation for Science, Digital Libraries, and Individuals , 2008, Int. J. Digit. Curation.

[21]  John M. Abowd,et al.  New Approaches to Confidentiality Protection: Synthetic Data, Remote Access and Research Data Centers , 2004, Privacy in Statistical Databases.

[22]  Oya Rieger Cornell University Library Digital Preservation Policy Framework , 2004 .

[23]  Matthew B. Jones,et al.  Metacat: a schema-independent XML database system , 2001, Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001.

[24]  Walter Hamscher,et al.  Principles of Diagnosis: Current Trends and a Report on the First International Workshop , 1991, AI Mag..

[25]  Anna Keller Gold,et al.  Cyberinfrastructure, Data, and Libraries, Part 1: A Cyberinfrastructure Primer for Librarians , 2007, D Lib Mag..

[26]  Clifford Lynch 19 – Open computation: beyond human reader-centric views of scholarly literatures , 2006 .

[27]  Declan Butler Agencies join forces to share data , 2007, Nature.

[28]  David Groenewegen,et al.  The Data Curation Continuum: Managing Data Objects in Institutional Repositories , 2007, D Lib Mag..

[29]  Philip M. Davis,et al.  Institutional Repositories: Evaluating the Reasons for Non-use of Cornell University's Installation of DSpace , 2007, D Lib Mag..

[30]  Thomas Engel,et al.  Basic Overview of Chemoinformatics , 2006, J. Chem. Inf. Model..

[31]  Matthew B. Jones,et al.  Managing heterogeneous ecological data using Morpho , 2002, Proceedings 14th International Conference on Scientific and Statistical Database Management.

[32]  Anna Keller Gold Cyberinfrastructure, Data, and Libraries, Part 2: Libraries and the Data Challenge: Roles and Actions for Libraries , 2007, D Lib Mag..

[33]  Robin S. Smith Geospatial Data-sharing in UK Higher Education: informal repositories and users’ perspectives , 2007 .