Resurfacing Historical Scientific Data: A Case Study Involving Fruit Breeding Data

Objective: The objective of this paper is to illustrate the importance and complexities of working with historical analog data that exists on university campuses. Using a case study of fruit breeding data, we highlight issues and opportunities for librarians to help preserve and increase access to potentially valuable data sets. Methods: We worked in conjunction with researchers to inventory, describe, and increase access to a large, 100-year-old data set of analog fruit breeding data. This involved creating a spreadsheet to capture metadata about each data set, identifying data sets at risk for loss, and digitizing select items for deposit in our institutional repository. Results/Discussion: We illustrate that large amounts of data exist within biological and agricultural sciences departments and labs, and how past practices of data collection, record keeping, storage, and management have hindered data reuse. We demonstrate that librarians have a role in collaborating with researchers and providing direction in how to preserve analog data and make it available for reuse. This work may provide guidance for other science librarians pursuing similar projects. Conclusions: This case study demonstrates how science librarians can build or strengthen their role in managing and providing access to analog data by combining their data management skills with researchers’ needs to recover and reuse data. Correspondence: Shannon L. Farrell: sfarrell@umn.edu

[1]  Dominique Gravel,et al.  Sharing in Ecology and Evolution Moving toward a sustainable ecological science : don ’ t let data go to waste ! , 2013 .

[2]  R. MacNaughton,et al.  Maximizing the value of historical bedrock field observations: An example from northwest Canada , 2015 .

[3]  Rebecca D. Frank,et al.  Destruction/reconstruction: preservation of archaeological and zoological research data , 2015 .

[4]  David Bloom,et al.  From documents to datasets: A MediaWiki-based method of annotating and extracting species observations in century-old field notebooks , 2012, ZooKeys.

[5]  S. Nicholson A Semi-Quantitative, Regional Precipitation Data Set for Studying African Climates of the Nineteenth Century, Part I. Overview of the Data Set , 2001 .

[6]  P. Munro,et al.  Contesting African landscapes: A critical reappraisal of Sierra Leone’s competing forest cover histories , 2016 .

[7]  G. Bourrié,et al.  Anthropogenic contamination of groundwater with nitrate in arid region: case study of southern Hodna (Algeria) , 2013, Environmental Earth Sciences.

[8]  botanical libraries,et al.  Biodiversity Heritage Library , 2009 .

[9]  R. Primack,et al.  Uncovering, Collecting, and Analyzing Records to Investigate the Ecological Impacts of Climate Change: A Template from Thoreau's Concord , 2012 .

[10]  D. Bedford,et al.  Elucidation of the ‘Honeycrisp’ pedigree through haplotype analysis with a multi-family integrated SNP linkage map and a large apple (Malus×domestica) pedigree-connected SNP data set , 2017, Horticulture Research.

[11]  N. Thrift Environment and Planning D: Society and Space , 1995 .

[12]  B. Buma,et al.  A foundation of ecology rediscovered: 100 years of succession on the William S. Cooper plots in Glacier Bay, Alaska. , 2017, Ecology.

[13]  Paul Coleman,et al.  How I Learned To Stop Worrying , 1987 .

[15]  J. M. Welch,et al.  Measuring the Past: Free Digitized Sources of Historical International Economic Information , 2014 .

[16]  Shannon Farrell Value in those dusty file cabinets? Identifying analog data from our campus community , 2019 .