Risk Assessment for Scientific Data

Ongoing stewardship is required to keep data collections and archives in existence. Scientific data collections may face a range of risk factors that could hinder, constrain, or limit current or future data use. Identifying such risk factors to data use is a key step in preventing or minimizing data loss. This paper presents an analysis of data risk factors that scientific data collections may face, and a data risk assessment matrix to support data risk assessments to help ameliorate those risks. The goals of this work are to inform and enable effective data risk assessment by: a) individuals and organizations who manage data collections, and b) individuals and organizations who want to help to reduce the risks associated with data preservation and stewardship. The data risk assessment framework presented in this paper provides a platform from which risk assessments can begin, and a reference point for discussions of data stewardship resource allocations and priorities.

[1]  Division on Earth Risk Assessment in the Federal Government: Managing the Process , 1983 .

[2]  Paul Poli,et al.  Recent Advances in Satellite Data Rescue , 2017 .

[3]  Jane Greenberg,et al.  Where Have All the Scientific Data Gone? LIS Perspective on the Data-At-Risk Predicament , 2014, Coll. Res. Libr..

[4]  P. Slovic Trust, Emotion, Sex, Politics, and Science: Surveying the Risk‐Assessment Battlefield , 1999, Risk analysis : an official publication of the Society for Risk Analysis.

[5]  Heather M. Ryan Occam's Razor and File Format Endangerment Factors , 2014, iPRES.

[6]  Magenta Book,et al.  AUDIT AND CERTIFICATION OF TRUSTWORTHY DIGITAL REPOSITORIES , 2011 .

[7]  Thomer Andrea,et al.  Stronger together: the case for cross-sector collaboration in identifying and preserving at-risk data , 2017 .

[8]  Christopher A. Lee,et al.  Open Archival Information System (OAIS) Reference Model , 2010 .

[9]  John Chodacki,et al.  Data Mirror: Complementing Data Producers , 2017 .

[10]  Elizabeth Yakel,et al.  Trust in Digital Repositories , 2013, Int. J. Digit. Curation.

[11]  Ingrid Dillo,et al.  The Perceived Value of Acquiring Data Seals of Approval , 2017 .

[12]  Edward R. Cook,et al.  A cross‐taxa phenological dataset from Mohonk Lake, NY and its relationship to climate , 2007 .

[13]  Herman Stehouwer,et al.  Research data alliance , 2013 .

[14]  Sydney Levitus,et al.  The UNESCO-IOC-IODE "Global Oceanographic Data Archeology and Rescue" (GODAR) Project and "World Ocean Database" Project , 2012, Data Sci. J..

[15]  Nancy Y. McGovern Data rescue: observations from an archivist , 2017, CSOC.

[16]  John E. Thompson,et al.  History of Fish Presence and Absence Following Lake Acidification and Recovery in Lake Minnewaska, Shawangunk Ridge, NY , 2015 .

[17]  Donald E. Zimmerman,et al.  A group card sorting methodology for developing informational Web sites , 2002, Proceedings. IEEE International Professional Communication Conference.

[18]  Ge Peng,et al.  The State of Assessing Data Stewardship Maturity - An Overview , 2018, Data Sci. J..

[19]  Matthew S. Mayernik,et al.  Modernizing Library Metadata for Historical Weather and Climate Data Collections , 2017 .

[20]  Jane Greenberg,et al.  Metadata for Data Rescue and Data at Risk , 2011 .

[21]  D. Gallaher,et al.  The process of bringing dark data to light: The rescue of the early Nimbus satellite data , 2015 .

[22]  John L. Faundeen,et al.  Developing Criteria to Establish Trusted Digital Repositories , 2017, Data Sci. J..

[23]  Ayoung Yoon,et al.  Data reusers' trust development , 2017, J. Assoc. Inf. Sci. Technol..

[24]  Thomas Zimmermann,et al.  Card-sorting , 2016, Perspectives on Data Science for Software Engineering.

[25]  Irene V. Pasquetto,et al.  'What Data?' Records and Data Policy Coordination During Presidential Transitions , 2018, iConference.

[26]  Jeffrey L. Privette,et al.  A Unified Framework for Measuring Stewardship Practices Applied to Digital Environmental Datasets , 2015, Data Sci. J..

[27]  Bruce R. Barkstrom,et al.  Scientific Data Stewardship: Lessons Learned from a Satallite–Data Rescue Effort , 2007 .

[28]  Edward Iglesias,et al.  Optimizing Library Services- The OPAC , 2017 .

[29]  R. Elizabeth Griffin,et al.  When are Old Data New Data , 2015 .

[30]  Emily Maemura,et al.  Organizational assessment frameworks for digital preservation: A literature review and mapping , 2017, J. Assoc. Inf. Sci. Technol..

[31]  William K. Michener,et al.  NONGEOSPATIAL METADATA FOR THE ECOLOGICAL SCIENCES , 1997 .

[32]  John E. Thompson,et al.  Reconstructing a trophic cascade following unintentional introduction of golden shiner to Lake Minnewaska, New York, USA , 2016 .

[33]  Edward R. Cook,et al.  A Homogeneous Record (1896–2006) of Daily Weather and Climate at Mohonk Lake, New York* , 2010 .

[34]  Sarah Lamdan Lessons from DataRescue: The Limits of Grassroots Climate Change Data Preservation and the Need for Federal Records Law Reform , 2018 .

[35]  H. K. Ramapriyan NASA's EOSDIS, Trust and Certification , 2017 .

[36]  H. Frank Cervone,et al.  Project risk management , 2006, OCLC Syst. Serv..

[37]  Christoph Becker,et al.  The Design and Use of Assessment Frameworks in Digital Curation , 2020, J. Assoc. Inf. Sci. Technol..

[38]  Robert S. Chen,et al.  Curation of Scientific Data at Risk of Loss: Data Rescue and Dissemination , 2017 .

[39]  Margaret Janz Maintaining Access to Public Data-Lessons from Data Refuge , 2017 .

[40]  Alisa Mizikar,et al.  Flu.gov2010177Flu.gov. Washington, DC: US Department of Health and Human Services Last visited December 2009. URL: www.flu.gov Gratis , 2010 .

[41]  Jared Lyle,et al.  Retirement in the 1950s: Rebuilding a Longitudinal Research Database. , 2017, IASSIST quarterly.

[42]  Kerstin A. Lehnert,et al.  Rescue of long-tail data from the ocean bottom to the Moon: IEDA Data Rescue Mini-Awards , 2015 .

[43]  Ge Peng,et al.  Practical Application of a Data Stewardship Maturity Matrix for the NOAA OneStop Project , 2019 .

[44]  Terje Aven,et al.  Risk assessment and risk management: Review of recent advances on their foundation , 2016, Eur. J. Oper. Res..

[45]  Sergiu Gordea,et al.  A Decision Support System to Facilitate File Format Selection for Digital Preservation , 2017 .