Open Source Software for Digital Preservation Repositories: a Survey

In the digital age, the amount of data produced is growing exponentially. Governments and institutions can no longer rely on traditional methods for storing data and passing knowledge on to future generations. Digital data preservation has become a necessity that calls for proper strategies and tools. With this awareness, efforts are being made to create and refine software solutions capable of meeting the challenge of properly preserving digital information. This paper focuses on the state of the art in open source software for digital preservation and curation, used to assimilate and disseminate information to designated audiences. Eleven open source projects for digital preservation are surveyed with respect to supported standards and protocols, preservation strategies, reporting methodologies, development dynamics, targeted operating systems, multilingual support and open source license. Five of these projects are then analysed in more depth, with a focus on features deemed important for the area. Alongside the open source solutions, the paper also briefly surveys the standards and protocols relevant to digital data preservation. Several open source solutions exist for digital data preservation repositories, and they can form a base for overcoming the challenges on the way to mature and reliable digital data preservation.
