Towards a Theory of Information Preservation

Digital preservation is a pressing challenge for the library community. In this paper, we describe the initial results of our efforts to understand digital (as well as traditional) preservation problems from first principles. Our approach is to use the language of mathematics to formalize the concepts that are relevant to preservation. Our theory of preservation spaces draws upon ideas from logic and programming language semantics to describe the relationship between concrete objects and their information content. We also draw on game theory to show how objects change over time as a result of uncontrollable environmental effects and directed preservation actions. In the second half of this paper, we show how to use the mathematics of universal algebra as a language for objects whose information content depends on many components. We use this language to describe both migration and emulation strategies for digital preservation.
