Digital archives are dedicated to the long-term preservation of electronic information and have the mandate to enable sustained access despite rapid technology changes. Persistent archives are confronted with heterogeneous data formats, helper applications, and platforms being used over the lifetime of the archive. This is not unlike the interoperability challenges, for which mediators are devised. To prevent technological obsolescence over time and across platforms, a migration approach for persistent archives is proposed based on an XML infrastructure.We extend current archival approaches that build upon standardized data formats and simple metadata mechanisms for collection management, by involving high-level conceptual models and knowledge representations as an integral part of the archive and the ingestion/migration processes. Infrastructure independence is maximized by archiving generic, executable specifications of (i) archival constraints (i.e., "model validators"), and (ii) archival transformations that are part of the ingestion process. The proposed architecture facilitates construction of self-validating and self-instantiating knowledge-based archives. We illustrate our overall approach and report on first experiences using a sample collection from a collaboration with the National Archives and Records Administration (NARA).
[1]
Chaitanya K. Baru,et al.
Collection-Based Persistent Digital Archives - Part 1
,
2000,
D Lib Mag..
[2]
Reagan Moore,et al.
Knowledge-based Grids
,
2001,
2001 Eighteenth IEEE Symposium on Mass Storage Systems and Technologies.
[3]
James A. Hendler,et al.
The Semantic Web" in Scientific American
,
2001
.
[4]
John Garrett,et al.
Preserving Digital Information. Report of the Task Force on Archiving of Digital Information.
,
1996
.
[5]
Chaitanya K. Baru,et al.
XML-based information mediation for digital libraries
,
1999,
DL '99.
[6]
Sriram Raghavan,et al.
Search Middleware and the Simple Digital Library Interoperability Protocol
,
2000,
D Lib Mag..
[7]
Michael Kifer,et al.
Logical foundations of object-oriented and frame-based languages
,
1995,
JACM.