Prototyping Digital Libraries Handling Heterogeneous Data Sources - The ETANA-DL Case Study

Information systems used in archaeology have several needs: interoperability among heterogeneous systems, access to information without significant delay, long-term preservation of data, and a suite of services for users. In this paper, we show how digital library (DL) techniques can be employed to provide solutions to three of these problems, using a prototype archaeological digital library, ETANA-DL, as a case study. First, ETANA-DL applies and extends the metadata harvesting approach to address some of these needs: interoperability, rapid access to data, and data preservation. Second, we show that the availability of a pool of components implementing common DL services helped us create the prototype rapidly; the prototype was subsequently used for requirements elicitation. Third, because complex archaeological information systems are difficult to understand, we describe our efforts to model them using the 5S framework, and show how the partially developed model has been used to implement complex services that help users carry out key tasks with the integrated data.
