Integration of complex archeology digital libraries: An ETANA-DL experience

In this paper, we formalize the digital library (DL) integration problem and propose an overall approach based on the 5S (streams, structures, spaces, scenarios, and societies) framework. We then apply that framework to integrate domain-specific (archeological) DLs, illustrating our solutions for key problems in DL integration. An integrated Archeological DL, ETANA-DL, is used as a case study to justify and evaluate our DL integration approach. More specifically, we develop a minimal metamodel for archeological DLs within the 5S theory. We implement the 5SSuite tool set to cover the process of union DL generation, including requirements gathering, conceptual modeling, rapid prototyping, and code generation. 5SSuite consists of 5SGraph, 5SGen, and SchemaMapper, each of which plays an important role in DL integration. We also propose an approach to integrated DLs based on the 5S formalism, which provides a systematic method to design and implement DL exploring services.

[1]  Hongjun Lu,et al.  Discovering and reconciling value conflicts for numerical data integration , 2001, Inf. Syst..

[2]  Edward A. Fox,et al.  Requirements Gathering and Modeling of Domain-Specific Digital Libraries with the 5S Framework: An Archaeological Case Study with ETANA , 2005, ECDL.

[3]  Edward A. Fox,et al.  Schema mapper: a visualization tool for DL integration , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[4]  Edward A. Fox,et al.  Exploring the computing literature with visualization and stepping stones & pathways , 2006, Commun. ACM.

[5]  Carl Lagoze,et al.  Core services in the architecture of the national science digital library (NSDL) , 2002, JCDL '02.

[6]  J. David Schloen Archaeological Data Models and Web Publication Using XML , 2001, Comput. Humanit..

[7]  Edward A. Fox,et al.  Prototyping Digital Libraries Handling Heterogeneous Data Sources - The ETANA-DL Case Study , 2004, ECDL.

[8]  Qinwei Zhu,et al.  5SGraph: A Modeling Tool for Digital Libraries , 2002 .

[9]  Laura M. Haas,et al.  PESTO : An Integrated Query/Browser for Object Databases , 1996, VLDB.

[10]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[11]  Nicholas J. Belkin,et al.  Braque: Design of an Interface to Support User Interaction in Information Retrieval , 1993, Inf. Process. Manag..

[12]  George W. Furnas,et al.  Considerations for information environments and the NaviQue workspace , 1998, DL '98.

[13]  Sudha Ram,et al.  Digital Libraries for the Next Millennium: Challenges and Research Directions , 1999, Inf. Syst. Frontiers.

[14]  Herbert Van de Sompel,et al.  A Spectrum of Interoperability: The Site for Science Prototype for the NSDL , 2002, D Lib Mag..

[15]  Christopher Olston,et al.  ScentTrails: Integrating browsing and searching on the Web , 2003, TCHI.

[16]  Varghese S. Jacob,et al.  Industrial-strength data warehousing , 1998, CACM.

[17]  Sudha Ram,et al.  Information systems interoperability: What lies beneath? , 2004, TOIS.

[18]  Edward A. Fox,et al.  Exploring digital libraries: integrating browsing, searching, and visualization , 2006, Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '06).

[19]  Gary Marchionini,et al.  Information Seeking in Electronic Environments , 1995 .

[20]  Marcos André Gonçalves Streams, Structures, Spaces,Scenarios, and Societies (5S): A Formal Digital Library Framework and Its Applications , 2004 .

[21]  Robin Ss Is there a difference , 1967 .

[22]  Vijayalakshmi Atluri,et al.  SI in digital libraries , 2000, CACM.

[23]  Edward A. Fox,et al.  Development of the coder system: A testbed for artificial intelligence methods in information retrieval , 1987, Inf. Process. Manag..

[24]  Hongjun Lu,et al.  DIRECT: a system for mining data value conversion rules from disparate data sources , 2002, Decis. Support Syst..

[25]  Wilhelm Hasselbring,et al.  Information system integration , 2000, CACM.

[26]  Henning Hopf Knowledge lost in information , 2007 .

[27]  Rao Shen,et al.  ETANA-DL: a digital library for integrated handling of heterogeneous archaeological data , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[28]  Edward A. Fox,et al.  Open digital libraries , 2002 .

[29]  Edward A. Fox,et al.  Visual Semantic Modeling of Digital Libraries , 2003, ECDL.

[30]  Gene Golovchinsky,et al.  Queries? Links? Is there a difference? , 1997, CHI.

[31]  Sriram Raghavan,et al.  Search Middleware and the Simple Digital Library Interoperability Protocol , 2000, D Lib Mag..

[32]  Manuel A. Pérez-Quiñones,et al.  Enhancing usability in CITIDEL: multimodal, multilingual, and interactive visualization interfaces , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[33]  Edward A. Fox,et al.  Development of a modern OPAC: from REVTOLC to MARIAN , 1993, SIGIR.

[34]  Chris North,et al.  Citiviz: A Visual User Interface to the CITIDEL System , 2004, ECDL.

[35]  Kevin Chen-Chuan Chang,et al.  Interoperability for digital libraries worldwide , 1998, CACM.

[36]  Edward A. Fox,et al.  Incremental, Semi-automatic, Mapping-Based Integration of Heterogeneous Collections into Archaeological Digital Libraries: Megiddo Case Study , 2005, ECDL.

[37]  Wilhelm Hasselbring Information System Integration: Introduction. , 2000 .

[38]  Sandra Payette,et al.  Interoperability for Digital Objects and Repositories: The Cornell/CNRI Experiments , 1999, D Lib Mag..