Integration of Heterogeneous Digital Libraries with Semi-automatic Mapping and Browsing: From Formalization to Specification to Visualization

In this paper, we formalize the digital library (DL) integration problem and propose an overall approach based on the 5S framework. We apply 5S to domain-specific (archaeological) DLs, illustrating our solutions for key problems in DL integration. We use ETANA-DL as a case study to describe the process of semi-automatically generating a union catalog and a unified browsing service in an archaeological DL. A visual schema mapping tool is developed for union catalog creation. A pilot user study aids tool evaluation. Our approach is further validated through application of a general browsing component to two integrated DLs.

[1]  Vijayalakshmi Atluri,et al.  SI in digital libraries , 2000, CACM.

[2]  Eric Lease Morgan An Introduction to the Search/Retrieve URL Service (SRU) , 2004 .

[3]  Edward A. Fox,et al.  Visual Semantic Modeling of Digital Libraries , 2003, ECDL.

[4]  Edward A. Fox,et al.  Enhancing usability in CITIDEL: multimodal, multilingual, and interactive visualization interfaces , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[5]  Carl Lagoze,et al.  Core services in the architecture of the national science digital library (NSDL) , 2002, JCDL '02.

[6]  Venkataraman Ramesh,et al.  Information Sharing Among Multiple Heterogeneous Data Sources Distributed Across The Internet , 1998, Proceedings of the Thirty-First Hawaii International Conference on System Sciences.

[7]  Kevin Chen-Chuan Chang,et al.  Interoperability for digital libraries worldwide , 1998, CACM.

[8]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[9]  Herbert Van de Sompel,et al.  The open archives initiative , 2001 .

[10]  Edward A. Fox,et al.  Building interoperable digital library services: MARIAN, open archives, and the NDLTD , 2001, SIGIR '01.

[11]  Gary Marchionini,et al.  Toward a worldwide digital library , 1998, CACM.

[12]  paul jacobs Ancient World, Digital World: Excavation at Halif , 2001 .

[13]  Carl Lagoze,et al.  NCSTRL: Design and deployment of a globally distributed digital library , 2000, J. Am. Soc. Inf. Sci..

[14]  Sudha Ram,et al.  Digital Libraries for the Next Millennium: Challenges and Research Directions , 1999, Inf. Syst. Frontiers.

[15]  Chris Clifton,et al.  SEMINT: A tool for identifying attribute correspondences in heterogeneous databases using neural networks , 2000, Data Knowl. Eng..

[16]  Edward A. Fox,et al.  Designing Protocols in Support of Digital Library Componentization , 2002, ECDL.

[17]  Qianyi Gu,et al.  Designing a language for creating conceptual browsing interfaces for digital libraries , 2003, 2003 Joint Conference on Digital Libraries, 2003. Proceedings..

[18]  Peter B. Danzig,et al.  The Harvest Information Discovery and Access System , 1995, Comput. Networks ISDN Syst..

[19]  Marcos André Gonçalves Streams, Structures, Spaces,Scenarios, and Societies (5S): A Formal Digital Library Framework and Its Applications , 2004 .

[20]  Kurt Maly,et al.  Interoperable Heterogeneous Digital Libraries , 1998 .

[21]  Andreas Paepcke,et al.  Building the InfoBus: A Review of Technical Choices in the Stanford Digital Library Project , 2000 .

[22]  Edward A. Fox,et al.  Scenario-Based Generation of Digital Library Services , 2003, ECDL.

[23]  Wilhelm Hasselbring,et al.  Information system integration , 2000, CACM.

[24]  Ramana Rao,et al.  A focus+context technique based on hyperbolic geometry for visualizing large hierarchies , 1995, CHI '95.

[25]  Sudha Ram,et al.  Information systems interoperability: What lies beneath? , 2004, TOIS.

[26]  Chris North,et al.  Citiviz: A Visual User Interface to the CITIDEL System , 2004, ECDL.

[27]  Sriram Raghavan,et al.  Search Middleware and the Simple Digital Library Interoperability Protocol , 2000, D Lib Mag..

[28]  Pedro M. Domingos,et al.  Reconciling schemas of disparate data sources: a machine-learning approach , 2001, SIGMOD '01.

[29]  Edward A. Fox,et al.  Streams, structures, spaces, scenarios, societies (5s): A formal model for digital libraries , 2004, TOIS.

[30]  Laura M. Haas,et al.  Data-driven understanding and refinement of schema mappings , 2001, SIGMOD '01.

[31]  Wilhelm Hasselbring Information System Integration: Introduction. , 2000 .

[32]  John Moore,et al.  The Z39.50 information retrieval standard , 2000 .

[33]  Jun Wang,et al.  Java MARIAN: From an OPAC to a Modern Digital Library System , 2002, SPIRE.

[34]  Edward A. Fox,et al.  5SL: a language for declarative specification and generation of digital libraries , 2002, JCDL '02.

[35]  Rao Shen,et al.  ETANA-DL: a digital library for integrated handling of heterogeneous archaeological data , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[36]  Edward A. Fox,et al.  Prototyping Digital Libraries Handling Heterogeneous Data Sources - The ETANA-DL Case Study , 2004, ECDL.

[37]  Qinwei Zhu,et al.  5SGraph: A Modeling Tool for Digital Libraries , 2002 .

[38]  Edward A. Fox,et al.  Open digital libraries , 2002 .