Meta-data Extraction and Query Translation. Treatment of Semantic Heterogeneity

The project CARMEN ("Content Analysis, Retrieval and Metadata: Effective Networking") aimed among other goals at improving the expansion of searches in bibliographic databases into Internet searches. We pursued a set of different approaches to the treatment of semantic heterogeneity (meta-data extraction, query translation using statistic relations and cross-concordances). This paper describes the concepts and implementation of this approaches and the evaluation of the impact for the retrieval result.

[1]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[2]  Norbert Fuhr,et al.  CAP7: Searching and Browsing in Distributed Document Collections , 2000, ECDL.

[3]  Norbert Fuhr,et al.  The automatic indexing system AIR/PHYS - from research to applications , 1988, SIGIR '88.

[4]  Martin Braschler,et al.  Using Corpus-Based Approaches in a System for Multilingual Information Retrieval , 2000, Information Retrieval.

[5]  P. Schauble An information structure dealing with term dependance and polysemy , 1988, SIGIR '88.

[6]  Martin Braschler,et al.  Multilingual Information Retrieval Based on Document Alignment Techniques , 1998, ECDL.

[7]  Richard Hull,et al.  Managing semantic heterogeneity in databases: a theoretical prospective , 1997, PODS.

[8]  Peter Mutschke,et al.  Thematic mapping on bibliographic databases by cluster analysis: a description of the SDOC environment with SOLIS , 1995 .

[9]  Bruce R. Schatz,et al.  Semantic indexing for a complete subject discipline , 1999, DL '99.

[10]  Ali R. Hurson,et al.  Automated resolution of semantic heterogeneity in multidatabases , 1994, TODS.

[11]  Andrea S. Taylor,et al.  A Theoretical Perspective , 2000 .

[12]  Reginald Ferber Automated Indexing with Thesaurus Descriptors: A Co-occurence Based Approach to Multilingual Retrieval , 1997, ECDL.

[13]  Douglas R. Hale From research to application , 1991 .

[14]  Bruce R. Schatz,et al.  Performance and implications of semantic indexing in a distributed environment , 1999, CIKM '99.

[15]  Peter Mutschke,et al.  Heterogenitätsbehandlung bei textueller Information verschiedener Datentypen und Inhaltserschließungsverfahren , 2002 .

[16]  Jutta Marx,et al.  The Social Science Virtual Library Project Dealing with Semantic Heterogeneity at the Query Processing Level , 2001 .

[17]  Jürgen Krause,et al.  Vocabulary Switching and Automatic Metadata Extraction or How to Get Useful Information from a Digital Library , 2000, DELOS.