Cross-Lingual Information Retrieval and Semantic Interoperability for Cultural Heritage Repositories

This paper describes a computational linguistics-based approach for providing interoperability between multi-lingual systems in order to overcome crucial issues like cross-language and cross-collection retrieval. Our proposal is a system which improves capabilities of language-technology-based information extraction. In the last few years various theories have been developed and applied for making multicultural and multilingual resources easy to access. Important initiatives, like the development of the European Library and Europeana, aim to increase the availability of digital content from various types of providers and institutions. Therefore the accessibility to these resources requires the development of environments enabling to manage multilingual complexity. In this respect, we present a methodological framework which allows mapping both the data and the metadata among the languagespecific ontologies. The feasibility of crosslanguage information extraction and semantic search will be tested by implementing an early prototype system.

[1]  Masatoshi Yoshikawa,et al.  A combined statistical query term disambiguation in cross-language information retrieval , 2002, Proceedings. 13th International Workshop on Database and Expert Systems Applications.

[2]  Annibale Elia,et al.  Lexicon-Grammar, Electronic Dictionaries and Local Grammars of Italian , 2004 .

[3]  W. Bruce Croft,et al.  Phrasal translation and query expansion techniques for cross-language information retrieval , 1997, SIGIR '97.

[4]  Maurice Gross La construction de dictionnaires électroniques , 1989 .

[5]  Zdenek Zdrahal,et al.  Facilitating cross-language retrieval and machine translation by multilingual domain ontologies , 2010 .

[6]  C. Habel,et al.  Language , 1931, NeuroImage.

[7]  W. Bruce Croft,et al.  Resolving ambiguity for cross-language retrieval , 1998, SIGIR '98.

[8]  Aitor Soroa,et al.  Cross-lingual event-mining using wordnet as a shared knowledge interface , 2012 .

[9]  Maurice Gross,et al.  Méthodes en syntaxe : régime des constructions complétives , 1978 .

[10]  Alon Lavie,et al.  Cross Lingual and Semantic Retrieval for Cultural Heritage Appreciation , 2007, LaTeCH@ACL 2007.

[11]  Maddalen Lopez de Lacalle,et al.  Dictionary and Monolingual Corpus-based Query Translation for Basque-English CLIR , 2010, LREC.

[12]  Zellig S. Harris Transformations in Linguistic Structure , 1970 .

[13]  Federica Marano,et al.  Exploring formal models of linguistic data structuring. Enhanced solutions for knowledge management systems based on NLP applications , 2012 .

[14]  Rafael Capilla,et al.  Multilingual Information Access on the Web , 2015, Computer.

[15]  Gregory Grefenstette,et al.  Querying across languages: a dictionary-based approach to multilingual information retrieval , 1996, SIGIR '96.

[16]  Masatoshi Yoshikawa,et al.  Query term disambiguation for Web cross-language information retrieval using a search engine , 2000, IRAL '00.

[17]  Mark W. Davis,et al.  Free Resources And Advanced Alignment For Cross-Language Text Retrieval , 1997, TREC.

[18]  Ari Pirkola,et al.  The effects of query structure and dictionary setups in dictionary-based cross-language information retrieval , 1998, SIGIR '98.

[19]  Changning Huang,et al.  Improving query translation for cross-language information retrieval using statistical models , 2001, SIGIR '01.

[20]  Maurice Gross,et al.  Grammaire transformationnelle du français : syntaxe du verbe , 1968 .

[21]  Z. Harris Co-Occurrence and Transformation in Linguistic Structure , 1957 .

[22]  Paul Buitelaar,et al.  Ontologies in Cross-Language Information Retrieval , 2003, Wissensmanagement.

[23]  Johanna Monti,et al.  Multi-word unit processing in machine translation. Developing and using language resources for multi-word unit processing in machine translation , 2015 .

[24]  W. Bruce Croft,et al.  Dictionary Methods for Cross-Lingual Information Retrieval , 1996, DEXA.

[25]  Zellig S. Harris,et al.  A Grammar of English on Mathematical Principles , 1982 .

[26]  Max Silberztein,et al.  Dictionnaires électroniques et analyse automatique de textes : le système intex , 1993 .

[27]  Ruslan Mitkov,et al.  CLIR- and ontology-based approach for bilingual extraction of comparable documents , 2012 .