Integrating Query Translation and Document Translation in a Cross-language Information Retrieval System

The explosive growth of the WWW has produced very large multilingual text resources, which in turn have motivated research in Cross-Language Information Retrieval and online Web machine translation. In this paper, we propose integrating language translation with text processing systems to build a multilingual information system. We introduce a distributed English-Chinese system on the WWW to illustrate how query translation, search engines, and a web translation system can be integrated. Since July 1997, more than 46,000 users have accessed our system, and about 250,000 English web pages have been translated into Chinese or bilingual English-Chinese versions. The average user satisfaction at the document level is 67.47%.
