An MT Meta-Server for Information Retrieval on WWW

In the past few years, the World Wide Web (WWW) grows explosively and has become the most useful and powerful information retrieval and accessing system in the Internet. However, the language barrier becomes the major problem to search and retrieve WWW documents in different languages. This paper deals with how to incorporate the capability of language translation into WWW. Several design issues are discussed, including which material is translated, what roles the HTML tags play in translation, the tradeoff between the speed performance and the translation performance, and what form the translated result is presented in. We propose a meta-server architecture to communicate with clients and servers. An online and real-time English-to-Chinese machine translation system is developed. It is acted as a proxy system and an MT service system for retrieving WWW documents. The MT meta-server can not only be used as an assistant tool to access and retrieve information by the end-users in the client, but also can be treated as a Chinese document generating system to produce the Chinese or the bilingual English-Chinese versions of documents dynamically.

[1]  Makoto Nagao,et al.  A framework of a mechanical translation between Japanese and English by analogy principle , 1984 .

[2]  Hsin-Hsi Chen,et al.  Machine Translation: An Integrated Approach , 1995 .

[3]  Dik Lun Lee,et al.  A World Wide Web Resource Discovery System , 1995, World Wide Web J..

[4]  Geoffrey Leech,et al.  The tagged LOB Corpus : user's manual , 1986 .

[5]  Hsin-Hsi Chen,et al.  A Rule-Based and MT-Oriented Approach to Prepositional Phrase Attachment , 1996, COLING.

[6]  Tim Bray,et al.  Measuring the Web , 1996, World Wide Web J..

[7]  Hsin-Hsi Chen,et al.  Extracting Noun Phrases from Large-Scale Texts: A Hybrid Approach and its Automatic Evaluation , 1994, ACL.

[8]  Jonathan Slocum,et al.  The LRC Machine Translation System , 1985, Comput. Linguistics.

[9]  John C. Brown,et al.  High speed feature unification and parsing , 1995, Natural Language Engineering.

[10]  Oren Etzioni,et al.  Multi-Engine Search and Comparison Using the MetaCrawler , 1995, World Wide Web J..

[11]  John Cocke,et al.  A Statistical Approach to Machine Translation , 1990, CL.

[12]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[13]  Oren Etzioni,et al.  Multi-Service Search and Comparison Using the MetaCrawler , 1995 .

[14]  Peter B. Danzig,et al.  Harvest: A Scalable, Customizable Discovery and Access System , 1994 .

[15]  Hsin-Hsi Chen,et al.  Acquisition of Subcategorization Frames from Large Scale Texts , 1994 .

[16]  Pamela W. Jordan,et al.  Coping With Ambiguity in a Large-Scale Machine Translation System , 1994, COLING.

[17]  Jaime G. Carbonell,et al.  An Efficient Interlingua Translation System for Multi-lingual Document Production , 1991, MTSUMMIT.