Enhancing a Dictionary for Transfer Rule Acquisition

The JMdict/EDICT Japanese-English Dictionary is a freely available dictionary distributed in XML (JMdict) and text (EDICT) formats. It is widely used as a source of lexical material in dictionary systems and text-processing projects. We propose two refinements to make the dictionary more computationally tractable: marking entries where the English is not a translation equivalent, and expanding contracted entries. We then propose and apply semi-automatic methods to refine existing entries. The resulting dictionary is shown to be more suitable for the construction of machine translation rules.
