Corpus-based Generation of Numeral Classifier using Phrase Alignment

A severe problem for NLP applications dealing with multilingual language resources is the acquisition of knowledge that is obligatory in one language but not explicitly expressed in another language. In this paper, we focus on numeral classifiers, which are required in languages like Japanese but are usually not explicitly used in languages like English, which don't have such a classifier system.We propose a uniform method to assign the numeral classifiers of languages that have a numeral classifier system to the numerals of non-classifier languages. The omitted classifier information is extracted from a bilingual corpus based on phrasal correspondences in the contexts of the respective sentences.

[1]  Francis Bond,et al.  Reusing an ontology to generate numeral classifiers , 2000, COLING.

[2]  Arul Menezes,et al.  Achieving commercial-quality translation with example-based methods , 2001, MTSUMMIT.

[3]  Yuji Matsumoto,et al.  Acquisition of Phrase-level Bilingual Correspondence using Dependency Structure , 2000, COLING.

[4]  Kentaro Ogura,et al.  Classifiers in Japanese-to-English Machine Translation , 1996, COLING.

[5]  Kenji Imamura,et al.  Hierarchical Phrase Alignment Harmonized with Parsing , 2001, NLPRS.

[6]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[7]  Ralph Grishman,et al.  Alignment of Shared Forests for Bilingual Corpora , 1996, COLING.

[8]  Virach Sornlertlamvanich,et al.  Classifier Assignment By Corpus-Based Approach , 1994, COLING.

[9]  Eiichiro Sumita,et al.  Toward a Broad-coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World , 2002, LREC.

[10]  Eiichiro Sumita Example-based machine translation using DP-matching between work sequences , 2001, DDMMT@ACL.

[11]  William Croft,et al.  Semantic universals in classifier systems , 1994 .

[12]  Pamela A. Downing Numeral Classifier Systems: The Case of Japanese , 1996 .

[13]  Hitoshi Iida,et al.  Experiments and Prospects of Example-Based Machine Translation , 1991, ACL.

[14]  Jan Svartvik,et al.  A __ comprehensive grammar of the English language , 1988 .

[15]  Yuji Matsumoto,et al.  Sructural Matching of Parallel Texts , 1993, ACL.

[16]  Hiroyuki Kaji,et al.  Learning Translation Templates From Bilingual Text , 1992, COLING.