Paper Information - Enhanced Japanese Electronic Dictionary Look-up

Enhanced Japanese Electronic Dictionary Look-up

This paper describes the process of data preparation and reading generation for an ongoing project aimed at improving the accessibility of unknown words for learners of foreign languages, focusing initially on Japanese. Rather than requiring learners to know the exact readings of words in the foreign language, we allow look-up of dictionary entries by readings which learners can predictably be expected to associate with them. We automatically extract an exhaustive set of phonemic readings for each grapheme segment and learn basic morpho-phonological rules governing compound word formation, associating a probability with each. We then apply a naive Bayes model to generate a set of readings and assign each a likelihood score based on the previously extracted evidence and corpus frequencies.
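The reading-generation step can be illustrated with a minimal sketch. It is not the authors' implementation: the probability table `SEGMENT_READINGS`, the voicing table `VOICING`, the rule probability `P_VOICE`, and the function `generate_readings` are all hypothetical names with made-up toy values. The sketch assumes each grapheme segment contributes its reading independently (the naive assumption) and that a single alternation rule, voicing the first consonant of a non-initial segment, fires with a fixed probability.

```python
from itertools import product

# Hypothetical toy values for P(reading | segment); not taken from the paper.
SEGMENT_READINGS = {
    "発": {"hatsu": 0.7, "hotsu": 0.3},
    "表": {"hyou": 0.6, "omote": 0.4},
}

# Hypothetical alternation rule: voice the initial consonant of a
# non-initial segment with probability P_VOICE (an assumed value).
VOICING = {"k": "g", "s": "z", "t": "d", "h": "b"}
P_VOICE = 0.2


def generate_readings(word):
    """Return candidate readings for `word`, ranked by score (highest first)."""
    per_segment = [SEGMENT_READINGS[ch].items() for ch in word]
    scores = {}
    for combo in product(*per_segment):
        # Each partial candidate is a (reading so far, score so far) pair.
        partial = [("", 1.0)]
        for i, (reading, p) in enumerate(combo):
            options = [(reading, p)]
            if i > 0 and reading[0] in VOICING:
                voiced = VOICING[reading[0]] + reading[1:]
                options = [(reading, p * (1 - P_VOICE)), (voiced, p * P_VOICE)]
            partial = [
                (prefix + r, score * q)
                for prefix, score in partial
                for r, q in options
            ]
        for candidate, score in partial:
            scores[candidate] = scores.get(candidate, 0.0) + score
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


ranked = generate_readings("発表")
for candidate, score in ranked:
    print(f"{candidate}\t{score:.3f}")
```

With these toy numbers the top candidate for 発表 is `hatsuhyou`, a plausible but incorrect guess (the dictionary reading is `happyou`). A look-up interface built on such scores could map that guess back to the correct entry. A fuller model would also need rules such as gemination and would fold in the corpus frequencies mentioned above, both of which this sketch omits.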
