The Construction of a Lexically Motivated Corpus | the Problem with Dening Lexical Units |
暂无分享,去创建一个
We are currently constructing a special language corpus, with particular emphasis on such lexically oriented studies and applications as terminology and information retrieval. In the paper, we focus on the problem with de ning the lexical units in running texts, which is essential for the construction of a lexically motivated corpus. Although we focus on Japanese in this paper, the problem discussed here may be relevant to any other language.
[1] Éric Gaussier,et al. Towards Automatic Extraction of Monolingual and Bilingual Terminology , 1994, COLING.
[2] Chantal Enguehard,et al. Automatic Natural Acquisition of a Terminology , 1995, J. Quant. Linguistics.
[3] Sophia Ananiadou,et al. A Methodology for Automatic Term Recognition , 1994, COLING.
[4] Slava M. Katz,et al. Technical terminology: some linguistic properties and an algorithm for identification in text , 1995, Natural Language Engineering.