论文信息 - Adapting International Standard for Asian Language Technologies

Adapting International Standard for Asian Language Technologies

Corpus-based approaches and statistical approaches have been the main stream of natural language processing research for the past two decades. Language resources play a key role in such approaches, but there is an insufficient amount of language resources in many Asian languages. In this situation, standardisation of language resources would be of great help in developing resources in new languages. This paper presents the latest development efforts of our project which aims at creating a common standard for Asian language resources that is compatible with an international standard. In particular, the paper focuses on i) lexical specification and data categories relevant for building multilingual lexical resources for Asian languages; ii) a core upper-layer ontology needed for ensuring multilingual interoperability and iii) the evaluation platform used to test the entire architectural framework.

[1] Adam Pease,et al. Towards a standard upper ontology , 2001, FOIS.

[2] M. Swadesh. Lexico-Statistical Dating of Prehistoric Ethnic Contacts , 1952 .

[3] Jia-Fei Hong,et al. Towards a Conceptual Core for Multicultural Processing: A Multilingual Ontology Based on the Swadesh List , 2007, IWIC.

[4] Chu-Ren Huang,et al. Infrastructure for Standardization of Asian Language Resources , 2006, ACL.

[5] Chu-Ren Huang,et al. Asian language processing: current state-of-the-art , 2006, Lang. Resour. Evaluation.

[6] Chu-Ren Huang,et al. Constructing Taxonomy of Numerative Classifiers for Asian Languages , 2008, IJCNLP.

[7] Claudia Soria,et al. Lexical Markup Framework (LMF) , 2006, LREC.

[8] Nicoletta Calzolari,et al. SIMPLE: A General Framework for the Development of Multilingual Lexicons , 2000, LREC.

[9] Laurent Prevot,et al. Basic lexicon and shared ontology for multilingual resources: A sumo+ milo hybrid approach , 2007 .