论文信息 - Automatic Alignment of Persian and English Lexical Resources: A Structural-Linguistic Approach

Automatic Alignment of Persian and English Lexical Resources: A Structural-Linguistic Approach

Cross-lingual mapping of linguistic resources such as corpora, ontologies, lexicons and thesauri is very important for developing cross-lingual (CL) applications such as machine translation, CL information retrieval and question answering. Developing mapping techniques for lexical ontologies of different languages is not only important for inter-lingual tasks but also can be implied to build lexical ontologies for a new language based on existing ones. In this paper we propose a two-phase approach for mapping a Persian lexical resource to Princeton's WordNet. In the first phase, Persian words are mapped to WordNet synsets using some heuristic improved linguistic approaches. In the second phase, the previous mappings are evaluated (accepted or rejected) according to the structural similarities of WordNet and Persian thesaurus. Although we applied it to Persian, our proposed approach, SBU methodology is language independent. As there is no lexical ontology for Persian, our approach helps in building one for this language too.

Mehrnoush Shamsfard | Rahim Dehkharghani

[1] Daniel M. Bikel,et al. Automatic WordNet Mapping Using Word Sense Disambiguation , 2000, EMNLP.

[2] Mehrnoush Shamsfard,et al. Towards Semi Automatic Construction of a Lexical Ontology for Persian , 2008, LREC.

[3] Lluís Padró,et al. Mapping WordNets Using Structural Information , 2000, ACL.

[4] Horacio Rodríguez,et al. Combining Multiple Methods for the Automatic Construction of Multilingual WordNets , 1997, ArXiv.

[5] Horacio Rodríguez,et al. Selecting the Correct English Synset for a Spanish Sense , 2004, LREC.

[6] Márton Miháltz,et al. Results and Evaluation of Hungarian Nominal WordNet v 1 . 0 , 2004 .

[7] Roy Rada,et al. Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[8] Karina Gibert,et al. Towards Binding Spanish Senses to Wordnet Senses through Taxonomy Alignment , 2004 .

[9] Lluís Padró,et al. Mapping Multilingual Hierarchies Using Relaxation Labeling , 1999, EMNLP.

[10] Pushpak Bhattacharyya,et al. Mapping and Structural Analysis of Multi-lingual Wordnets , 2007, IEEE Data Eng. Bull..

[11] Karina Gibert,et al. Semiautomatic Creation of Taxonomies , 2002, COLING 2002.