Paper information - Aplicación de transductores de estado-finito a los procesos de unificación de términos (Application of finite-state transducers to the unification of term variants)

A finite-state approach has been applied to the unification of term variants in Spanish. Conflation algorithms are computational procedures used in some Information Retrieval (IR) systems to reduce semantically equivalent term variants to a normalized form. The programs that usually carry out this process are called stemmers and lemmatizers. The objective of this work is to evaluate the deficiencies and errors of lemmatizers in the conflation of terms. The lemmatizer was built by implementing a linguistic tool that allows electronic dictionaries to be constructed and represented internally as Finite-State Transducers (FST). The lexical resources developed were applied to a verification corpus to evaluate the performance of these lexical parsers. The evaluation metric was an adaptation of the coverage and precision measures. The results show that the main limitation of unifying term variants with finite-state technology is under-analysis.
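
As a rough illustration of the idea summarized in the abstract, the sketch below stores a tiny dictionary as a trie-shaped transducer that reads a surface form and emits its lemma, then scores it against a small hand-labelled list. It is not the author's tool. The class name `LexiconTransducer`, the function `evaluate`, the sample Spanish entries, and the exact definitions of coverage (analysed forms / all forms) and precision (correct lemmas / analysed forms) are assumptions made for this example; the paper's own adaptation of these measures is not specified here.

```python
from typing import Dict, List, Optional, Tuple


class LexiconTransducer:
    """Toy deterministic transducer stored as a trie (hypothetical helper).

    It consumes a surface form character by character and emits the lemma
    attached to the accepting state it ends in.
    """

    def __init__(self) -> None:
        # transitions[state] maps an input character to the next state id.
        self.transitions: List[Dict[str, int]] = [{}]
        # Lemma emitted at each accepting state.
        self.outputs: Dict[int, str] = {}

    def add(self, surface: str, lemma: str) -> None:
        state = 0
        for ch in surface:
            nxt = self.transitions[state].get(ch)
            if nxt is None:
                nxt = len(self.transitions)
                self.transitions.append({})
                self.transitions[state][ch] = nxt
            state = nxt
        self.outputs[state] = lemma

    def lookup(self, surface: str) -> Optional[str]:
        state = 0
        for ch in surface:
            state = self.transitions[state].get(ch, -1)
            if state == -1:
                return None  # no path: the form is left unanalysed
        return self.outputs.get(state)


def evaluate(fst: LexiconTransducer,
             gold: List[Tuple[str, str]]) -> Tuple[float, float]:
    """Return (coverage, precision) under the assumed definitions above."""
    analysed = 0
    correct = 0
    for surface, lemma in gold:
        result = fst.lookup(surface)
        if result is not None:
            analysed += 1
            if result == lemma:
                correct += 1
    coverage = analysed / len(gold) if gold else 0.0
    precision = correct / analysed if analysed else 0.0
    return coverage, precision


# Illustrative dictionary entries (surface form -> lemma).
fst = LexiconTransducer()
for surface, lemma in [("casa", "casa"), ("casas", "casa"),
                       ("cantaba", "cantar"), ("canto", "cantar")]:
    fst.add(surface, lemma)

# Illustrative verification list: "canto" is used here as a noun, and
# "cantábamos" is missing from the dictionary.
gold = [("casas", "casa"), ("cantaba", "cantar"),
        ("canto", "canto"), ("cantábamos", "cantar")]

coverage, precision = evaluate(fst, gold)
print(f"coverage={coverage:.2f} precision={precision:.2f}")
```

In this toy run, one form has no path through the transducer and one form receives the wrong lemma, so the two measures separate the two kinds of error: missing analyses lower coverage, while wrong analyses lower precision.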

Carmen Galvez
