论文信息 - INAOE_UPV-CORE: Extracting Word Associations from Document Corpora to estimate Semantic Textual Similarity

INAOE_UPV-CORE: Extracting Word Associations from Document Corpora to estimate Semantic Textual Similarity

This paper presents three methods to evaluate the Semantic Textual Similarity (STS). The first two methods do not require labeled training data; instead, they automatically extract semantic knowledge in the form of word associations from a given reference corpus. Two kinds of word associations are considered: cooccurrence statistics and the similarity of word contexts. The third method was done in collaboration with groups from the Universities of Paris 13, Matanzas and Alicante. It uses several word similarity measures as features in order to construct an accurate prediction model for the STS.

Paolo Rosso | Manuel Montes-y-Gómez | Luis Villaseñor Pineda | Fernando Sánchez-Vega

[1] Peter D. Turney. Measuring Semantic Similarity by Latent Relational Analysis , 2005, IJCAI.

[2] Davide Buscaldi,et al. IRIT: Textual Similarity Combining Conceptual Similarity with an N-Gram Comparison Method , 2012, SemEval@NAACL-HLT.

[3] Paolo Rosso,et al. Clustering Abstracts of Scientific Texts Using the Transition Point Technique , 2006, CICLing.

[4] Euripides G. M. Petrakis,et al. Information Retrieval by Semantic Similarity , 2006, Int. J. Semantic Web Inf. Syst..

[5] Eneko Agirre,et al. SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity , 2012, *SEMEVAL.

[6] Rada Mihalcea,et al. UNT: A Supervised Synergistic Approach to Semantic Text Similarity , 2012, *SEMEVAL.

[7] Peter D. Turney. Similarity of Semantic Relations , 2006, CL.

[8] Philip Resnik,et al. Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..