Paper Information - Sbdlrhmn: A Rule-based Human Interpretation System for Semantic Textual Similarity Task

In this paper, we describe the system architecture we used in the Semantic Textual Similarity (STS) Task 6 pilot challenge. The goal of this challenge is to accurately identify five levels of semantic similarity between two sentences: equivalent, mostly equivalent, roughly equivalent, not equivalent but sharing the same topic, and not equivalent. We submitted two systems. The first system (rule-based) combines semantic and syntactic features to arrive at an overall similarity score. The proposed rules enable the system to adequately handle the domain knowledge gaps that are inherent when working with knowledge resources. One of the system's main goals is to suggest a set of domain-free rules that help a human annotator score the semantic equivalence of two sentences. The second system is our baseline, which uses the cosine similarity between the words in each sentence pair.
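
To make the baseline concrete, here is a minimal sketch of a word-level cosine similarity between two sentences. The abstract does not say how the words were tokenized or weighted, so the lowercasing, whitespace tokenization, and raw term counts below are assumptions, and the function name `cosine_similarity` is hypothetical rather than taken from the paper.

```python
import math
from collections import Counter


def cosine_similarity(sentence_a: str, sentence_b: str) -> float:
    """Cosine similarity between bag-of-words count vectors of two sentences.

    Assumptions (not stated in the abstract): lowercase the text,
    split on whitespace, and weight each word by its raw count.
    """
    counts_a = Counter(sentence_a.lower().split())
    counts_b = Counter(sentence_b.lower().split())

    # Dot product only needs the words that occur in both sentences.
    shared_words = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[word] * counts_b[word] for word in shared_words)

    # Euclidean norms of the two count vectors.
    norm_a = math.sqrt(sum(count * count for count in counts_a.values()))
    norm_b = math.sqrt(sum(count * count for count in counts_b.values()))

    # An empty sentence has no direction; treat it as sharing nothing.
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    pair = ("A man is playing a guitar", "A man plays the guitar")
    print(cosine_similarity(*pair))
```

The score ranges from 0 (no shared words) to 1 (identical word counts). How such a score would be mapped onto the task's five similarity levels is not described in the abstract, so it is left out of this sketch.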

Catherine Blake | Samir AbdelRahman
