Merging Structural and Taxonomic Similarity for Text Retrieval Using Relational Descriptions

Information retrieval effectiveness has become a crucial issue with the enormous growth of available digital documents and the spread of Digital Libraries. Search and retrieval are mostly carried out on the textual content of documents, and traditionally only at the lexical level. However, pure term-based queries are very limited because most of the information in natural language is carried by the syntactic and logic structure of sentences. To take into account such a structure, powerful relational languages, such as first-order logic, must be exploited. However, logic formulae constituents are typically uninterpreted (they are considered as purely syntactic entities), whereas words in natural language express underlying concepts that involve several implicit relationships, as those expressed in a taxonomy. This problem can be tackled by providing the logic interpreter with suitable taxonomic knowledge.

[1]  J. W. Lloyd,et al.  Foundations of logic programming; (2nd extended ed.) , 1987 .

[2]  Fabio Massimo Zanzotto,et al.  Learning shallow semantic rules for textual entailment , 2007 .

[3]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[4]  Enrico Motta,et al.  An Ontology-Driven Similarity Algorithm , 2004 .

[5]  Graeme Hirst,et al.  Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures , 2004 .

[6]  Peter Clark,et al.  Recognizing Textual Entailment with Logical Inference , 2008, TAC.

[7]  Nicola Fanizzi,et al.  Learning Logic Models for Automated Text Categorization , 2001, AI*IA.

[8]  Stefano Ferilli,et al.  A General Similarity Framework for Horn Clause Logic , 2009, Fundam. Informaticae.

[9]  Nancy Ide,et al.  Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[10]  Floriana Esposito AI*IA 2001: Advances in Artificial Intelligence , 2001, Lecture Notes in Computer Science.

[11]  Eugene Agichtein,et al.  Combining Lexical, Syntactic, and Semantic Evidence for Textual Entailment Classification , 2008, TAC.

[12]  John Wylie Lloyd,et al.  Foundations of Logic Programming , 1987, Symbolic Computation.

[13]  Letizia Tanca,et al.  Logic Programming and Databases , 1990, Surveys in Computer Science.

[14]  Céline Rouveirol,et al.  Extensions of Inversion of Resolution Applied to Theory Completion , 1992 .

[15]  Nicola Fanizzi,et al.  A Generalization Model Based on OI-implication for Ideal Theory Refinement , 2001, Fundam. Informaticae.

[16]  Pat Langley,et al.  Editorial: On Machine Learning , 1986, Machine Learning.

[17]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.