The Use of Ontology for Semantic Representation of Documents

This paper deals with the use of ontologies for Information Retrieval. Roughly, the proposed approach consists of identifying important concepts in documents using two criteria, co-occurrence and semantic relatedness, and then disambiguating them via an external general-purpose ontology, namely WordNet. Matching the ontology against a document results in a set of scored concept-senses (nodes) with weighted links. This representation, called the semantic core of a document, best reveals the semantic content of the document. We regard our approach, whose first evaluation results are encouraging, as a short but strong step toward the long-term goal of Intelligent Indexing and Semantic Retrieval.
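To make the idea of a semantic core concrete, here is a minimal sketch of the disambiguation and scoring step only. It is not the paper's implementation: the tiny sense inventory `SENSES`, the relatedness values in `RELATEDNESS`, and the function `semantic_core` are all hypothetical stand-ins for WordNet and for whatever relatedness measure is actually used, and the exhaustive search over sense combinations is an assumption chosen for brevity. Concept selection by co-occurrence is not shown.

```python
from itertools import combinations, product

# Hypothetical toy sense inventory standing in for WordNet:
# each term maps to its candidate senses.
SENSES = {
    "bank": ["bank#finance", "bank#river"],
    "loan": ["loan#finance"],
    "interest": ["interest#finance", "interest#curiosity"],
}

# Hypothetical symmetric relatedness values between senses (made up, in [0, 1]).
RELATEDNESS = {
    frozenset(["bank#finance", "loan#finance"]): 0.9,
    frozenset(["bank#finance", "interest#finance"]): 0.8,
    frozenset(["loan#finance", "interest#finance"]): 0.85,
    frozenset(["bank#river", "loan#finance"]): 0.1,
    frozenset(["bank#river", "interest#finance"]): 0.1,
    frozenset(["bank#finance", "interest#curiosity"]): 0.2,
    frozenset(["bank#river", "interest#curiosity"]): 0.15,
    frozenset(["loan#finance", "interest#curiosity"]): 0.2,
}


def relatedness(a, b):
    """Look up the relatedness of two senses; unknown pairs count as 0."""
    return RELATEDNESS.get(frozenset([a, b]), 0.0)


def semantic_core(terms):
    """Pick one sense per term so that total pairwise relatedness is maximal.

    Returns (scores, links): links maps each pair of selected senses to its
    weight, and scores maps each selected sense to the sum of its link weights.
    """
    candidates = [SENSES[t] for t in terms]
    # Exhaustive search over all sense combinations (fine for a toy example).
    best = max(
        product(*candidates),
        key=lambda combo: sum(
            relatedness(a, b) for a, b in combinations(combo, 2)
        ),
    )
    links = {(a, b): relatedness(a, b) for a, b in combinations(best, 2)}
    scores = {
        s: sum(w for pair, w in links.items() if s in pair) for s in best
    }
    return scores, links


scores, links = semantic_core(["bank", "loan", "interest"])
for sense, score in scores.items():
    print(sense, round(score, 2))
```

With these made-up values, the financial senses of all three terms are selected, because they are the mutually most related combination. The selected senses are the nodes of the core and the pairwise relatedness values are the weighted links.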
