Methodology for transforming a thesaurus into a domain ontology

Information retrieval techniques rely on terms that are automatically extracted from documents; these terms are then used to provide access to information. In this paper we propose an approach that semantically enriches this extraction by adding knowledge from thesauri. More specifically, the methodology we promote aims to transform a thesaurus into a domain ontology, which is then used to index documents semantically (the index entries are concepts rather than terms). We also propose techniques that implement this transformation and present an evaluation in the field of astronomy.
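To make the idea of concept-based indexing concrete, the following is a minimal sketch and not the method described in the paper. It assumes a thesaurus given as (term, relation, target) triples using the conventional UF ("used for", a synonym) and BT ("broader term") relations. The `Concept` class, the function names, and the astronomy entries are all hypothetical, chosen only for illustration.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Concept:
    """A hypothetical concept: one preferred label, its synonyms, its parents."""
    label: str
    synonyms: set = field(default_factory=set)
    parents: set = field(default_factory=set)


def thesaurus_to_concepts(entries):
    """Group thesaurus triples into concepts (assumed input format).

    UF targets become synonyms of the concept; BT targets become parent
    concepts, which is a simplification: a real transformation would need
    to decide whether each BT link is truly an is-a relation.
    """
    concepts = {}
    for term, relation, target in entries:
        concept = concepts.setdefault(term, Concept(term))
        if relation == "UF":
            concept.synonyms.add(target)
        elif relation == "BT":
            concepts.setdefault(target, Concept(target))
            concept.parents.add(target)
    return concepts


def index_document(text, concepts):
    """Return the labels of concepts whose label or synonym occurs in the text.

    Matching is a naive whole-word, case-insensitive search.
    """
    lowered = text.lower()
    found = set()
    for concept in concepts.values():
        for label in {concept.label} | concept.synonyms:
            pattern = r"\b" + re.escape(label.lower()) + r"\b"
            if re.search(pattern, lowered):
                found.add(concept.label)
                break
    return found


# Hypothetical astronomy entries, for illustration only.
entries = [
    ("star", "BT", "celestial body"),
    ("supernova", "BT", "star"),
    ("supernova", "UF", "exploding star"),
]
concepts = thesaurus_to_concepts(entries)
doc_index = index_document("An exploding star was observed.", concepts)
```

Here the document is indexed by the concepts `supernova` and `star` even though the word "supernova" never appears in the text, which is the practical difference between indexing by concepts and indexing by terms.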
