Extraction de sous-ontologies autonomes par fermeture des opérateurs hyponymie et hyperonymie

Facing the exponentially increasing amount of data, new challenges consist in retrieving the good information at the right time, organizing and filtering data, visualizing them and using them in a specific decision context. For a decade, ontologies have been successfully used as semantic guides for these tasks. Nevertheless the size of ontologies that are shared and accepted as standards in a given domain may rapidly grow beyond the human capacity to grasp information. Dealing with large ontologies proved to be problematic for applications that require user interactions such as ontology-based document annotation or ontology modification/evolution. This problem can be partially overcome by providing the user with a sub-ontology focused on his/her task. Focusing on "is-a" relationships, an ontology can be represented as a direct acyclic graph. Given a reference-ontology and a set of user's concepts of interest, this paper proposes a formal definition of relevant concepts. This set of relevant concepts is made of user's concepts of interest plus some of their hyponyms and hyperonyms. Those extra concepts are added for enlighten user's concepts relationships in the resulting sub-ontology to make it self-explanatory. The set of those relevant concepts is defined using the closure of classical graph operators i.e.: least common ancestor (lca) and greatest common descendant (gcd). Efficient algorithms are also provided to identify relevant concepts and to extract the corresponding self-explanatory sub-ontology. The resulting program, called OntoFocus, may be freely used at the address http://www.ontotoolkit.mines-ales.fr/.

[1]  Tharam S. Dillon,et al.  Ontology extraction using views for semantic web , 2006 .

[2]  David A. Bader,et al.  Graph Algorithms , 2011, Encyclopedia of Parallel Computing.

[3]  Joakim Lundeberg,et al.  Effect of infliximab on mRNA expression profiles in synovial tissue of rheumatoid arthritis patients , 2006, Arthritis research & therapy.

[4]  Ian Horrocks,et al.  Modular Reuse of Ontologies: Theory and Practice , 2008, J. Artif. Intell. Res..

[5]  Andrzej Lingas,et al.  Faster algorithms for finding lowest common ancestors in directed acyclic graphs , 2007, Theor. Comput. Sci..

[6]  David Taniar,et al.  A Service Oriented Architecture for Extracting and Extending Sub-Ontologies in the Semantic Grid , 2007, 21st International Conference on Advanced Information Networking and Applications (AINA '07).

[7]  Dennis McLeod,et al.  Retrieval effectiveness of an ontology-based model for information selection , 2004, The VLDB Journal.

[8]  Rafael Berlanga Llavori,et al.  Safe and Economic Re-Use of Ontologies: A Logic-Based Methodology and Tool Support , 2008, OWLED.

[9]  Sylvie Ranwez,et al.  Ontological Distance Measures for Information Visualisation on Conceptual Maps , 2006, OTM Workshops.

[10]  Patrick Lincoln,et al.  Efficient implementation of lattice operations , 1989, TOPL.

[11]  Johannes Nowak,et al.  Fast Lowest Common Ancestor Computations in Dags , 2007, ESA.

[12]  Johann-Christoph Freytag,et al.  Ontology Based Query Processing in Database Management Systems , 2003, OTM.

[13]  Marta Sabou,et al.  Learning web service ontologies: an automatic extraction method and its evaluation , 2005 .

[14]  David Taniar,et al.  A distributed approach to sub-ontology extraction , 2004, 18th International Conference on Advanced Information Networking and Applications, 2004. AINA 2004..

[15]  Thomas Lengauer,et al.  Improved scoring of functional groups from gene expression data by decorrelating GO graph structure , 2006, Bioinform..

[16]  E. Sonnhammer,et al.  OrthoDisease: A database of human disease orthologs , 2004, Human mutation.

[17]  R. Navigli,et al.  Automatically extending, pruning and trimming general purpose ontologies , 2002, IEEE International Conference on Systems, Man and Cybernetics.

[18]  J. Wenny Rahayu,et al.  OntoMove: A Knowledge Based Framework for Semantic Requirement Profiling and Resource Acquisition , 2007, 2007 Australian Software Engineering Conference (ASWEC'07).

[19]  Steven Skiena,et al.  Finding least common ancestors in directed acyclic graphs , 2001, SODA '01.

[20]  Martin Vingron,et al.  Improved detection of overrepresentation of Gene-Ontology annotations with parent-child analysis , 2007, Bioinform..

[21]  Jennifer Widom,et al.  Exploiting hierarchical domain structure to compute similarity , 2003, TOIS.

[22]  Steffen Staab,et al.  What Is an Ontology? , 2009, Handbook on Ontologies.

[23]  Akrivi Katifori,et al.  Ontology visualization methods—a survey , 2007, CSUR.

[24]  J. Wenny Rahayu,et al.  Large scale ontology visualisation using ontology extraction , 2005, Int. J. Web Grid Serv..

[25]  Andreas Abecker,et al.  Ontologies for Knowledge Management , 2004, Handbook on Ontologies.

[26]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[27]  LepourasGeorge,et al.  Ontology visualization methodsa survey , 2007 .

[28]  Martin Vingron,et al.  Ontologizer 2.0 - a multifunctional tool for GO term enrichment analysis and data exploration , 2008, Bioinform..

[29]  Jordi Conesa,et al.  Pruning Bio-Ontologies , 2007, 2007 40th Annual Hawaii International Conference on System Sciences (HICSS'07).

[30]  Alfonso Valencia,et al.  Defining functional distances over Gene Ontology , 2008, BMC Bioinformatics.

[31]  Alan L. Rector,et al.  Web ontology segmentation: analysis, classification and use , 2006, WWW '06.