i2dee: An Integrated and Interactive Data Exploration Environment Used for Ontology Design

Many communities need to organize and structure data to improve its use and sharing, and much research has focused on this problem. Many solutions are based on a Terminological and Ontological Resource (TOR), which represents the domain knowledge for a given application. However, TORs are often designed without taking into account heterogeneous data from specific sources. In the biomedical domain, for example, these sources may be medical reports, bibliographical resources, or biological data extracted from GOA, Gene Ontology, or KEGG. This paper presents an integrated visual environment for knowledge engineering that integrates heterogeneous data from domain databases. Relevant concepts and relations are extracted from these data resources using several analysis and processing steps. The resulting ontology embryo is visualized through a user-friendly adaptive interface that displays a knowledge map. The experiments and evaluations reported in this paper concern biological data.
