Precision and Recall for Ontology Enrichment

Ontology enrichment algorithms propose new concepts to given concepts in a domain specific ontology. The paper is dedicated to the quality of ontology enrichment algorithms in terms of precision and recall. We remain open for the proposition of several new concepts to a given one instead of exactly one to a given one. Our first contribution is the generalization of known quality measures concerning recall for robust enrichment algorithms. In order to achieve independence from user evaluations and only rely on ontologies, text corpora and the output of the algorithms, there will be no appropriate explanation of precision in automatic evaluation but hints on precision. Our second contribution is an explanation of these hints on precision in corpus-based ontology enrichment. Finally we show, which measures allow a straightforward computation.

[1]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[2]  David McLean,et al.  An Approach for Measuring Semantic Similarity between Words Using Multiple Information Sources , 2003, IEEE Trans. Knowl. Data Eng..

[3]  David Yarowsky,et al.  Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora , 2010, COLING.

[4]  Ramez Elmasri,et al.  Fundamentals of Database Systems, 2nd Edition , 1994 .

[5]  Ralf Steinmetz,et al.  Ontology enrichment with texts from the WWW , 2002 .

[6]  Gilles Bisson,et al.  Designing Clustering Methods for Ontology Building - The Mo'K Workbench , 2000, ECAI Workshop on Ontology Learning.

[7]  Nicola Guarino,et al.  Evaluating ontological decisions with OntoClean , 2002, CACM.

[8]  Claire Nedellec,et al.  Corpus-Based Learning of Semantic Relations by the ILP System, Asium , 2001, Learning Language in Logic.

[9]  Gregory Grefenstette,et al.  Use of syntactic context to produce term association lists for text retrieval , 1992, SIGIR '92.

[10]  Steffen Staab,et al.  Ontology Learning Part One - On Discoverying Taxonomic Relations from the Web , 2002 .

[11]  Jennifer Widom,et al.  Exploiting hierarchical domain structure to compute similarity , 2003, TOIS.

[12]  Laurent Romary,et al.  Vulcain - An Ontology-Based Information Extraction System , 2002, NLDB.

[13]  Ian Horrocks,et al.  Using an Expressive Description Logic: FaCT or Fiction? , 1998, KR.

[14]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[15]  K. D. Joshi,et al.  A collaborative approach to ontology design , 2002, CACM.

[16]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[17]  Gerd Stumme,et al.  FCA-MERGE: Bottom-Up Merging of Ontologies , 2001, IJCAI.

[18]  Uwe Quasthoff Tools for automatic lexicon maintenance: acquisition, error correction, and the generation of missing values , 1998 .

[19]  George A. Miller,et al.  WordNet: A Lexical Database for the English Language , 2002 .

[20]  Sayan Mukherjee,et al.  Choosing Multiple Parameters for Support Vector Machines , 2002, Machine Learning.

[21]  Lillian Lee,et al.  Measures of Distributional Similarity , 1999, ACL.