A Software Processing Chain for Evaluating Thesaurus Quality

Thesauri are knowledge models commonly used for information classification and retrieval whose structure is defined by standards that describe the main features the concepts and relations must have. However, following these standards requires a deep knowledge of the field the thesaurus is going to cover and experience in their creation. To help in this task, this paper describes a software processing chain that provides different validation components that evaluates the quality of the main thesaurus features.

[1]  Organización Internacional de Normalización ISO 25964-1 : Information and documentation -- Thesauri and interoperability with other vocabularies -- Part 1: Thesauri for information retrieval , 2011 .

[2]  Nicola Guarino,et al.  Sweetening WORDNET with DOLCE , 2003, AI Mag..

[3]  María Pinto A user view of the factors affecting quality of thesauri in social science databases , 2008 .

[4]  Antoine Isaac,et al.  SKOS Simple Knowledge Organization System Primer , 2009 .

[5]  Guus Schreiber,et al.  Thesaurus-Based Search in Large Heterogeneous Collections , 2008, SEMWEB.

[6]  Kalina Bontcheva,et al.  GATE: an Architecture for Development of Robust HLT applications , 2002, ACL.

[7]  Kai Eckert,et al.  Usage-driven maintenance of knowledge organization systems , 2012 .

[8]  Osma Suominen,et al.  Assessing and Improving the Quality of SKOS Vocabularies , 2014, Journal on Data Semantics.

[9]  Bernhard Haslhofer,et al.  Perception and relevance of quality issues in web vocabularies , 2013, I-SEMANTICS '13.

[10]  Padmini Srinivasan,et al.  Thesaurus Construction , 1992, Information Retrieval: Data Structures & Algorithms.

[11]  Simon K. Milton,et al.  Towards Quality Measures for Evaluating Thesauri , 2010, MTSR.

[12]  Javier Nogueras-Iso,et al.  An automatic method for reporting the quality of thesauri , 2016, Data Knowl. Eng..

[13]  Jacques Savoy Report on CLEF-2001 Experiments , 2001, CLEF.

[14]  Robert E. Tarjan,et al.  Depth-First Search and Linear Graph Algorithms , 1972, SIAM J. Comput..

[15]  Widad Mustafa El Hadi,et al.  Structures and relations in knowledge organization : proceedings of the fifth International ISKO Conference, 25-29 August 1998, Lille, France , 1998 .

[16]  Sean Bechhofer,et al.  SKOS Simple Knowledge Organization System Reference , 2009 .

[17]  Dagobert Soergel,et al.  Indexing languages and thesauri : construction and maintenance , 1974 .

[18]  Asunción Gómez-Pérez,et al.  Validating Ontologies with OOPS! , 2012, EKAW.

[19]  David Bawden,et al.  Thesaurus Construction and Use: A Practical Manual , 2000 .