Thesaurus maintenance, alignment and publication as linked data: the AGROVOC use case

The AGROVOC multilingual thesaurus maintained by the Food and Agriculture Organisation (FAO) of the United Nations is now published as linked data. In order to reach this goal AGROVOC was expressed in Simple Knowledge Organisation System (SKOS) and its concepts provided with dereferenceable URIs. AGROVOC is now aligned with ten other multilingual Knowledge Organisation Systems (KOS) related to agriculture, using the SKOS properties exact match and close match. Alignments were automatically produced in Eclipse using a custom-designed tool and then validated by a domain expert. The resulting data is publicly available to both humans and machines using a SPARQL endpoint together with a modified version of Pubby, a lightweight front-end tool for publishing linked data. This paper describes the process that led to the current linked data AGROVOC and discusses current and future applications and directions. This paper extends a shorter version presented at MTSR 2011.

[1]  Jérôme Euzenat,et al.  An API for Ontology Alignment , 2004, SEMWEB.

[2]  Diego Calvanese,et al.  The Description Logic Handbook , 2007 .

[3]  Brian McBride,et al.  Jena: Implementing the RDF Model and Syntax Specification , 2001, SemWeb.

[4]  Henrik Eriksson,et al.  The evolution of Protégé: an environment for knowledge-based systems development , 2003, Int. J. Hum. Comput. Stud..

[5]  Robert Stevens,et al.  SKOS with OWL: Don't be Full-ish! , 2008, OWLED.

[6]  Véronique Malaisé,et al.  A Method to Convert Thesauri to SKOS , 2006, ESWC.

[7]  York Sure,et al.  Converting the TheSoz to SKOS , 2009 .

[8]  Johannes Keizer,et al.  Requirements for the Treatment of Multilinguality in Ontologies within FAO , 2007, OWLED.

[9]  Frehiwot Fisseha,et al.  Reengineering Thesauri for New Applications: The AGROVOC Example , 2006, J. Digit. Inf..

[10]  Joachim Neubert,et al.  Bringing the "Thesaurus for Economics" on to the Web of Linked Data , 2009, LDOW.

[11]  Johannes Keizer,et al.  Linked Data for Fighting Global Hunger: Experiences in setting standards for Agricultural Information Management , 2010, Linking Enterprise Data.

[12]  David Yarowsky,et al.  A method for disambiguating word senses in a large corpus , 1992, Comput. Humanit..

[13]  Diego Calvanese,et al.  The Description Logic Handbook: Theory, Implementation, and Applications , 2003, Description Logic Handbook.

[14]  Holger Knublauch,et al.  The Protégé OWL Plugin: An Open Development Environment for Semantic Web Applications , 2004, SEMWEB.

[15]  Frank van Harmelen,et al.  Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema , 2002, SEMWEB.

[16]  Johannes Keizer,et al.  Thesaurus Alignment for Linked Data Publishing , 2011, Dublin Core Conference.

[17]  Raphael Volz,et al.  Cooking the Semantic Web with the OWL API , 2003, SEMWEB.

[18]  Pradeep Ravikumar,et al.  A Comparison of String Distance Metrics for Name-Matching Tasks , 2003, IIWeb.