Interchanging lexical resources on the Semantic Web

Lexica and terminology databases play a vital role in many NLP applications, but currently most such resources are published in application-specific formats, or with custom access interfaces, leading to the problem that much of this data is in “data silos” and hence difficult to access. The Semantic Web and in particular the Linked Data initiative provide effective solutions to this problem, as well as possibilities for data reuse by inter-lexicon linking, and incorporation of data categories by dereferencable URIs. The Semantic Web focuses on the use of ontologies to describe semantics on the Web, but currently there is no standard for providing complex lexical information for such ontologies and for describing the relationship between the lexicon and the ontology. We present our model, lemon, which aims to address these gaps while building on existing work, in particular the Lexical Markup Framework, the ISOcat Data Category Registry, SKOS (Simple Knowledge Organization System) and the LexInfo and LIR ontology-lexicon models.

[1]  Michael Kifer,et al.  Rule Interchange Format: The Framework , 2008, RuleML.

[2]  Paul Buitelaar,et al.  Ontology and the Lexicon: Ontology-based semantic lexicons: mapping between terms and object descriptions , 2010 .

[3]  Adam Pease,et al.  Linking FrameNet to the Suggested Upper Merged Ontology , 2006, FOIS.

[4]  Ralph Grishman,et al.  Comlex Syntax: Building a Computational Lexicon , 1994, COLING.

[5]  Antoine Isaac,et al.  SKOS Use Cases and Requirements , 2009 .

[6]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[7]  Asunción Gómez-Pérez,et al.  Enriching ontologies with multilingual information† , 2008, Natural Language Engineering.

[8]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[9]  S. Farrar,et al.  Markup and the GOLD Ontology , 2003 .

[10]  Benoît Sagot,et al.  The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French , 2010, LREC.

[11]  Paul Buitelaar,et al.  Towards Linguistically Grounded Ontologies , 2009, ESWC.

[12]  Antoine Isaac SKOS (Simple Knowledge Organization System) , 2011 .

[13]  Philipp Cimiano,et al.  Linking Lexical Resources and Ontologies on the Semantic Web with Lemon , 2011, ESWC.

[14]  Aldo Gangemi,et al.  Conversion of WordNet to a standard RDF/OWL representation , 2006, LREC.

[15]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[16]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[17]  German Rigau,et al.  Book Reviews: EuroWordNet: A Multilingual Database with Lexical Semantic Networks , 1999, CL.

[18]  Adam Kilgarriff,et al.  "I Don’t Believe in Word Senses" , 1997, Comput. Humanit..

[19]  Jun'ichi Tsujii,et al.  GENIA corpus - a semantically annotated corpus for bio-textmining , 2003, ISMB.

[20]  Wendy Hall,et al.  The Semantic Web Revisited , 2006, IEEE Intelligent Systems.

[21]  Paul Buitelaar,et al.  LexInfo: A declarative model for the lexicon-ontology interface , 2011, J. Web Semant..

[22]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[23]  Claudia Soria,et al.  Lexical Markup Framework (LMF) , 2006, LREC.

[24]  Sean Bechhofer,et al.  SKOS Simple Knowledge Organization System Reference , 2009 .

[25]  Wim Peters,et al.  Extending the Inter-Lingual-Index with new concepts , 1999 .

[26]  Graeme Hirst Review of EuroWordNet: a multilingual database with lexical semantic networks by Piek Vossen. Kluwer Academic Publishers 1998. , 1999 .

[27]  Marc Kemps-Snijders,et al.  ISOcat: Corralling Data Categories in the Wild , 2008, LREC.

[28]  Laurent Romary,et al.  Standardization of the formal representation of lexical information for NLP , 2009, ArXiv.