Automatic Enrichment of WordNet with Common-Sense Knowledge

WordNet represents a cornerstone in the Computational Linguistics field, linking words to meanings (or senses) through a taxonomical representation of synsets, i.e., clusters of words with an equivalent meaning in a specific context often described by few definitions (or glosses) and examples. Most of the approaches to the Word Sense Disambiguation task fully rely on these short texts as a source of contextual information to match with the input text to disambiguate. This paper presents the first attempt to enrich synsets data with common-sense definitions, automatically retrieved from ConceptNet 5, and disambiguated accordingly to WordNet. The aim was to exploit the shared- and immediate-thinking nature of common-sense knowledge to extend the short but incredibly useful contextual information of the synsets. A manual evaluation on a subset of the entire result (which counts a total of almost 600K synset enrichments) shows a very high precision with an estimated good recall.

[1]  Peter Wiemer-Hastings,et al.  Latent semantic analysis , 2004, Annu. Rev. Inf. Sci. Technol..

[2]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[3]  Emanuele Pianta,et al.  Beyond Lexical Units: Enriching WordNets with Phrasets , 2003, EACL.

[4]  Simone Paolo Ponzetto,et al.  BabelNet: Building a Very Large Multilingual Semantic Network , 2010, ACL.

[5]  Roberto Navigli,et al.  Validating and Extending Semantic Knowledge Bases using Video Games with a Purpose , 2014, ACL.

[6]  Hsin-Hsi Chen,et al.  Combining WordNet and ConceptNet for Automatic Query Expansion: A Learning Approach , 2008, AIRS.

[7]  German Rigau,et al.  WordNet Enrichment with Classification Systems , 2007 .

[8]  Catherine Havasi,et al.  Representing General Relational Knowledge in ConceptNet 5 , 2012, LREC.

[9]  Adam Pease,et al.  The Suggested Upper Merged Ontology: A Large Ontology for the Semantic Web and its Applic ations , 2002 .

[10]  Eric Tsui,et al.  TaxoFolk: A hybrid taxonomy-folksonomy structure for knowledge classification and navigation , 2011, Expert Syst. Appl..

[11]  Krister Lindén,et al.  Using a Bilingual Resource to Add Synonyms to a Wordnet , 2012 .

[12]  Jamie Murphy,et al.  Towards a folk taxonomy of popular new media marketing terms , 2013 .

[13]  Andrea Bocco,et al.  ArchiWordNet: Integrating WordNet with Domain-Specific Knowledge , 2003 .

[14]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[15]  Luigi Di Caro,et al.  Navigating within news collections using tag-flakes , 2011, J. Vis. Lang. Comput..

[16]  Montse Cuadros,et al.  Exploring the Integration of WordNet and FrameNet , 2009 .

[17]  Junpeng Chen,et al.  Combining ConceptNet and WordNet for Word Sense Disambiguation , 2011, IJCNLP.

[18]  Olatz Ansa,et al.  Enriching very large ontologies using the WWW , 2000, ECAI Workshop on Ontology Learning.

[19]  Paola Velardi,et al.  Extending and Enriching WordNet with OntoLearn , 2004 .

[20]  Maria Ruiz-Casado,et al.  Automatising the learning of lexical patterns: An application to the enrichment of WordNet by extracting semantic relationships from Wikipedia , 2007, Data Knowl. Eng..

[21]  Adam Pease,et al.  Mapping WordNet to the SUMO Ontology , 2003 .