English WordNet 2020: Improving and Extending a WordNet for English using an Open-Source Methodology

WordNet, while one of the most widely used resources for NLP, has not been updated for a long time, and as such a new project English WordNet has arisen to continue the development of the model under an open-source paradigm. In this paper, we detail the second release of this resource entitled “English WordNet 2020”. The work has focused firstly, on the introduction of new synsets and senses and developing guidelines for this and secondly, on the integration of contributions from other projects. We present the changes in this edition, which total over 15,000 changes over the previous release.

[1]  Michael J. Denkowski,et al.  A Survey of Techniques for Unsupervised Word Sense Induction , 2009 .

[2]  John P. McCrae,et al.  English WordNet 2019 – An Open-Source WordNet for English , 2019, GWC.

[3]  Maciej Piasecki,et al.  Towards Emotive Annotation in plWordNet 4.0 , 2018, GWC.

[4]  Francis Bond,et al.  A Survey of WordNets and their Licenses , 2011 .

[5]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[6]  Francis Bond,et al.  Linking and Extending an Open Multilingual Wordnet , 2013, ACL.

[7]  John P. McCrae,et al.  CILI: the Collaborative Interlingual Index , 2016, GWC.

[8]  Gerard de Melo,et al.  OpenWordNet-PT: An Open Brazilian Wordnet for Reasoning , 2012, COLING.

[9]  Amanda Hicks,et al.  The Colloquial WordNet: Extending Princeton WordNet with Neologisms , 2017, LDK.

[10]  John P. McCrae,et al.  Toward a truly multilingual GlobalWordnet Grid , 2016, GWC.

[11]  Philipp Cimiano,et al.  Modelling the Semantics of Adjectives in the Ontology-Lexicon Interface , 2014, CogALex@COLING.

[12]  Gerhard Weikum,et al.  Towards a universal wordnet by learning from combined evidence , 2009, CIKM.

[13]  Suresh Manandhar,et al.  Word Sense Induction Using Graphs of Collocations , 2008, ECAI.

[14]  M. Piasecki,et al.  Implementation of the Verb Model in plWordNet 4.0 , 2018, GWC.

[15]  Timothy Baldwin,et al.  Multiword Expressions: A Pain in the Neck for NLP , 2002, CICLing.

[16]  Joakim Nivre,et al.  A Multiword Expression Data Set: Annotating Non-Compositionality and Conventionalization for English Noun Compounds , 2015, MWE@NAACL-HLT.

[17]  Daniel Jurafsky,et al.  Learning to Merge Word Senses , 2007, EMNLP.

[18]  Olga Babko-Malaya,et al.  Different Sense Granularities for Different Applications , 2004, HLT-NAACL 2004.

[19]  Ewa Rudnicka,et al.  Towards the Methodology for Extending Princeton WordNet , 2015 .

[20]  Fabricio Chalub,et al.  Extending Wordnet to Geological Times , 2018, GWC.

[21]  John P. McCrae Mapping WordNet Instances to Wikipedia , 2018, GWC.