Adding Pronunciation Information to Wordnets

We describe on-going work consisting in adding pronunciation information to wordnets, as such information can indicate specific senses of a word. Many wordnets associate with their senses only a lemma form and a part-of-speech tag. At the same time, we are aware that additional linguistic information can be useful for identifying a specific sense of a wordnet lemma when encountered in a corpus. While work already deals with the addition of grammatical number or grammatical gender information to wordnet lemmas,we are investigating the linking of wordnet lemmas to pronunciation information, adding thus a speech-related modality to wordnets

[1]  Roberto Navigli,et al.  Nasari: Integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities , 2016, Artif. Intell..

[2]  Claudia Soria,et al.  Lexical Markup Framework (LMF) , 2006, LREC.

[3]  Piek Vossen,et al.  Open Dutch WordNet , 2016, GWC.

[4]  John P. McCrae,et al.  CILI: the Collaborative Interlingual Index , 2016, GWC.

[5]  Christo Kirov,et al.  Very-large Scale Parsing and Normalization of Wiktionary Morphological Paradigms , 2016, LREC.

[6]  Thierry Declerck OntoLex as a possible Bridge between WordNets and full lexical Descriptions , 2019, GWC.

[7]  Francis Bond,et al.  Linking and Extending an Open Multilingual Wordnet , 2013, ACL.

[8]  Thierry Declerck,et al.  Enriching Open Multilingual Wordnets with Morphological Features , 2019, CLiC-it.

[9]  Asunción Gómez-Pérez,et al.  Interchanging lexical resources on the Semantic Web , 2012, Language Resources and Evaluation.

[10]  Christiane Fellbaum,et al.  Publishing and Linking WordNet using lemon and RDF , 2014 .

[11]  Eleni Metheniti,et al.  Wikinflection: Massive Semi-Supervised Generation of Multilingual Inflectional Corpus from Wiktionary , 2018 .

[12]  Eleni Metheniti,et al.  Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus , 2020, LREC.

[13]  Simone Paolo Ponzetto,et al.  BabelNet: Building a Very Large Multilingual Semantic Network , 2010, ACL.

[14]  Tanja Schultz,et al.  Wiktionary as a source for automatic pronunciation extraction , 2010, INTERSPEECH.

[15]  Thierry Declerck,et al.  Towards the Detection and Formal Representation of Semantic Shifts in Inflectional Morphology , 2019, LDK.

[16]  Jorge Gracia del Río,et al.  Validating the OntoLex-lemon Lexicography Module with K Dictionaries’ Multilingual Data , 2019 .

[17]  Philipp Cimiano,et al.  The OntoLex-Lemon Model: Development and Applications , 2017 .

[18]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[19]  Isa Maks,et al.  Integrating Lexical Units, Synsets and Ontology in the Cornetto Database , 2008, LREC.

[20]  Denis Jouvet,et al.  Building a Pronunciation Lexicon for a Speech Transcription System from Wiktionary Pronunciations only , 2011 .

[21]  Francis Bond,et al.  A Survey of WordNets and their Licenses , 2011 .