Tools for Building an Interlinked Synonym Lexicon Network

This paper presents the structure, features and design of CzEngClass, a new interlinked lexical resource of verbal synonyms, and of the editor tool being developed to support its construction. The lexicon captures cross-lingual (Czech and English) synonyms, using the valency behavior of synonymous verbs in relation to semantic roles as one of the criteria for defining such interlingual synonymy. The tool, called Synonym Class Editor (SynEd), is a user-friendly editor customized specifically for building and editing individual entries in the lexicon. It helps keep the cross-lingual synonym classes consistent and linked to internal as well as well-known external lexical resources. The structure of SynEd also allows the appropriate syntactic and semantic information to be stored and edited for each synonym class member. The editor can display examples of class members' usage in translational context in a parallel corpus. SynEd is platform independent and may be used for multiple languages. SynEd, CzEngClass and services based on them will be openly available.
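To make the kind of entry described above more concrete, the following is a minimal sketch of how a cross-lingual synonym class might be modeled. It is not the actual CzEngClass schema or SynEd code: the names `SynonymClass`, `ClassMember`, `role_mapping` and `external_links`, the role labels, the valency slot labels and the link identifiers are all assumptions made for illustration only.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ClassMember:
    """One verb in a synonym class (hypothetical structure)."""
    lemma: str
    lang: str  # "cs" or "en"
    # Maps a valency slot label of this verb to a semantic role of the class.
    # The slot labels used below are illustrative assumptions.
    role_mapping: Dict[str, str] = field(default_factory=dict)
    # Links to other lexical resources; identifiers here are placeholders.
    external_links: Dict[str, str] = field(default_factory=dict)


@dataclass
class SynonymClass:
    """A cross-lingual class whose members share one set of semantic roles."""
    class_id: str
    roles: List[str]
    members: List[ClassMember] = field(default_factory=list)

    def add_member(self, member: ClassMember) -> None:
        # Consistency check: a member may only map its valency slots
        # to roles that are defined for this class.
        unknown = set(member.role_mapping.values()) - set(self.roles)
        if unknown:
            raise ValueError(f"roles not defined for class: {sorted(unknown)}")
        self.members.append(member)

    def lemmas(self, lang: str) -> List[str]:
        # All member lemmas of the given language, in insertion order.
        return [m.lemma for m in self.members if m.lang == lang]


# Illustrative data only; not taken from the real lexicon.
buy_class = SynonymClass(class_id="example-buy", roles=["Buyer", "Goods"])
buy_class.add_member(ClassMember(
    lemma="buy", lang="en",
    role_mapping={"slot1": "Buyer", "slot2": "Goods"},
    external_links={"some-external-lexicon": "placeholder-id"},
))
buy_class.add_member(ClassMember(
    lemma="koupit", lang="cs",
    role_mapping={"slot1": "Buyer", "slot2": "Goods"},
))
```

In this sketch the check in `add_member` stands in for the consistency support that the abstract attributes to the editor: a verb whose valency slots map to a role the class does not define is rejected rather than silently added.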
