Migrating Cornetto Lexicon to New XML Database Engine

The original Cornetto project started to develop a new complex-structured lexicon for the Dutch language. The lexicon building process works with information from two current electronic dictionaries -- the Referentie Bestand Nederlands (RBN), which contains FrameNet-like structures, and the Dutch wordnet (DWN) with the usual wordnet structures. The resulting Cornetto lexicon is stored in a system called Cornetto database, which is built over the Dictionary Editor and Browser platform. In this paper, we describe a transition of the Cornetto database system to a new database backend based on large set of tests that were run on four selected (out of twenty) available XML database systems. We present the technical details of the Cornetto editing process and the results before and after the database transition.

[1]  Erhard Rahm,et al.  Multi-user Evaluation of XML Data Management Systems with XMach-1 , 2002, EEXTT.

[2]  Aleš Horák,et al.  DEBVisDic - First Version of New Client-Server Wordnet Browsing and Editing Tool , 2005 .

[3]  Torsten Grust,et al.  MonetDB/XQuery: a fast XQuery processor powered by a relational engine , 2006, SIGMOD Conference.

[4]  Aleš Horák,et al.  New clients for dictionary writing on the DEB platform , 2006 .

[5]  Ge Yu,et al.  What makes the differences: benchmarking XML database implementations , 2005, TOIT.

[6]  Martin Bukatovic,et al.  Which XML Storage for Knowledge and Ontology Systems? , 2010, KES.

[7]  Jason Eisner,et al.  Lexical Semantics , 2020, The Handbook of English Linguistics.

[8]  Hiroaki Sato,et al.  FrameNet as a “Net” , 2004, LREC.

[9]  Maxim N. Grinev,et al.  Sedna: A Native XML DBMS , 2006, SOFSEM.

[10]  Stéphane Bressan,et al.  Efficient XML Data Management: An Analysis , 2002, EC-Web.

[11]  Awais Rashid,et al.  XML Data Management: Native XML and XML-Enabled Database Systems , 2003 .

[12]  Djoerd Hiemstra,et al.  PFTijah: text search in an XML database system , 2006 .

[13]  Wolfgang Meier,et al.  eXist: An Open Source Native XML Database , 2002, Web, Web-Services, and Database Systems.

[14]  Piek Vossen,et al.  EuroWordNet: A multilingual database with lexical semantic networks , 1998, Springer Netherlands.

[15]  Aleš Horák,et al.  Cornetto Tools and Methodology for Interlinking Lexical Units,Synsets and Ontology , 2008 .