Inference and Reconciliation in a Crowdsourced Lexical-Semantic Network

Lexical-semantic network construction and validation is a major issue in NLP. No matter the construction strategies used, automatically inferring new relations from already existing ones is a way to improve the global quality of the resource by densifying the network. In this context, the purpose of an inference engine is to formulate new conclusions (i.e. relations between terms) from already existing premises (also relations) on the network. In this paper we devise an inference engine for the JeuxDeMots lexical network which contains terms and typed relations between terms. In the JeuxDeMots project, the lexical network is constructed with the help of a game with a purpose and thousands of players. Polysemous terms may be refined in several senses (bank may be a bank-financial institution or a bank-river) but as the network is indefinitely under construction (in the context of a Never Ending Learning approach) some senses may be missing. The approach we propose is based on the triangulation method implementing semantic transitivity with a blocking mechanism for avoiding proposing dubious new relations. Inferred relations are proposed to contributors to be validated. In case of invalidation, a reconciliation strategy is undertaken to identify the cause of the wrong inference : an exception, an error in the premises or a transitivity confusion due to polysemy with the identification of the proper word senses at stake.

[1]  Rada Mihalcea,et al.  Open Mind Word Expert: Creating Large Annotated Data Collections with Web Users’ Help , 2003, LINC@EACL.

[2]  Mathieu Lafourcade,et al.  Making people play for Lexical Acquisition with the JeuxDeMots prototype , 2007 .

[3]  Piek Vossen,et al.  EuroWordNet: A multilingual database with lexical semantic networks , 1998, Springer Netherlands.

[4]  Laura A. Dabbish,et al.  Designing games with a purpose , 2008, CACM.

[5]  Didier Schwab,et al.  Évaluation et consolidation d'un réseau lexical grâce à un assistant ludique pour le " mot sur le bout de la langue " , 2011 .

[6]  Benoît Sagot,et al.  Construction d’un wordnet libre du français à partir de ressources multilingues , 2008, JEPTALNRECITAL.

[7]  Donghui Feng,et al.  Acquiring High Quality Non-Expert Knowledge from On-Demand Workforce , 2009, PWNLP@IJCNLP.

[8]  Mathieu Lafourcade,et al.  JeuxDeMots : un prototype ludique pour l'émergence de relations entre termes , 2008 .

[9]  Andrea Marchetti,et al.  SemKey: A Semantic Collaborative Tagging System , 2007 .

[10]  Iryna Gurevych,et al.  Wisdom of crowds versus wisdom of linguists – measuring the semantic relatedness of words , 2009, Natural Language Engineering.

[11]  Martin Hepp,et al.  Games with a Purpose for the Semantic Web , 2008, IEEE Intelligent Systems.

[12]  Yolanda Gil,et al.  Improving the design of intelligent acquisition interfaces for collecting world knowledge from web contributors , 2005, K-CAP '05.

[13]  Douglas B. Lenat,et al.  CYC: a large-scale investment in knowledge infrastructure , 1995, CACM.

[14]  Qiang Dong,et al.  Hownet And The Computation Of Meaning , 2006 .

[15]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[16]  Udo Kruschwitz,et al.  Phrase Detectives: A Web-based collaborative annotation game , 2008 .

[17]  H. Lieberman Common Consensus : a web-based game for collecting commonsense goals , 2007 .

[18]  Simone Paolo Ponzetto,et al.  BabelNet: Building a Very Large Multilingual Semantic Network , 2010, ACL.

[19]  Bernard Victorri,et al.  Construction d'espaces sémantiques à l'aide de dictionnaires de synonymes , 1998 .