Linked Disambiguated Distributional Semantic Networks

We present a new hybrid lexical knowledge base that combines the contextual information of distributional models with the conciseness and precision of manually constructed lexical networks. Computing our count-based distributional model involves inducing word senses for single-word and multi-word terms, disambiguating word similarity lists, extracting taxonomic relations with patterns, and collecting context clues for disambiguation in context. In contrast to dense vector representations, our resource is human-readable and interpretable, and can therefore be easily embedded within the Semantic Web ecosystem.
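
To make the idea of a count-based model with induced word senses more concrete, the following is a minimal sketch and not the implementation behind the resource. It assumes that each term is described by a set of context features, that similarity is simply the number of shared features, and that senses are the connected components of a term's neighbourhood graph (a deliberately simple stand-in for a real graph clustering algorithm). The function names, the toy data, and the parameter top_n are all hypothetical.

```python
def similarity_lists(contexts, top_n=10):
    """Map each term to its most similar terms, scored by shared context features.

    contexts: dict mapping a term to a set of context features (toy assumption).
    """
    sims = {}
    for term, feats in contexts.items():
        scored = [(other, len(feats & contexts[other]))
                  for other in contexts if other != term]
        scored = [(other, score) for other, score in scored if score > 0]
        # Highest overlap first; ties broken alphabetically for determinism.
        scored.sort(key=lambda pair: (-pair[1], pair[0]))
        sims[term] = scored[:top_n]
    return sims


def induce_senses(term, sims):
    """Group the neighbours of `term` into sense clusters.

    Builds the neighbourhood graph of `term` (without the term itself) and
    returns its connected components. This is a simplification chosen for
    the sketch, not the clustering used to build the resource.
    """
    neighbours = [n for n, _ in sims[term]]
    in_ego = set(neighbours)
    adj = {n: {m for m, _ in sims[n] if m in in_ego} for n in neighbours}
    # Make the graph symmetric, since truncated lists may be one-sided.
    for n in neighbours:
        for m in list(adj[n]):
            adj[m].add(n)

    senses, seen = [], set()
    for start in neighbours:
        if start in seen:
            continue
        component, stack = [], [start]
        seen.add(start)
        while stack:
            node = stack.pop()
            component.append(node)
            for nxt in adj[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        senses.append(sorted(component))
    return sorted(senses)


# Hypothetical toy data: "python" shares features with both
# programming languages and snakes.
toy_contexts = {
    "python": {"write_code", "import_module", "bite_prey", "shed_skin"},
    "java": {"write_code", "import_module", "compile_class"},
    "scala": {"write_code", "compile_class"},
    "cobra": {"bite_prey", "shed_skin", "inject_venom"},
    "viper": {"bite_prey", "inject_venom"},
}

toy_sims = similarity_lists(toy_contexts)
print(induce_senses("python", toy_sims))
```

In this toy setting the neighbours of "python" fall into two groups, one for snakes and one for programming languages, and each group stays readable as a plain list of words. That readability is the property the abstract contrasts with dense vector representations.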
