The Role of Common-Sense Knowledge in Assessing Semantic Association

Natural language processing techniques often aim to extract semantics from text automatically. However, they usually rely on existing semantic knowledge contained in dictionaries and in resources such as WordNet, Wikipedia, and FrameNet. Accordingly, a large body of literature addresses the creation of novel semantic resources as well as attempts to integrate existing ones. In this context, we focus on common-sense knowledge, which has interesting characteristics but also poses challenging issues such as ambiguity, vagueness, and inconsistency. In this paper, we use ConceptNet, a large-scale crowdsourced common-sense knowledge base, to qualitatively evaluate the role of such knowledge in the perception of semantic association among words. We then propose an unsupervised method to disambiguate ConceptNet instances and integrate them into WordNet, and we demonstrate how the enriched resource improves the recognition of semantic association. Finally, we describe a novel approach to labeling semantically associated words that exploits the functional and behavioral information usually contained in common-sense knowledge, and we demonstrate how this enhances the explanation (and the use) of relatedness and similarity with non-numeric information.
