MSC+: Language pattern learning for word sense induction and disambiguation

Abstract Identifying the correct meaning of words in context or discovering new word senses is particularly useful for several tasks such as question answering, information extraction, information retrieval, and text summarization. However, specially in the context of user-generated contents and on-line communication (e.g. Twitter), new meanings are continuously crafted by speakers as the result of existing words being used in novel contexts. Consequently, lexical semantics inventories and systems have difficulties to cope with semantic drifting problems. In this work, we propose an approach to induce and disambiguate word senses of some target words in collections of short texts, such as tweets, through the use of fuzzy lexico-semantic patterns that we define as sequences of Morpho-semantic Components (MSC). We learn these patterns, that we call M S C + patterns, from text data automatically. Experimental results show that instances of some M S C + patterns arise in a number of tweets, but sometimes using different words to convey the sense of the respective MSC in some tweets where pattern instances appear. The exploitation of M S C + patterns when they induce semantics on target words enable effective word sense disambiguation mechanisms leading to improvements in the state of the art.

[1]  Roberto Navigli,et al.  Nasari: Integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities , 2016, Artif. Intell..

[2]  Roberto Navigli,et al.  Clustering and Diversifying Web Search Results with Graph-Based Word Sense Induction , 2013, CL.

[3]  Roberto Navigli,et al.  Entity Linking meets Word Sense Disambiguation: a Unified Approach , 2014, TACL.

[4]  Julio Gonzalo,et al.  The role of named entities in Web People Search , 2009, EMNLP.

[5]  Andrés Montoyo,et al.  Spreading semantic information by Word Sense Disambiguation , 2017, Knowl. Based Syst..

[6]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[7]  Zhe Zhao,et al.  Identify Shifts of Word Semantics through Bayesian Surprise , 2018, SIGIR.

[8]  David Jurgens,et al.  SemEval-2013 Task 13: Word Sense Induction for Graded and Non-Graded Senses , 2013, SemEval@NAACL-HLT.

[9]  Hinrich Schütze,et al.  Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[10]  Horacio Saggion,et al.  A text summarization method based on fuzzy rules and applicable to automated assessment , 2019, Expert Syst. Appl..

[11]  Fabian M. Suchanek,et al.  YAGO3: A Knowledge Base from Multilingual Wikipedias , 2015, CIDR.

[12]  Mauro Dragoni,et al.  Computational advertising in social networks: an opinion mining-based approach , 2018, SAC.

[13]  Ming Zhou,et al.  Joint Inference of Named Entity Recognition and Normalization for Tweets , 2012, ACL.

[14]  Tiejun Zhao,et al.  PengYuan@PKU: Extracting Infrequent Sense Instance with the Same N-Gram Pattern for the SemEval-2010 Task 15 , 2010, SemEval@ACL.

[15]  Nikos Pelekis,et al.  The Baquara2 knowledge-based framework for semantic enrichment and analysis of movement data , 2015, Data Knowl. Eng..

[16]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[17]  Eneko Agirre,et al.  Personalizing PageRank for Word Sense Disambiguation , 2009, EACL.

[18]  Iryna Gurevych,et al.  Lexical-semantic resources: yet powerful resources for automatic personality classification , 2017, GWC.

[19]  Simone Paolo Ponzetto,et al.  BabelNet: Building a Very Large Multilingual Semantic Network , 2010, ACL.

[20]  Timothy Baldwin,et al.  Word Sense Induction for Novel Sense Detection , 2012, EACL.

[21]  Domenico Beneventano,et al.  Word Sense Induction with Multilingual Features Representation , 2014, 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT).

[22]  Patrick Pantel,et al.  DIRT @SBT@discovery of inference rules from text , 2001, KDD '01.

[23]  Yuefeng Li,et al.  Conceptual annotation of text patterns , 2017, Comput. Intell..

[24]  Jian Su,et al.  Entity Linking Leveraging Automatically Generated Annotation , 2010, COLING.

[25]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[26]  Ulrich Schiel,et al.  An approach for building lexical-semantic resources based on heterogeneous information sources , 2015, SAC.

[27]  Roberto Navigli,et al.  SemEval-2013 Task 11: Word Sense Induction and Disambiguation within an End-User Application , 2013, SemEval@NAACL-HLT.

[28]  Maria Ruiz-Casado,et al.  Automatising the learning of lexical patterns: An application to the enrichment of WordNet by extracting semantic relationships from Wikipedia , 2007, Data Knowl. Eng..

[29]  Christian Biemann,et al.  Chinese Whispers - an Efficient Graph Clustering Algorithm and its Application to Natural Language Processing Problems , 2006 .

[30]  Stefano Faralli,et al.  Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction and Disambiguation , 2017, EACL.

[31]  Sanja Stajner,et al.  Making It Simplext , 2015, ACM Trans. Access. Comput..

[32]  Norita Md Norwawi,et al.  Lexical Disambiguation in Natural Language Questions (NLQs) , 2017, ArXiv.

[33]  Daniel Jurafsky,et al.  Do Multi-Sense Embeddings Improve Natural Language Understanding? , 2015, EMNLP.

[34]  Jean Véronis,et al.  HyperLex: lexical cartography for information retrieval , 2004, Comput. Speech Lang..

[35]  Andrew McCallum,et al.  Efficient Graph-based Word Sense Induction by Distributional Inclusion Vector Embeddings , 2018, TextGraphs@NAACL-HLT.

[36]  Christian Biemann,et al.  Making Sense of Word Embeddings , 2016, Rep4NLP@ACL.

[37]  Dipankar Das,et al.  Lexical Resource for Medical Events: A Polarity Based Approach , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[38]  Bruno Crémilleux,et al.  Discovering Linguistic Patterns Using Sequence Mining , 2012, CICLing.

[39]  Mirella Lapata,et al.  Bayesian Word Sense Induction , 2009, EACL.

[40]  Jun Hu,et al.  What Is New in Our City? A Framework for Event Extraction Using Social Media Posts , 2015, PAKDD.

[41]  Eneko Agirre,et al.  Two graph-based algorithms for state-of-the-art WSD , 2006, EMNLP.

[42]  Cesar H. Comin,et al.  Clustering algorithms: A comparative approach , 2016, PloS one.

[43]  Kalina Bontcheva,et al.  Extracting Information from Social Media with GATE , 2016 .

[44]  Rada Mihalcea,et al.  Word sense disambiguation with pattern learning and automatic feature selection , 2002, Natural Language Engineering.

[45]  Roberto Navigli A Quick Tour of Word Sense Disambiguation, Induction and Related Approaches , 2012, SOFSEM.