Contextualized Diachronic Word Representations

Diachronic word embeddings play a key role in capturing interesting patterns about how language evolves over time. Most of the existing work focuses on studying corpora spanning across several decades, which is understandably still not a possibility when working on social media-based user-generated content. In this work, we address the problem of studying semantic changes in a large Twitter corpus collected over five years, a much shorter period than what is usually the norm in di-achronic studies. We devise a novel attentional model, based on Bernoulli word embeddings, that are conditioned on contextual extra-linguistic (social) features such as network, spatial and socioeconomic variables, which are associated with Twitter users, as well as topic-based features. We posit that these social features provide an inductive bias that helps our model to overcome the narrow time-span regime problem. Our extensive experiments reveal that our proposed model is able to capture subtle semantic shifts without being biased towards frequency cues and also works well when certain con-textual features are absent. Our model fits the data better than current state-of-the-art dynamic word embedding models and therefore is a promising tool to study diachronic semantic changes over small time periods.

[1]  Yoav Goldberg,et al.  Adversarial Removal of Demographic Attributes from Text Data , 2018, EMNLP.

[2]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[3]  Eric Fleury,et al.  Socioeconomic Dependencies of Linguistic Patterns in Twitter: a Multivariate Analysis , 2018, WWW.

[4]  David M. Blei,et al.  Dynamic Embeddings for Language Evolution , 2018, WWW.

[5]  Stephan Mandt,et al.  Dynamic Word Embeddings , 2017, ICML.

[6]  Guillaume Lample,et al.  What you can cram into a single $&!#* vector: Probing sentence embeddings for linguistic properties , 2018, ACL.

[7]  Erik Velldal,et al.  Diachronic word embeddings and semantic shifts: a survey , 2018, COLING.

[8]  Hassan Sajjad,et al.  Distant Supervision for Tweet Classification Using YouTube Labels , 2015, ICWSM.

[9]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[10]  Yulia Tsvetkov,et al.  A bottom up approach to category mapping and meaning change , 2015, NetWordS.

[11]  Slav Petrov,et al.  Temporal Analysis of Language through Neural Language Models , 2014, LTCSS@ACL.

[12]  Hui Xiong,et al.  Dynamic Word Embeddings for Evolving Semantic Discovery , 2017, WSDM.

[13]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[14]  Lars Borin,et al.  Survey of Computational Approaches to Diachronic Conceptual Change , 2018, ArXiv.

[15]  H. Robbins A Stochastic Approximation Method , 1951 .

[16]  Daniel Jurafsky,et al.  Understanding Neural Networks through Representation Erasure , 2016, ArXiv.

[17]  Brendan Kennedy,et al.  Incorporating Demographic Embeddings Into Language Understanding , 2019, Cogn. Sci..

[18]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19]  Christopher D. Manning,et al.  Multiword Expression Identification with Tree Substitution Grammars: A Parsing tour de force with French , 2011, EMNLP.

[20]  Benjamin Müller,et al.  ELMoLex: Connecting ELMo and Lexicon Features for Dependency Parsing , 2018, CoNLL Shared Task.

[21]  Jason Weston,et al.  #TagSpace: Semantic Embeddings from Hashtags , 2014, EMNLP.

[22]  Andreas Blank,et al.  Historical Semantics and Cognition , 1999 .

[23]  Dirk Hovy,et al.  Capturing Regional Variation with Distributed Place Representations and Geographic Retrofitting , 2018, EMNLP.

[24]  Steven Skiena,et al.  Statistically Significant Detection of Linguistic Change , 2014, WWW.

[25]  David M. Blei,et al.  Exponential Family Embeddings , 2016, NIPS.

[26]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[27]  Jure Leskovec,et al.  Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change , 2016, ACL.

[28]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.