Diachronic Synonymy and Polysemy: Exploring Dynamic Relation Between Forms and Meanings of Words Based on Word Embeddings

In recent years, there has been a large number of publications that use distributed methods to track temporal changes in lexical semantics. However, most current researches only state the simple fact that the meaning of words has changed, lacking more detailed and in-depth analysis. We combine linguistic theory and word embedding model to study Chinese diachronic semantics. Specifically, two methods of word analogy and word similarity are associated with diachronic synonymy and diachronic polysemy respectively, and the aligned diachronic word embeddings are used to detect the changes of relationship between forms and meanings of words. Through experiments and case studies, our method achieves the ideal result. We also find that the evolution of Chinese vocabulary is closely related to social development, and there is a certain correlation between the polysemy and synonymy of the word meaning.

[1]  Patrick Juola,et al.  The Time Course of Language Change , 2003, Comput. Humanit..

[2]  Marco Baroni,et al.  A distributional similarity approach to the detection of semantic change in the Google Books Ngram corpus. , 2011, GEMS.

[3]  Maarten Marx,et al.  UvA-DARE (Digital Academic Repository) Words are Malleable: Computing Semantic Shifts in Political and Media Discourse , 2017 .

[4]  Sourav S. Bhowmick,et al.  Omnia Mutantur, Nihil Interit: Connecting Past with Present by Finding Corresponding Terms across Time , 2015, ACL.

[5]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[6]  Terrence Szymanski,et al.  Temporal Word Analogies: Identifying Lexical Replacement with Diachronic Word Embeddings , 2017, ACL.

[7]  Slav Petrov,et al.  Temporal Analysis of Language through Neural Language Models , 2014, LTCSS@ACL.

[8]  Jianxin Li,et al.  Incrementally Learning the Hierarchical Softmax Function for Neural Language Models , 2017, AAAI.

[9]  Björn-Olav Dozo,et al.  Quantitative Analysis of Culture Using Millions of Digitized Books , 2010 .

[10]  Elly van Gelderen,et al.  A History of the English Language , 2000 .

[11]  Beate Hampe,et al.  Germanic Future Constructions. A Usage-Based Approach to Language Change , 2009 .

[12]  J. R. Firth,et al.  A Synopsis of Linguistic Theory, 1930-1955 , 1957 .

[13]  H. Varian,et al.  Predicting the Present with Google Trends , 2009 .

[14]  Patrick Pantel,et al.  From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..

[15]  S. Gries Particle Movement: a Cognitive and Functional Approach* , 2022 .

[16]  Steven Skiena,et al.  Statistically Significant Detection of Linguistic Change , 2014, WWW.

[17]  Jure Leskovec,et al.  Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change , 2016, ACL.

[18]  Alexander Mehler,et al.  On the Linearity of Semantic Change: Investigating Meaning Variation via Dynamic Graph Models , 2016, ACL.

[19]  F. Saussure,et al.  Course in General Linguistics , 1960 .

[20]  Hans-Jörg Schmid,et al.  The NeoCrawler: identifying and retrieving neologisms from the internet and monitoring ongoing change , 2011 .