A Genetic Algorithm for Discovering Linguistic Communities in Spatiosocial Tensors with an Application to Trilingual Luxemburg

Multimodal social networks are omnipresent in Web 2.0 with virtually every human communication action taking place there. Nonetheless, language remains by far the main premise such communicative acts unfold upon. Thus, it is statutory to discover language communities especially in social data stemming from historically multilingual countries such as Luxemburg. An adjacency tensor is especially suitable for representing such spatiosocial data. However, because of its potentially large size, heuristics should be developed for locating community structure efficiently. Linguistic structure discovery has a plethora of applications including digital marketing and online political campaigns, especially in case of prolonged and intense cross-linguistic contact. This conference paper presents TENSOR-G, a flexible genetic algorithm for approximate tensor clustering along with two alternative fitness functions derived from language variation or diffusion properties. The Kruskal tensor decomposition serves as a benchmark and the results obtained from a set of trilingual Luxemburgian tweets are analyzed with linguistic criteria.

[1]  David E. Goldberg,et al.  Genetic algorithms and Machine Learning , 1988, Machine Learning.

[2]  Georgios Drakopoulos,et al.  A Space Efficient Scheme for Persistent Graph Representation , 2014, 2014 IEEE 26th International Conference on Tools with Artificial Intelligence.

[3]  Stephen Pax Leonard,et al.  Language change and digital media: A review of conceptions and evidence , 2011 .

[4]  Kazuko Matsumoto,et al.  The role of social networks in the post-colonial multilingual island of Palau: Mechanisms of language maintenance and shift , 2010 .

[5]  R. Dixon The rise and fall of languages , 1997 .

[6]  Eric P. Xing,et al.  Diffusion of Lexical Change in Social Media , 2012, PloS one.

[7]  R. M. W. Dixon The Rise and Fall of Languages by R. M. W. Dixon , 1997 .

[8]  Jean-Francois Cardoso,et al.  Eigen-structure of the fourth-order cumulant tensor with application to the blind source separation problem , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[9]  Tamara G. Kolda,et al.  Efficient MATLAB Computations with Sparse and Factored Tensors , 2007, SIAM J. Sci. Comput..

[10]  Lars Backstrom,et al.  Find me if you can: improving geographical prediction with social and spatial proximity , 2010, WWW '10.

[11]  Jacob Eisenstein,et al.  Sociolinguistic Variation in Online Social Media , 2015 .

[12]  Kenneth DeJong,et al.  Learning with genetic algorithms: An overview , 1988, Machine Learning.

[13]  D.E. Goldberg,et al.  Classifier Systems and Genetic Algorithms , 1989, Artif. Intell..

[14]  R. M. W. Dixon,et al.  The rise and fall of languages , 1997 .

[15]  Brigitte Pakendorf,et al.  Historical linguistics and molecular anthropology , 2014 .

[16]  Vasileios Megalooikonomou,et al.  Augmenting fMRI-generated brain connectivity with temporal information , 2016, 2016 7th International Conference on Information, Intelligence, Systems & Applications (IISA).

[17]  Jennifer Golbeck,et al.  Bridging languages in social networks: How multilingual users of Twitter connect language communities? , 2012, ASIST.

[18]  Iosif Vissarionovich Stalin Marxism and Problems of Linguistics , 2003 .

[19]  Vasileios Megalooikonomou,et al.  An adaptive higher order scheduling policy with an application to biosignal processing , 2016, 2016 IEEE Symposium Series on Computational Intelligence (SSCI).

[20]  Lev Michael,et al.  Social dimensions of language change , 2014 .

[21]  Tamara G. Kolda,et al.  Temporal Link Prediction Using Matrix and Tensor Factorizations , 2010, TKDD.

[22]  Rebecca Maybaum,et al.  Language Change as a Social Process: Diffusion Patterns of Lexical Innovations in Twitter , 2013 .

[23]  Mohsen Guizani,et al.  5G wireless backhaul networks: challenges and research advances , 2014, IEEE Network.

[24]  J. Milroy,et al.  Linguistic change, social network and speaker innovation , 1985, Journal of Linguistics.

[26]  Evangelos E. Papalexakis,et al.  Understanding Multilingual Social Networks in Online Immigrant Communities , 2015, WWW.

[27]  Matthew Rowe,et al.  Language Innovation and Change in On-line Social Networks , 2015, HT.

[28]  Georgios Drakopoulos,et al.  Tensor fusion of social structural and functional analytics over Neo4j , 2016, 2016 7th International Conference on Information, Intelligence, Systems & Applications (IISA).

[29]  Scott A. Hale Global connectivity and multilinguals in the Twitter network , 2014, CHI.

[30]  Terttu Nevalainen,et al.  Social networks and language change in Tudor and Stuart London – only connect? , 2015, English Language and Linguistics.

[31]  Spyros Sioutas,et al.  Tensor-Based Semantically-Aware Topic Clustering of Biomedical Documents , 2017, Comput..

[32]  David Sanchez,et al.  Dialectometric analysis of language variation in Twitter , 2017, VarDial.

[33]  Carl-Fredrik Westin,et al.  Processing and visualization for diffusion tensor MRI , 2002, Medical Image Anal..

[34]  B. Evans,et al.  The Routledge Handbook of Historical Linguistics , 2015 .

[35]  Andreas Kanavos,et al.  Tensor-based document retrieval over Neo4j with an application to PubMed mining , 2016, 2016 7th International Conference on Information, Intelligence, Systems & Applications (IISA).

[36]  Yaron Matras,et al.  Languages in contact in a world marked by change and mobility , 2013 .

[37]  Ed H. Chi,et al.  Language Matters In Twitter: A Large Scale Study , 2011, ICWSM.

[38]  Nikos D. Sidiropoulos,et al.  Tensor Algebra and Multidimensional Harmonic Retrieval in Signal Processing for MIMO Radar , 2010, IEEE Transactions on Signal Processing.

[39]  W. Labov Transmission and Diffusion , 2007 .

[40]  Rahul Goel,et al.  The Social Dynamics of Language Change in Online Networks , 2016, SocInfo.

[41]  Matthew Rowe,et al.  Birds of a Feather Talk Together: User Influence on Language Adoption , 2017, HICSS.

[42]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..