Capturing the diversity of multilingual societies

The cultural diversity encoded in the world's languages is at risk, as many languages have become endangered in recent decades in a context of growing globalization. To preserve this diversity, it is first necessary to understand what drives language extinction and which mechanisms might enable coexistence. Here, we study language shift mechanisms from theoretical and data-driven perspectives. A large-scale empirical analysis of multilingual societies using Twitter and census data reveals a wide diversity of spatial patterns of language coexistence. These patterns range from a mixing of language speakers to segregation, with multilinguals on the boundaries of disjoint linguistic domains. To understand how these different states can emerge and, especially, become stable, we propose a model in which language coexistence is reached when learning the other language is facilitated and when bilinguals favor the use of the endangered language. Simulations carried out in a metapopulation framework highlight the importance of spatial interactions arising from people's mobility in explaining the stability of a mixed state or the presence of a boundary between two linguistic regions. Further, we find that the history of languages is critical to understanding their present state.
