The Social Dynamics of Language Change in Online Networks

Language change is a complex social phenomenon, revealing pathways of communication and sociocultural influence. But, while language change has long been a topic of study in sociolinguistics, traditional linguistic research methods rely on circumstantial evidence, estimating the direction of change from differences between older and younger speakers. In this paper, we use a data set of several million Twitter users to track language changes in progress. First, we show that language change can be viewed as a form of social influence: we observe complex contagion for phonetic spellings and “netspeak” abbreviations (e.g., lol), but not for older dialect markers from spoken language. Next, we test whether specific types of social network connections are more influential than others, using a parametric Hawkes process model. We find that tie strength plays an important role: densely embedded social ties are significantly better conduits of linguistic influence. Geographic locality appears to play a more limited role: we find relatively little evidence to support the hypothesis that individuals are more influenced by geographically local social ties, even in their usage of geographical dialect markers.

[1]  Henry A. Kautz,et al.  Finding your friends and following them to where you are , 2012, WSDM '12.

[2]  Susan C. Herring,et al.  Grammar and Electronic Communication , 2012 .

[3]  Lisa J. Green,et al.  African American English: African American English , 2002 .

[4]  Rizal Setya Perdana What is Twitter , 2013 .

[5]  B. Latour,et al.  Laboratory Life: The Construction of Scientific Facts , 1979 .

[6]  Yosihiko Ogata,et al.  On Lewis' simulation method for point processes , 1981, IEEE Trans. Inf. Theory.

[7]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[8]  Jacob Eisenstein,et al.  Confounds and Consequences in Geotagged Twitter Data , 2015, EMNLP.

[9]  Fang Wu,et al.  Social Networks that Matter: Twitter Under the Microscope , 2008, First Monday.

[10]  J. Hunter African American English: A Linguistic Introduction , 2002 .

[11]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[12]  David Crystal,et al.  Language and the Internet , 2001 .

[13]  Partha Niyogi,et al.  A Dynamical Systems Model for Language Change , 1994, Complex Syst..

[14]  M. Macy,et al.  Complex Contagions and the Weakness of Long Ties1 , 2007, American Journal of Sociology.

[15]  W. Labov The social motivation of a sound change , 1963 .

[16]  Kira Hall,et al.  Identity and interaction: a sociocultural linguistic approach , 2005, Discourse Studies.

[17]  W. Labov Principles Of Linguistic Change , 1994 .

[18]  William Labov Penelope Eckert, Linguistic variation as social practice. Oxford: Blackwell, 2000. Pp. xvi, 240. Hb $62.95, pb $28.95. , 2002, Language in Society.

[19]  Jacob Eisenstein,et al.  AUDIENCE-MODULATED VARIATION IN ONLINE SOCIAL MEDIA , 2015 .

[20]  Walt Wolfram,et al.  The Linguistic Variable: Fact and Fantasy , 1991 .

[21]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[22]  Stephen Pax Leonard,et al.  Language change and digital media: A review of conceptions and evidence , 2011 .

[23]  Mary Bucholtz,et al.  Hella Nor Cal or Totally So Cal? , 2007 .

[24]  Jure Leskovec,et al.  SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity , 2015, KDD.

[25]  J. Milroy,et al.  Social network and social class: Toward an integrated sociolinguistic model , 1992, Language in Society.

[26]  Steven Skiena,et al.  Statistically Significant Detection of Linguistic Change , 2014, WWW.

[27]  Ravi Kumar,et al.  Influence and correlation in social networks , 2008, KDD.

[28]  Lars Backstrom,et al.  Find me if you can: improving geographical prediction with social and spatial proximity , 2010, WWW '10.

[29]  P. Trudgill Sex, covert prestige and linguistic change in the urban British English of Norwich , 1972, Language in Society.

[30]  Lauren Squires Enregistering internet language , 2010, Language in Society.

[31]  Lada A. Adamic,et al.  The role of social networks in information diffusion , 2012, WWW.

[32]  Bakuwa Japhet,et al.  A critique of Latour and Woolgar''s argument for the social construction of scientific facts in laboratory Life: the construction of scientific facts (1986) , 2013 .

[33]  Jure Leskovec,et al.  Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change , 2016, ACL.

[34]  Jacob Eisenstein,et al.  What to do about bad language on the internet , 2013, NAACL.

[35]  John R. Rickford,et al.  GEOGRAPHICAL DIVERSITY, RESIDENTIAL SEGREGATION, AND THE VITALITY OF AFRICAN AMERICAN VERNACULAR ENGLISH AND ITS SPEAKERS , 2010 .

[36]  Thomas L. Griffiths,et al.  Language Evolution by Iterated Learning With Bayesian Agents , 2007, Cogn. Sci..

[37]  H. Samy Alim,et al.  Language in the USA: Hip Hop Nation Language , 2015 .

[38]  Jennifer Neville,et al.  Randomization tests for distinguishing social influence and homophily effects , 2010, WWW '10.

[39]  L. Gasser,et al.  Centers and peripheries: Network roles in language change , 2010 .

[40]  Lisa J. Green African American English: Contents , 2002 .

[41]  Robin I. M. Dunbar Neocortex size as a constraint on group size in primates , 1992 .

[42]  A. Hawkes Spectra of some self-exciting and mutually exciting point processes , 1971 .

[43]  Hongbo Deng,et al.  Identifying and labeling search tasks via query-based hawkes processes , 2014, KDD.

[44]  P. Eckert Linguistic variation as social practice , 2000 .

[45]  Wendy Liu,et al.  Homophily and Latent Attribute Inference: Inferring Latent Attributes of Twitter Users from Neighbors , 2012, ICWSM.

[46]  Hongyuan Zha,et al.  Learning Parametric Models for Social Infectivity in Multi-Dimensional Hawkes Processes , 2014, AAAI.

[47]  S. Tagliamonte,et al.  LINGUISTIC RUIN? LOL! INSTANT MESSAGING AND TEEN LANGUAGE , 2008 .

[48]  Li Wang,et al.  How Noisy Social Media Text, How Diffrnt Social Media Sources? , 2013, IJCNLP.

[49]  Barbara Johnstone,et al.  "Dahntahn" Pittsburgh: Monophthongal /aw/ and Representations of Localness in Southwestern Pennsylvania , 2002 .

[50]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[51]  Jacob Eisenstein Systematic patterning in phonologically‐motivated orthographic variation , 2015 .

[52]  Eric P. Xing,et al.  Sparse Additive Generative Models of Text , 2011, ICML.

[53]  W. Labov Principles of Linguistic Change: Cognitive and Cultural Factors , 2010 .