Extracting Social Networks from Language Statistics

ABSTRACT Knowledge regarding social information is commonly believed to be derived from sources such as formal relationships and interviews and can be plotted as complex networks. We explored whether social networks can also be extracted through other means by using language statistics. In three computational studies we computed first-order and higher-order (latent semantic analysis) co-occurrences of story characters in three novels. These statistical linguistic frequencies entered in a multidimensional scaling analysis yielded a two-dimensional solution that correlated with the two-dimensional networks of characters generated by experts. An experimental study in which participants were asked to estimate social networks showed that human estimates are similar to computational estimates. These results demonstrated that language statistics based on texts can be used to generate social networks.

[1]  Max M. Louwerse,et al.  Symbol Interdependency in Symbolic and Embodied Cognition , 2011, Top. Cogn. Sci..

[2]  R. Zajonc Attitudinal effects of mere exposure. , 1968 .

[3]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[4]  Dorota Garczarczyk,et al.  Keeping track of motion events in translation. A case of Spanish translation of J.K. Rowling’s Harry Potter and the Chamber of Secrets , 2012 .

[5]  Tara Moayad,et al.  Proper names in the arabic translation of harry potter and the goblet of fire , 2013 .

[6]  Gabriel Recchia,et al.  Effect size matters: the role of language statistics and perceptual simulation in conceptual processing , 2015 .

[7]  Ken A Paller,et al.  Subliminal Smells can Guide Social Preferences , 2007, Psychological science.

[8]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[9]  R. Zajonc Mere Exposure: A Gateway to the Subliminal , 2001 .

[10]  Keith Oatley,et al.  The Function of Fiction is the Abstraction and Simulation of Social Experience , 2008, Perspectives on psychological science : a journal of the Association for Psychological Science.

[11]  M. Louwerse,et al.  The linguistic and embodied nature of conceptual processing , 2010, Cognition.

[12]  Rolf A. Zwaan,et al.  Language Encodes Geographical Information , 2009, Cogn. Sci..

[13]  L. M. New moon , 1994, Nature.

[14]  Kathleen McKeown,et al.  Extracting Social Networks from Literary Fiction , 2010, ACL.

[15]  Gemma Samuell,et al.  Harry Potter and the Order of the Phoenix , 2008, SIGGRAPH '08.

[16]  A. Friedman,et al.  Bidimensional regression: assessing the configural similarity and accuracy of cognitive maps and other two-dimensional data sets. , 2003, Psychological methods.

[17]  Barry Wellman,et al.  Geography of Twitter networks , 2012, Soc. Networks.

[18]  E. Ebbesen,et al.  Spatial ecology: Its effects on the choice of friends and enemies , 1976 .

[19]  Max M. Louwerse,et al.  Representing Spatial Structure Through Maps and Language: Lord of the Rings Encodes the Spatial Structure of Middle Earth , 2012, Cogn. Sci..

[20]  Marco Rosa,et al.  Four degrees of separation , 2011, WebSci '12.

[21]  Catherine Morgan Good vs. evil: the role of the soundtrack in developing a dichotomy in "Harry potter and the sorcerer's stone" , 2011 .

[22]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[23]  Paul W. Eastwick,et al.  Familiarity does indeed promote attraction in live interaction. , 2011, Journal of personality and social psychology.

[24]  A. Baum,et al.  Compensatory Response to Anticipated Densities1 , 1979 .

[25]  Eric A. Weiss,et al.  Association for computing machinery (ACM) , 2003 .

[26]  M. Louwerse,et al.  Neurological Evidence Linguistic Processes Precede Perceptual Simulation in Conceptual Processing , 2012, Front. Psychology.

[27]  Dan Cosley,et al.  Inferring social ties from geographic coincidences , 2010, Proceedings of the National Academy of Sciences.

[28]  杨熠 Harry Potter and the Deathly Hallows——十年青春,完美谢幕 , 2011 .

[29]  K. Barraclough Eclipse , 2006, BMJ : British Medical Journal.

[30]  N. Milburn To Dwell Among Friends: Personal Networks in Town and City. , 1983 .

[31]  Danielle S. McNamara,et al.  Handbook of latent semantic analysis , 2007 .

[32]  Owen Rambow,et al.  Social Network Analysis of Alice in Wonderland , 2012, CLfL@NAACL-HLT.