Community Structure in Online Collegiate Social Networks

We apply the tools of network analysis to study the roles of university organizations and affiliations in structuring the social networks of students by examining the graphs of Facebook “friendships” at five American universities at a single point in time. In particular, we investigate each single-institution network’s community structure, which we obtain by partitioning the graphs using an eigenvector method. We employ both graphical and quantitative tools, including pair-counting methods that we interpret through statistical analysis and permutation tests, to measure the correlations between the network communities and a set of self-identified user characteristics (residence, class year, major, and high school). We additionally investigate single-gender subsets of the university networks and also examine the impact of incomplete demographic information in the data. Our study across five universities allows one to make comparative observations about the online social lives at the different institutions, which can in turn be used to infer differences in offline lives. It also illustrates how to examine different instances of social networks constructed in similar environments, while emphasizing the array of social forces that combine to form simplified “communities” obtainable by the consideration of the friendship links. In an appendix, we review the basic properties and statistics of the employed paircounting similarity coefficients and recall, in simplified notation, a useful analytical formula for the z-score of the Rand coefficient.

[1]  Ricardo J. G. B. Campello,et al.  A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment , 2007, Pattern Recognit. Lett..

[2]  Erik M Bollt,et al.  Local method for detecting communities. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Guido Caldarelli,et al.  Scale-Free Networks , 2007 .

[4]  S. Strogatz Exploring complex networks , 2001, Nature.

[5]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[6]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[7]  M. Meilă Comparing clusterings---an information based distance , 2007 .

[8]  Mason A. Porter,et al.  Community Structure in the United States House of Representatives , 2007, ArXiv.

[9]  Edward M. Reingold,et al.  Graph drawing by force‐directed placement , 1991, Softw. Pract. Exp..

[10]  Michael Useem,et al.  The Inner Circle: Large Corporations and the Rise of Business Political Activity in the U. S. and U.K. , 1986 .

[11]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[12]  Bernardo A. Huberman,et al.  Rhythms of social interaction: messaging within a massive online network , 2006, ArXiv.

[13]  N. Christakis,et al.  and Tastes, ties, and time: A new social network dataset using Facebook. , 2008 .

[14]  Cliff Lampe,et al.  A familiar face(book): profile elements as signals in an online social network , 2007, CHI.

[15]  Nicholas Christakis,et al.  The Taste for Privacy: An Analysis of College Student Privacy Settings in an Online Social Network , 2008, J. Comput. Mediat. Commun..

[16]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[17]  A. Clauset Finding local community structure in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Tad Hogg,et al.  Multiple Relationship Types in Online Communities and Social Networks , 2008, AAAI Spring Symposium: Social Information Processing.

[19]  Roger Guimerà,et al.  Team Assembly Mechanisms Determine Collaboration Network Structure and Team Performance , 2005, Science.

[20]  Minas Gjoka,et al.  A Walk in Facebook: Uniform Sampling of Users in Online Social Networks , 2009, ArXiv.

[21]  S. Wasserman,et al.  Logit models and logistic regressions for social networks: I. An introduction to Markov graphs andp , 1996 .

[22]  B EllisonNicole,et al.  Social Network Sites , 2007 .

[23]  Mason A. Porter,et al.  Random Walker Ranking for NCAA Division I-A Football , 2007, Am. Math. Mon..

[24]  Sergey N. Dorogovtsev,et al.  Critical phenomena in complex networks , 2007, ArXiv.

[25]  Jesús A. De Loera,et al.  Algebraic unimodular counting , 2001, Math. Program..

[26]  Tad Hogg,et al.  Friends and foes: ideological social networking , 2008, CHI.

[27]  Pamela R. Haunschild Interorganizational imitation: The impact of interlocks on corporate acquisition activity , 1993 .

[28]  Andrew Ward,et al.  Embeddedness, Social Identity and Mobility: Why Firms Leave the NASDAQ and Join the New York Stock Exchange , 2000 .

[29]  Steven L. Puller,et al.  The Old Boy (and Girl) Network: Social Network Formation on University Campuses , 2008 .

[30]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[31]  N. Mantel The detection of disease clustering and a generalized regression approach. , 1967, Cancer research.

[32]  M. Newman,et al.  Robustness of community structure in networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[33]  Ken P Kleinman,et al.  Much Ado About Nothing , 2007, The American statistician.

[34]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[35]  Alex Arenas,et al.  Analysis of the structure of complex networks at different resolution levels , 2007, physics/0703218.

[36]  M E Newman,et al.  Scientific collaboration networks. I. Network construction and fundamental results. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[38]  T. Vicsek,et al.  Community structure and ethnic preferences in school friendship networks , 2006, physics/0611268.

[39]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[40]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[41]  Robert L. Brennan,et al.  MEASURING AGREEMENT WHEN TWO OBSERVERS CLASSIFY PEOPLE INTO CATEGORIES NOT DEFINED IN ADVANCE , 1974 .

[42]  Jasmine Novak,et al.  Geographic routing in social networks , 2005, Proc. Natl. Acad. Sci. USA.

[43]  R. Burt Structural Holes and Good Ideas1 , 2004, American Journal of Sociology.

[44]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[45]  J. Reichardt,et al.  Statistical mechanics of community detection. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[46]  D. Wilkinson,et al.  Social Network Collaborative Filtering , 2008 .

[47]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[48]  Garry Robins,et al.  An introduction to exponential random graph (p*) models for social networks , 2007, Soc. Networks.

[49]  Satoru Kawai,et al.  An Algorithm for Drawing General Undirected Graphs , 1989, Inf. Process. Lett..

[50]  Danah Boyd,et al.  Social network sites: definition, history, and scholarship , 2007, IEEE Engineering Management Review.

[51]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[52]  Mark H. Chignell,et al.  Identifying active subgroups in online communities , 2007, CASCON.

[53]  John Scott What is social network analysis , 2010 .

[54]  Downloaded from , 1997 .

[55]  Richard J. Brook,et al.  Agreement between observers when the categories are not specified in advance , 1984 .

[56]  D. R. White,et al.  Structural cohesion and embeddedness: A hierarchical concept of social groups , 2003 .

[57]  Amanda L. Traud,et al.  Community Structure in Congressional Cosponsorship Networks , 2007, 0708.1191.

[58]  D. Watts The “New” Science of Networks , 2004 .

[59]  David L. Wallace,et al.  A Method for Comparing Two Hierarchical Clusterings: Comment , 1983 .

[60]  Larissa Hiorth,et al.  Being There and Being Here Gendered Customising of Mobile 3G Practices Through a Case Study in Seoul , 2005 .

[61]  Roger Guimerà,et al.  Extracting the hierarchical organization of complex systems , 2007, Proceedings of the National Academy of Sciences.

[62]  F. Harary,et al.  The cohesiveness of blocks in social networks: Node connectivity and conditional density , 2001 .

[63]  M. Newman,et al.  Random graphs with arbitrary degree distributions and their applications. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[64]  Jon M. Kleinberg,et al.  Group formation in large social networks: membership, growth, and evolution , 2006, KDD '06.

[65]  Mason A. Porter,et al.  Community Structure in the United States House of Representatives , 2007 .

[66]  M. Newman,et al.  Mixing Patterns and Community Structure in Networks , 2002, cond-mat/0210146.

[67]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[68]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[69]  U. Brandes,et al.  Maximizing Modularity is hard , 2006, physics/0608255.

[70]  A-L Barabási,et al.  Structure and tie strengths in mobile communication networks , 2006, Proceedings of the National Academy of Sciences.

[71]  Tom A. B. Snijders,et al.  A comparison of various approaches to the exponential random graph model: A reanalysis of 102 student networks in school classes , 2007, Soc. Networks.

[72]  Lawrence Hubert Nominal scale response agreement as a generalized correlation , 1977 .

[73]  Mehran Sahami,et al.  Evaluating similarity measures: a large-scale study in the orkut social network , 2005, KDD '05.

[74]  C. Mallows,et al.  A Method for Comparing Two Hierarchical Clusterings , 1983 .

[75]  M. Fiedler Algebraic connectivity of graphs , 1973 .

[76]  Alex Pothen,et al.  PARTITIONING SPARSE MATRICES WITH EIGENVECTORS OF GRAPHS* , 1990 .

[77]  David Kempe,et al.  Modularity-maximizing graph communities via mathematical programming , 2007, 0710.2533.

[78]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[79]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[80]  R. Burt Brokerage and Closure: An Introduction to Social Capital , 2005 .

[81]  Andreas Noack,et al.  Modularity clustering is force-directed layout , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[82]  Radhika Gajjala,et al.  Shifting Frames: Race, Ethnicity and Intercultural Communication in Online Social Networking and Virtual Work. , 2007 .

[83]  Claudio Castellano,et al.  Community Structure in Graphs , 2007, Encyclopedia of Complexity and Systems Science.

[84]  Elena Kulisnkaya,et al.  Large sample results for permutation tests of association , 1994 .

[85]  Mason A. Porter,et al.  A network analysis of committees in the United States House of Representatives , 2005, ArXiv.

[86]  John W. Van Ness A Method for Comparing Two Hierarchical Clusterings: Comment , 1983 .

[87]  D. Boyd Why Youth (Heart) Social Network Sites: The Role of Networked Publics in Teenage Social Life , 2007 .

[88]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[89]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[90]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[91]  M. Newman,et al.  Community structure in the U.S. House of Representatives. , 2006, Chaos.

[92]  P. Mucha,et al.  Party Polarization in Congress: A Social Networks Approach , 2009 .

[93]  P. Good Permutation, Parametric, and Bootstrap Tests of Hypotheses , 2005 .