Comparing Community Structure to Characteristics in Online Collegiate Social Networks

We study the structure of social networks of students by examining the graphs of Facebook “friendships” at five U.S. universities at a single point in time. We investigate the community structure of each single-institution network and employ visual and quantitative tools, including standardized pair-counting methods, to measure the correlations between the network communities and a set of self-identified user characteristics (residence, class year, major, and high school). We review the basic properties and statistics of the employed pair-counting indices and recall, in simplified notation, a useful formula for the $z$-score of the Rand coefficient. Our study illustrates how to examine different instances of social networks constructed in similar environments, emphasizes the array of social forces that combine to form “communities,” and leads to comparative observations about online social structures, which reflect offline social structures. We calculate the relative contributions of different characteristics to the community structure of individual universities and compare these relative contributions at different universities. For example, we examine the importance of common high school affiliation at large state universities and the varying degrees of influence that common major can have on the social structure at different universities. The heterogeneity of the communities that we observe indicates that university networks typically have multiple organizing factors rather than a single dominant one.

[1]  M. Newman,et al.  Mixing Patterns and Community Structure in Networks , 2002, cond-mat/0210146.

[2]  Alex Arenas,et al.  Analysis of the structure of complex networks at different resolution levels , 2007, physics/0703218.

[3]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[5]  Downloaded from , 1997 .

[6]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[7]  Richard J. Brook,et al.  Agreement between observers when the categories are not specified in advance , 1984 .

[8]  D. R. White,et al.  Structural cohesion and embeddedness: A hierarchical concept of social groups , 2003 .

[9]  Marco Gonzalez,et al.  Tastes, ties, and time: A new social network dataset using Facebook.com , 2008, Soc. Networks.

[10]  Amanda L. Traud,et al.  Community Structure in Congressional Cosponsorship Networks , 2007, 0708.1191.

[11]  U. Brandes,et al.  Maximizing Modularity is hard , 2006, physics/0608255.

[12]  Minas Gjoka,et al.  Walking in Facebook: A Case Study of Unbiased Sampling of OSNs , 2010, 2010 Proceedings IEEE INFOCOM.

[13]  Steven L. Puller,et al.  The Old Boy (and Girl) Network: Social Network Formation on University Campuses , 2008 .

[14]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[15]  N. Mantel The detection of disease clustering and a generalized regression approach. , 1967, Cancer research.

[16]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[17]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[18]  S.,et al.  An Efficient Heuristic Procedure for Partitioning Graphs , 2022 .

[19]  Robert L. Brennan,et al.  MEASURING AGREEMENT WHEN TWO OBSERVERS CLASSIFY PEOPLE INTO CATEGORIES NOT DEFINED IN ADVANCE , 1974 .

[20]  Bernardo A. Huberman,et al.  Rhythms of social interaction: messaging within a massive online network , 2006, ArXiv.

[21]  Jasmine Novak,et al.  Geographic routing in social networks , 2005, Proc. Natl. Acad. Sci. USA.

[22]  R. Burt Structural Holes and Good Ideas1 , 2004, American Journal of Sociology.

[23]  Edward M. Reingold,et al.  Graph drawing by force‐directed placement , 1991, Softw. Pract. Exp..

[24]  Michael Useem,et al.  The Inner Circle: Large Corporations and the Rise of Business Political Activity in the U. S. and U.K. , 1986 .

[25]  Sergey N. Dorogovtsev,et al.  Critical phenomena in complex networks , 2007, ArXiv.

[26]  Jesús A. De Loera,et al.  Algebraic unimodular counting , 2001, Math. Program..

[27]  P. Mucha,et al.  The unreasonable effectiveness of tree-based theory for networks with clustering. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  D. Wilkinson,et al.  Social Network Collaborative Filtering , 2008 .

[29]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[30]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[31]  Tad Hogg,et al.  Friends and foes: ideological social networking , 2008, CHI.

[32]  J. Reichardt,et al.  Statistical mechanics of community detection. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[33]  Garry Robins,et al.  An introduction to exponential random graph (p*) models for social networks , 2007, Soc. Networks.

[34]  Satoru Kawai,et al.  An Algorithm for Drawing General Undirected Graphs , 1989, Inf. Process. Lett..

[35]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[36]  A. Clauset Finding local community structure in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  Andreas Noack,et al.  Modularity clustering is force-directed layout , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  Radhika Gajjala,et al.  Shifting Frames: Race, Ethnicity and Intercultural Communication in Online Social Networking and Virtual Work. , 2007 .

[39]  M. Newman,et al.  Random graphs with arbitrary degree distributions and their applications. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[40]  Tad Hogg,et al.  Multiple Relationship Types in Online Communities and Social Networks , 2008, AAAI Spring Symposium: Social Information Processing.

[41]  D. Boyd Why Youth (Heart) Social Network Sites: The Role of Networked Publics in Teenage Social Life , 2007 .

[42]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[43]  Tom A. B. Snijders,et al.  A comparison of various approaches to the exponential random graph model: A reanalysis of 102 student networks in school classes , 2007, Soc. Networks.

[44]  Claudio Castellano,et al.  Community Structure in Graphs , 2007, Encyclopedia of Complexity and Systems Science.

[45]  Elena Kulisnkaya,et al.  Large sample results for permutation tests of association , 1994 .

[46]  Mason A. Porter,et al.  A network analysis of committees in the United States House of Representatives , 2005, ArXiv.

[47]  Danah Boyd,et al.  Social network sites: definition, history, and scholarship , 2007, IEEE Engineering Management Review.

[48]  Larissa Hiorth,et al.  Being There and Being Here Gendered Customising of Mobile 3G Practices Through a Case Study in Seoul , 2005 .

[49]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[50]  David Kempe,et al.  Modularity-maximizing graph communities via mathematical programming , 2007, 0710.2533.

[51]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[52]  Lawrence Hubert Nominal scale response agreement as a generalized correlation , 1977 .

[53]  C. Mallows,et al.  A Method for Comparing Two Hierarchical Clusterings , 1983 .

[54]  M E Newman,et al.  Scientific collaboration networks. I. Network construction and fundamental results. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[55]  M. Fiedler Algebraic connectivity of graphs , 1973 .

[56]  Alex Pothen,et al.  PARTITIONING SPARSE MATRICES WITH EIGENVECTORS OF GRAPHS* , 1990 .

[57]  Mason A. Porter,et al.  Communities in Networks , 2009, ArXiv.

[58]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[59]  Jon M. Kleinberg,et al.  Group formation in large social networks: membership, growth, and evolution , 2006, KDD '06.

[60]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[61]  Mason A. Porter,et al.  Community Structure in the United States House of Representatives , 2007 .

[62]  Ricardo J. G. B. Campello,et al.  A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment , 2007, Pattern Recognit. Lett..

[63]  Erik M Bollt,et al.  Local method for detecting communities. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[64]  M. Meilă Comparing clusterings---an information based distance , 2007 .

[65]  Guido Caldarelli,et al.  Scale-Free Networks , 2007 .

[66]  S. Strogatz Exploring complex networks , 2001, Nature.

[67]  B EllisonNicole,et al.  Social Network Sites , 2007 .

[68]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[69]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[70]  L. Hubert,et al.  Comparing partitions , 1985 .

[71]  Roger Guimerà,et al.  Team Assembly Mechanisms Determine Collaboration Network Structure and Team Performance , 2005, Science.

[72]  Minas Gjoka,et al.  A Walk in Facebook: Uniform Sampling of Users in Online Social Networks , 2009, ArXiv.

[73]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[74]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[75]  Cliff Lampe,et al.  A familiar face(book): profile elements as signals in an online social network , 2007, CHI.

[76]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[77]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[78]  P. Mucha,et al.  Party Polarization in Congress: A Social Networks Approach , 2009 .

[79]  P. Good Permutation, Parametric, and Bootstrap Tests of Hypotheses , 2005 .

[80]  T. Vicsek,et al.  Community structure and ethnic preferences in school friendship networks , 2006, physics/0611268.

[81]  M. Newman,et al.  Robustness of community structure in networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[82]  Ken P Kleinman,et al.  Much Ado About Nothing , 2007, The American statistician.

[83]  A-L Barabási,et al.  Structure and tie strengths in mobile communication networks , 2006, Proceedings of the National Academy of Sciences.

[84]  John Scott What is social network analysis , 2010 .

[85]  Mason A. Porter,et al.  Random Walker Ranking for NCAA Division I-A Football , 2007, Am. Math. Mon..

[86]  P. Mucha,et al.  Party Polarization in Congress: A Social Networks Approach , 2009, 0907.3509.

[87]  Roger Guimerà,et al.  Extracting the hierarchical organization of complex systems , 2007, Proceedings of the National Academy of Sciences.

[88]  Pamela R. Haunschild Interorganizational imitation: The impact of interlocks on corporate acquisition activity , 1993 .