Tracking cohesive subgroups over time in inferred social networks

As a first step in the development of community trackers for large-scale online interaction, this paper shows how cohesive subgroup analysis using the Social Cohesion Analysis of Networks (SCAN; Chin and Chignell 2008) and Data-Intensive Socially Similar Evolving Community Tracker (DISSECT; Chin and Chignell 2010) methods can be applied to the problem of identifying cohesive subgroups and tracking them over time. Three case studies are reported, and the findings are used to evaluate how well the SCAN and DISSECT methods work for different types of data. In the largest of the case studies, variations in temporal cohesiveness are identified across a set of subgroups extracted from the inferred social network. Further modifications to the DISSECT methodology are suggested based on the results obtained. The paper concludes with recommendations concerning further research that would be beneficial in addressing the community tracking problem for online data.

[1]  A. Tversky,et al.  Similarity of rectangles: An analysis of subjective dimensions , 1975 .

[2]  Myra Spiliopoulou,et al.  Mining and Visualizing the Evolution of Subgroups in Social Networks , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[3]  Deepayan Chakrabarti,et al.  Evolutionary clustering , 2006, KDD '06.

[4]  Corinna Cortes,et al.  Communities of interest , 2001, Intell. Data Anal..

[5]  M Cieplak 蛋白質の折りたたみにおける協調性と接触秩序 | 文献情報 | J-GLOBAL 科学技術総合リンクセンター , 2004 .

[6]  Christos Faloutsos,et al.  Graph evolution: Densification and shrinking diameters , 2006, TKDD.

[7]  Massimo Marchiori,et al.  Method to find community structures based on information centrality. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Mark H. Chignell,et al.  DISSECT: Data-Intensive Socially Similar Evolving Community Tracker , 2010, Computational Social Network Analysis.

[10]  W. Powell,et al.  Network Dynamics and Field Evolution: The Growth of Interorganizational Collaboration in the Life Sciences1 , 2005, American Journal of Sociology.

[11]  Sara E. Sterling Aggregation Techniques to Characterize Social Networks , 2012 .

[12]  Ravi Kumar,et al.  Structure and evolution of online social networks , 2006, KDD '06.

[13]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[14]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Charu C. Aggarwal,et al.  Graph Clustering , 2010, Encyclopedia of Machine Learning and Data Mining.

[16]  Ming Ouyang,et al.  A vector partitioning approach to detecting community structure in complex networks , 2008, Comput. Math. Appl..

[17]  Yun Chi,et al.  Facetnet: a framework for analyzing communities and their evolutions in dynamic networks , 2008, WWW.

[18]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.

[19]  Eytan Adar,et al.  Implicit Structure and the Dynamics of Blogspace , 2004 .

[20]  M. A. Muñoz,et al.  Journal of Statistical Mechanics: An IOP and SISSA journal Theory and Experiment Detecting network communities: a new systematic and efficient algorithm , 2004 .

[21]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[22]  Philip S. Yu,et al.  GraphScope: parameter-free mining of large time-evolving graphs , 2007, KDD '07.

[23]  Mark H. Chignell,et al.  A social hypertext model for finding community in blogs , 2006, HYPERTEXT '06.

[24]  Peter A. Gloor,et al.  Capturing team dynamics through temporal social surfaces , 2005, Ninth International Conference on Information Visualisation (IV'05).

[25]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[26]  Thierry Chanier,et al.  How Social Network Analysis can help to Measure Cohesion in Collaborative Distance-Learning , 2003, CSCL.

[27]  J. Orford Implementation of criteria for partitioning a dendrogram , 1976 .

[28]  A. Richardsen,et al.  Cohesion as a Basic Bond in Groups , 1983 .

[29]  Fred A. Mael,et al.  Social identity theory and the organization , 1989 .

[30]  B. Wellman Structural analysis: From method and metaphor to theory and substance. , 1988 .

[31]  Huan Liu,et al.  Community evolution in dynamic multi-mode networks , 2008, KDD.

[32]  Danyel Fisher,et al.  Using egocentric networks to understand communication , 2005, IEEE Internet Computing.

[33]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[34]  Caroline Haythornthwaite,et al.  Automated Discovery and Analysis of Social Networks from Threaded Discussions , 2008 .

[35]  Sergiy Butenko,et al.  Clique Relaxations in Social Network Analysis: The Maximum k-Plex Problem , 2011, Oper. Res..

[36]  Caroline Haythornthwaite,et al.  Studying Online Social Networks , 2006, J. Comput. Mediat. Commun..

[37]  David L. Hicks,et al.  Detecting Hidden Hierarchy in Terrorist Networks: Some Case Studies , 2008, ISI Workshops.

[38]  Mark H. Chignell,et al.  Automatic detection of cohesive subgroups within social hypertext: A heuristic approach , 2008, New Rev. Hypermedia Multim..

[39]  A. Barabasi,et al.  Evolution of the social network of scientific collaborations , 2001, cond-mat/0104162.

[40]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.

[41]  Bin Wu,et al.  Community detection in large-scale social networks , 2007, WebKDD/SNA-KDD '07.

[42]  Thomas Schank,et al.  UvA-DARE ( Digital Academic Repository ) Animating the development of social networks over time using a dynamic extension of multidimensional scaling , 2008 .

[43]  T. Snijders,et al.  Modeling the Coevolution of Networks and Behavior , 2007 .

[44]  Weixiong Zhang,et al.  An Efficient Spectral Algorithm for Network Community Discovery and Its Applications to Biological and Social Networks , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[45]  Charles T. Zahn,et al.  Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters , 1971, IEEE Transactions on Computers.

[46]  O. Daescu,et al.  Centrality Measures for the Human Red Blood Cell Interactome , 2007, 2007 IEEE Dallas Engineering in Medicine and Biology Workshop.

[47]  Jon M. Kleinberg,et al.  Group formation in large social networks: membership, growth, and evolution , 2006, KDD '06.

[48]  Danyel Fisher,et al.  Visualizing the Signatures of Social Roles in Online Discussion Groups , 2007, J. Soc. Struct..

[49]  Bernardo A. Huberman,et al.  Email as spectroscopy: automated discovery of community structure within organizations , 2003 .

[50]  Xiaofan Wang,et al.  Evolution of a large online social network , 2009 .

[51]  Jeroen K. Vermunt,et al.  What is special about social network analysis , 2006 .

[52]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[53]  An-Ping Zeng,et al.  The Connectivity Structure, Giant Strong Component and Centrality of Metabolic Networks , 2003, Bioinform..

[54]  Jon M. Kleinberg,et al.  Bursty and Hierarchical Structure in Streams , 2002, Data Mining and Knowledge Discovery.

[55]  A. Clauset Finding local community structure in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[56]  Alvin Chin,et al.  “Social cohesion analysis of networks: a novel method for identifying cohesive subgroups in social hypertext” by Alvin Chin, with Jessica Rubart as coordinator , 2009, SIGWEB Newsl..

[57]  Daniel A. McFarland,et al.  Dynamic Network Visualization1 , 2005, American Journal of Sociology.

[58]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[59]  R. Alba A graph‐theoretic definition of a sociometric clique† , 1973 .

[60]  Thomas Erickson,et al.  The World-Wide-Web as social hypertext , 1996, CACM.

[61]  G. Tomlinson,et al.  YouTube as a source of information on immunization: a content analysis. , 2007, JAMA.

[62]  Cameron A. Marlow Audience, structure and authority in the weblog community , 2004 .

[63]  Yun Chi,et al.  Analyzing communities and their evolutions in dynamic social networks , 2009, TKDD.

[64]  R. Sokal,et al.  Principles of numerical taxonomy , 1965 .

[65]  Philip S. Yu,et al.  Mining Community Structure of Named Entities from Web Pages and Blogs , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[66]  Mária Bieliková,et al.  An Approach for Community Cutting , 2005 .

[67]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[68]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[69]  Mark H. Chignell,et al.  Automated Delineation of Subgroups in Web Video: A Medical Activism Case Study , 2010, J. Comput. Mediat. Commun..

[70]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[71]  Ravi Kumar,et al.  Structure and evolution of blogspace , 2004, CACM.

[72]  Jure Leskovec,et al.  Statistical properties of community structure in large social and information networks , 2008, WWW.

[73]  Jiawei Han,et al.  ACM Transactions on Knowledge Discovery from Data: Introduction , 2007 .

[74]  Mark H. Chignell,et al.  Identifying subcommunities using cohesive subgroups in social hypertext , 2007, HT '07.

[75]  Brian Everitt,et al.  Cluster analysis , 1974 .

[76]  S. Borgatti,et al.  LS sets, lambda sets and other cohesive subsets , 1990 .