Social sub-group identification using social graph and semantic analysis

Social networks proliferate daily life, many are part of big groups within social networks, many of these groups contain people unknown to you, but with whom you share interests. Some of these shared interest are based on context which has real-time properties (e.g. Who would like to go to a jazz concert during an AI conference). This paper presents a possible method for identifying such subgroups. We aim to do this by creating a social graph based on one or more “comparators”; Frequency (how often two members interact) and content (how often the two talk about the same thing). After the graph is created we apply standard graph segmentation (clique estimation) techniques to identify the subgroups. We propose a system which continuously polls social network for updates, to keep the social graph up-to-date and based on that suggest new subgroups. As a result the system will suggest new subgroups in a timely and near real-time manner. This paper will focus on the methods behind the creation of the weighted social graph and the subsequent pruning of the social graph.

[1]  Sebastian Funk,et al.  Word usage mirrors community structure in the online social network Twitter , 2013, EPJ Data Science.

[2]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[3]  Bernardo A. Huberman,et al.  E-Mail as Spectroscopy: Automated Discovery of Community Structure within Organizations , 2005, Inf. Soc..

[4]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[5]  Tom A. B. Snijders,et al.  Social Network Analysis , 2011, International Encyclopedia of Statistical Science.

[6]  Jon M. Kleinberg,et al.  Group formation in large social networks: membership, growth, and evolution , 2006, KDD '06.

[7]  Bernardo A. Huberman,et al.  Email as spectroscopy: automated discovery of community structure within organizations , 2003 .

[8]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Tereza Iofciu,et al.  Finding Communities of Practice from User Profiles Based on Folksonomies , 2006, EC-TEL Workshops.

[10]  Muhammad Abulaish,et al.  Mining an enriched social graph to model cross-thread community interactions and interests , 2012, MSM '12.

[11]  Kalina Bontcheva,et al.  Architectural elements of language engineering robustness , 2002, Natural Language Engineering.

[12]  Marián Hönsch,et al.  Detecting User Communities Based on Latent and Dynamic Interest on a News Portal , 2011 .

[13]  Ken Wakita,et al.  Finding community structure in mega-scale social networks: [extended abstract] , 2007, WWW '07.

[14]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Amin Saberi,et al.  Exploring the community structure of newsgroups , 2004, KDD.

[16]  Hamish Cunningham,et al.  GATE-a General Architecture for Text Engineering , 1996, COLING.

[17]  Stephen E. Fienberg,et al.  User Interest and Interaction Structure in Online Forums , 2010, ICWSM.

[18]  Vicenç Gómez,et al.  Statistical analysis of the social network and discussion threads in slashdot , 2008, WWW.

[19]  S. Strogatz Exploring complex networks , 2001, Nature.

[20]  Alex Arenas,et al.  Emergence of clustering, correlations, and communities in a social network model , 2003 .

[21]  A. Arenas,et al.  Community analysis in social networks , 2004 .