iTop: interaction based topic centric community discovery on twitter

Automatic detection of communities (or cohesive groups of actors in social network) in online social media platforms based on user interests and interaction is a problem that has recently attracted a lot of research attention. Mining user interactions on Twitter to discover such communities is a technically challenging information retrieval task. We present an algorithm - iTop - to discover interaction based topic centric communities by mining user interaction signals (such as @-messages and retweets) which indicate cohesion. iTop takes any topic as an input keyword and exploits local information to infer global topic-centric communities. We evaluate the discovered communities along three dimensions: graph based (node-edge quality), empirical-based (Twitter lists) and semantic based (frequent n-grams in tweets). We conduct experiments on a publicly available scrape of Twitter provided by InfoChimps via a web service. We perform a case study on two diverse topics - 'Computer Aided Design (CAD)' and 'Kashmir' to demonstrate the efficacy of iTop. Empirical results from both case studies show that iTop is successfully able to discover topic-centric, interaction based communities on Twitter.

[1]  Huan Liu,et al.  Community evolution in dynamic multi-mode networks , 2008, KDD.

[2]  A. Clauset Finding local community structure in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Fang Wu,et al.  Social Networks that Matter: Twitter Under the Microscope , 2008, First Monday.

[4]  A. Banerjee,et al.  Social Topic Models for Community Extraction , 2008 .

[5]  C. Lee Giles,et al.  Efficient identification of Web communities , 2000, KDD '00.

[6]  Ponnurangam Kumaraguru,et al.  Mining YouTube to Discover Extremist Videos, Users and Hidden Communities , 2010, AIRS.

[7]  Jure Leskovec,et al.  Statistical properties of community structure in large social and information networks , 2008, WWW.

[8]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[9]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Jennifer Jie Xu,et al.  Mining communities and their relationships in blogs: A study of online hate groups , 2007, Int. J. Hum. Comput. Stud..

[11]  L. Venkata Subramaniam,et al.  Probabilistic model for discovering topic based communities in social networks , 2011, CIKM '11.

[12]  Mohit Naresh Kewalramani Community Detection in Twitter , 2011 .

[13]  Huan Liu,et al.  Uncoverning Groups via Heterogeneous Interaction Analysis , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[14]  Ken Wakita,et al.  Finding community structure in mega-scale social networks: [extended abstract] , 2007, WWW '07.

[15]  Yiannis Kompatsiaris,et al.  Bridge Bounding: A Local Approach for Efficient Community Discovery in Complex Networks , 2009, 0902.0871.

[16]  Huan Liu,et al.  Graph Mining Applications to Social Network Analysis , 2010, Managing and Mining Graph Data.

[17]  Alessandro Vespignani,et al.  Modeling Users' Activity on Twitter Networks: Validation of Dunbar's Number , 2011, PloS one.

[18]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[19]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Yun Chi,et al.  Combining link and content for community detection: a discriminative approach , 2009, KDD.

[21]  Martin G. Everett,et al.  Models of core/periphery structures , 2000, Soc. Networks.

[22]  Kevin J. Lang,et al.  Communities from seed sets , 2006, WWW '06.

[23]  L. Venkata Subramaniam,et al.  Using content and interactions for discovering communities in social networks , 2012, WWW.

[24]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  Robin I. M. Dunbar Neocortex size as a constraint on group size in primates , 1992 .

[26]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[27]  Kyle Lacy Twitter Marketing For Dummies , 2009 .

[28]  Chris Volinsky,et al.  Network-Based Marketing: Identifying Likely Adopters Via Consumer Networks , 2006, math/0606278.

[29]  A. Bernardo,et al.  Huberman, Romero, and Wu, Fang. . Social Networks that Matter: Twitter Under the Microscope. , 2008 .

[30]  Ravi Kumar,et al.  Trawling the Web for Emerging Cyber-Communities , 1999, Comput. Networks.