Detecting Local Communities within a Large Scale Social Network Using Mapreduce

Social network partitioning has become a very important function. One objective for partitioning is to identify interested communities to target for marketing and advertising activities. The bottleneck to detection of these communities is the large scalability of the social network. Previous methods did not effectively address the problem because they considered the overall network. Social networks have strong locality, so designing a local algorithm to find an interested community to address this objective is necessary. In this paper, we develop a local partition algorithm, named, Personalized PageRank Partitioning, to identify the community. We compute the conductance of the social network with a Personalized PageRank and Markov chain stationary distribution of the social network, and then sweep the conductance to find the smallest cut. The efficiency of the cut can reach. In order to detect a larger scale social network, we design and implement the algorithm on a MapReduce-programming framework. Finally, we execute our experiment on several actual social network data sets and compare our method to others. The experimental results show that our algorithm is feasible and very effective.

[1]  Santosh S. Vempala,et al.  On clusterings: Good, bad and spectral , 2004, JACM.

[2]  Ravi Bhushan Mishra,et al.  Multi-Agent Negotiation in B2C E-Commerce Based on Data Mining Methods , 2010, Int. J. Intell. Inf. Technol..

[3]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[4]  F. Chung Laplacians and the Cheeger Inequality for Directed Graphs , 2005 .

[5]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[6]  Fan Chung Graham,et al.  Local Partitioning for Directed Graphs Using PageRank , 2007, WAW.

[7]  Amin Saberi,et al.  Exploring the community structure of newsgroups , 2004, KDD.

[8]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Xiaoqing Li,et al.  Identifying Influencers in Online Social Networks: The Role of Tie Strength , 2013, Int. J. Intell. Inf. Technol..

[10]  Bin Wu,et al.  Community detection in large-scale social networks , 2007, WebKDD/SNA-KDD '07.

[11]  Stefan Sommer,et al.  What is the Conversation About?: A Topic-Model-Based Approach for Analyzing Customer Sentiments in Twitter , 2012, Int. J. Intell. Inf. Technol..

[12]  Peter J. Cameron,et al.  Spectral graph theory , 2004 .

[13]  Minas Gjoka,et al.  Coarse-grained topology estimation via graph sampling , 2011, WOSN '12.

[14]  N. Nagaveni,et al.  Low Dimensional Data Privacy Preservation Using Multi Layer Artificial Neural Network , 2012, Int. J. Intell. Inf. Technol..

[15]  M Girvan,et al.  Structure of growing social networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  Jon M. Kleinberg,et al.  Inferring Web communities from link topology , 1998, HYPERTEXT '98.

[17]  Huan Liu,et al.  Community Detection and Mining in Social Media , 2010, Community Detection and Mining in Social Media.

[18]  Jure Leskovec,et al.  Statistical properties of community structure in large social and information networks , 2008, WWW.

[19]  Mudasser F. Wyne,et al.  Web 2.0 Based Intelligent Software Architecture for Photograph Sharing , 2010, Int. J. Intell. Inf. Technol..

[20]  Yiannis Kompatsiaris,et al.  Community detection in Social Media , 2012, Data Mining and Knowledge Discovery.

[21]  Krishna P. Gummadi,et al.  Measurement and analysis of online social networks , 2007, IMC '07.

[22]  Shang-Hua Teng,et al.  Nearly-linear time algorithms for graph partitioning, graph sparsification, and solving linear systems , 2003, STOC '04.

[23]  M. Newman,et al.  Why social networks are different from other types of networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  Fan Chung Graham,et al.  Local Graph Partitioning using PageRank Vectors , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).