Interest-Based Clustering Approach for Social Networks

Recently, the applications of community detection have increased because of their effectiveness in identifying communities correctly. Many methods and algorithms have been introduced to bring new insights that will improve community detection in social networks. While such algorithms can find useful communities, they tend to focus on network structure and ignore node interests and interconnections. However, accurate community detection requires the consideration of both network structure and node interests. The best method to achieve this is by utilizing unsupervised models. In this article, we introduce a new approach for social network clustering, termed Interest-based Clustering, which clusters nodes in social networks based on a measure of interest similarity. It considers structure, interaction, and node interest along with nodes friends’ interests. The empirical evaluation of this new approach was done using real dataset crawled from Twitter. The approach outperforms well-known community detections algorithms, SCAN, Fast Modularity, Zhao et al., in terms of modularity, connectivity, and overlapping.

[1]  Krishna P. Gummadi,et al.  Inferring who-is-who in the Twitter social network , 2012, WOSN '12.

[2]  Xiangjian He,et al.  Community Detection Based on Links and Node Features in Social Networks , 2015, MMM.

[3]  Kwan Hui Lim,et al.  Finding twitter communities with common interests using following links of celebrities , 2012, MSM '12.

[4]  Jure Leskovec,et al.  Structure and Overlaps of Ground-Truth Communities in Networks , 2014, TIST.

[5]  Xiaowei Xu,et al.  SCAN: a structural clustering algorithm for networks , 2007, KDD '07.

[6]  Barbara Carminati,et al.  User similarities on social networks , 2013, Social Network Analysis and Mining.

[7]  Krishna P. Gummadi,et al.  Cognos: crowdsourcing search for topic experts in microblogs , 2012, SIGIR '12.

[8]  M. Newman,et al.  Mixing patterns in networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Dong Zhou,et al.  Translation techniques in cross-language information retrieval , 2012, CSUR.

[10]  Hui Xiong,et al.  Introduction to special section on intelligent mobile knowledge discovery and management systems , 2013, ACM Trans. Intell. Syst. Technol..

[11]  Satish Rao,et al.  A Flow-Based Method for Improving the Expansion or Conductance of Graph Cuts , 2004, IPCO.

[12]  Muhammad Abulaish,et al.  Overlapping Social Network Communities and Viral Marketing , 2013, 2013 International Symposium on Computational and Business Intelligence.

[13]  George A. Vouros,et al.  Mapping Ontologies Elements using Features in a Latent Space , 2007 .

[14]  Qiang Wang,et al.  Topic oriented community detection through social objects and link analysis in social networks , 2012, Knowl. Based Syst..

[15]  Noël Crespi,et al.  Alike people, alike interests? A large-scale study on interest similarity in social networks , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[16]  Hakim Hacid,et al.  Social networks and information retrieval, how are they converging? A survey, a taxonomy and an analysis of social information retrieval approaches and platforms , 2016, Inf. Syst..

[17]  Marcel Karnstedt,et al.  Graph-Based Methods for Clustering Topics of Interest in Twitter , 2015, ICWE.

[18]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[19]  Fan Chung Graham,et al.  Local Graph Partitioning using PageRank Vectors , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[20]  Ramesh Nallapati,et al.  Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora , 2009, EMNLP.

[21]  Mark E. J. Newman,et al.  Community detection and graph partitioning , 2013, ArXiv.

[22]  Gang Yu,et al.  JacUOD: A New Similarity Measurement for Collaborative Filtering , 2012, Journal of Computer Science and Technology.

[23]  Mao-Bin Hu,et al.  Detect overlapping and hierarchical community structure in networks , 2008, ArXiv.

[24]  Jure Leskovec,et al.  Defining and evaluating network communities based on ground-truth , 2012, Knowledge and Information Systems.

[25]  Kwan Hui Lim,et al.  Following the follower: detecting communities with common interests on twitter , 2012, HT '12.

[26]  Jae-Gil Lee,et al.  Community Detection in Multi-Layer Graphs: A Survey , 2015, SGMD.

[27]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[28]  Mark E. J. Newman,et al.  Spectral methods for network community detection and graph partitioning , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  Cristopher Moore,et al.  Community detection in networks with unequal groups , 2015, Physical review. E.

[30]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[31]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Kyomin Jung,et al.  LinkSCAN*: Overlapping community detection using the link-space transformation , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[33]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[34]  Dan Frankowski,et al.  Collaborative Filtering Recommender Systems , 2007, The Adaptive Web.

[35]  Tsuyoshi Murata,et al.  Community Detection in Multi-Partite Multi-Relational Networks Based on Information Compression , 2016, New Generation Computing.

[36]  Yiannis Kompatsiaris,et al.  Graph-based multimodal clustering for social multimedia , 2017, Multimedia Tools and Applications.

[37]  Myra Spiliopoulou,et al.  Logic Programming to Address Issues of the Semantic Web , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[38]  Peilin Yang,et al.  An overlapping community detection algorithm based on density peaks , 2017, Neurocomputing.

[39]  Noël Crespi,et al.  CSD: A multi-user similarity metric for community recommendation in online social networks , 2016, Expert Syst. Appl..

[40]  Noël Crespi,et al.  Alike people, alike interests? Inferring interest similarity in online social networks , 2015, Decis. Support Syst..

[41]  Jing Xiao,et al.  Friend Recommendation by User Similarity Graph Based on Interest in Social Tagging Systems , 2015, ICIC.

[42]  Ying Ding,et al.  Community detection: Topological vs. topical , 2011, J. Informetrics.

[43]  Abdelghani Bouras,et al.  A New Similarity Measure for the Profiles Management , 2011, 2011 UkSim 13th International Conference on Computer Modelling and Simulation.

[44]  Andreas Stafylopatis,et al.  Mining Interesting Topics in Twitter Communities , 2015, ICCCI.

[45]  Dong Wang,et al.  Discovering Similar Users on Twitter , 2013 .

[46]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[47]  Yang Zhang,et al.  Community Discovery in Twitter Based on User Interests , 2012 .

[48]  Meng Wang,et al.  People Opinion Topic Model: Opinion based User Clustering in Social Networks , 2017, WWW.

[49]  Philippe Fournier-Viger,et al.  Inferring social network user profiles using a partial social graph , 2016, Journal of Intelligent Information Systems.

[50]  Charu C. Aggarwal,et al.  Data Mining: The Textbook , 2015 .

[51]  Mourad Ykhlef,et al.  Toward Information Diffusion Model for Viral Marketing in Business , 2016 .

[52]  Matthew Michelson,et al.  Tweet Disambiguate Entities Retrieve Folksonomy SubTree Step 1 : Discover Categories Generate Topic Profile from SubTrees Step 2 : Discover Profile Topic Profile : “ English Football ” “ World Cup ” , 2010 .

[53]  Seyed Mostafa Fakhrahmad,et al.  A user-profile-based friendship recommendation solution in social networks , 2015, J. Inf. Sci..

[54]  Ioannis Stavrakakis,et al.  ISCoDe: A framework for interest similarity-based community detection in social networks , 2011, 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[55]  Jure Leskovec,et al.  Empirical comparison of algorithms for network community detection , 2010, WWW '10.

[56]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[57]  Boleslaw K. Szymanski,et al.  Overlapping community detection in networks: The state-of-the-art and comparative study , 2011, CSUR.

[58]  Krishna P. Gummadi,et al.  Inferring user interests in the Twitter social network , 2014, RecSys '14.

[59]  Mourad Ykhlef,et al.  A novel spreading framework using incremental clustering for viral marketing , 2014, 2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA).

[60]  E A Leicht,et al.  Community structure in directed networks. , 2007, Physical review letters.

[61]  Huan Liu,et al.  Community Detection and Mining in Social Media , 2010, Community Detection and Mining in Social Media.

[62]  Frederick Kin Hing Phoa,et al.  A scanning method for detecting clustering pattern of both attribute and structure in social networks , 2016 .