Community detection in social networks with node attributes based on multi-objective biogeography based optimization

Detecting communities in complex networks is one of the most important issues considered when analyzing these kinds of networks. A majority of studies in the field of community detection tend to detect communities through analyzing linkages of the networks. What this paper aims to achieve is to reach to a trade-off between similarity of nodes' attributes and density of connections in finding communities of social networks with node attributes. Since the community detection problem can be modeled as a seriously non-linear discrete optimization problem, we have hereby proposed a multi-objective discrete Biogeography Based Optimization (BBO) algorithm to find communities in social networks with node attributes. This algorithm uses the Pareto-based approach for community detection. Also, we introduced a new metric, SimAtt, to measure the similarity of node attributes in a community of a network and used it along with Modularity, which considers the linkage structure of a network to detect communities, as the two objective functions of the proposed method to be maximized. In the proposed method, a two phase sorting strategy is introduced which uses the non-dominated sorting and Crowding-distance to sort the generated solution of a population in each iteration. Moreover, this paper introduces a method for mutation probability approximation and uses a chaotic mechanism to dynamically tune the mutation probability in each iteration. Additionally, two novel strategies are introduced for mutation in unweighted and weighted networks. Since the final output of the proposed method is a set of non-dominated (Pareto-optimal) solutions, a metric named alpha_SAM is proposed to determine the best compromise solution among these non-dominated ones. Quantitative evaluations based on extensive experiments on 14 real-life data sets reveals that the method presented in this study achieves favorable results which are quite superior to other relevant algorithms in the literature.

[1]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[2]  Ciro Cattuto,et al.  High-Resolution Measurements of Face-to-Face Contact Patterns in a Primary School , 2011, PloS one.

[3]  Minghao Yin,et al.  Multiobjective Binary Biogeography Based Optimization for Feature Selection Using Gene Expression Data , 2013, IEEE Transactions on NanoBioscience.

[4]  Dan Simon,et al.  Biogeography-Based Optimization , 2022 .

[5]  Jing Liu,et al.  A Multiobjective Evolutionary Algorithm Based on Similarity for Community Detection From Signed Social Networks , 2014, IEEE Transactions on Cybernetics.

[6]  ZhengYou Xia,et al.  Community detection based on a semantic network , 2012, Knowl. Based Syst..

[7]  Bin Li,et al.  Multiobjective biogeography based optimization algorithm with decomposition for community detection in dynamic networks , 2015 .

[8]  Qingfu Zhang,et al.  Community detection in networks by using multiobjective evolutionary algorithm with decomposition , 2012 .

[9]  P. Hespanha,et al.  An Efficient MATLAB Algorithm for Graph Partitioning , 2006 .

[10]  J. Reichardt,et al.  Statistical mechanics of community detection. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Santo Fortunato,et al.  Consensus clustering in complex networks , 2012, Scientific Reports.

[12]  Kristina Lerman,et al.  Partitioning Networks with Node Attributes by Compressing Information Flow , 2014, ACM Trans. Knowl. Discov. Data.

[13]  Clara Pizzuti,et al.  A Multiobjective Genetic Algorithm to Find Communities in Complex Networks , 2012, IEEE Transactions on Evolutionary Computation.

[14]  Huan Liu,et al.  eTrust: understanding trust evolution in an online world , 2012, KDD.

[15]  J. Liu,et al.  A multi-agent genetic algorithm for community detection in complex networks , 2016 .

[16]  F. Maxwell Harper,et al.  The MovieLens Datasets: History and Context , 2016, TIIS.

[17]  T. Nepusz,et al.  Fuzzy communities and the concept of bridgeness in complex networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Leon Danon,et al.  The effect of size heterogeneity on community identification in complex networks , 2006, physics/0601144.

[19]  Víctor M. Eguíluz,et al.  Distinguishing topical and social groups based on common identity and bond theory , 2013, WSDM.

[20]  Muhammad Yousefnezhad,et al.  Evaluating the effect of topic consideration in identifying communities of rating-based social networks , 2015, 2015 7th Conference on Information and Knowledge Technology (IKT).

[21]  Sean M. McNee,et al.  Improving recommendation lists through topic diversification , 2005, WWW '05.

[22]  Jie Liu,et al.  Multi-level learning based memetic algorithm for community detection , 2014, Appl. Soft Comput..

[23]  Martin Atzmüller,et al.  Description-oriented community detection using exhaustive subgroup discovery , 2016, Inf. Sci..

[24]  Antonino Nocera,et al.  Recommendation of similar users, resources and social networks in a Social Internetworking Scenario , 2011, Inf. Sci..

[25]  R. Carter 11 – IT and society , 1991 .

[26]  Ken Wakita,et al.  Finding community structure in mega-scale social networks: [extended abstract] , 2007, WWW '07.

[27]  Florence Sèdes,et al.  A community-based algorithm for deriving users’ profiles from egocentrics networks: experiment on Facebook and DBLP , 2012, Social Network Analysis and Mining.

[28]  Jing Liu,et al.  A comparative analysis of evolutionary and memetic algorithms for community detection from signed social networks , 2013, Soft Computing.

[29]  Alex Arenas,et al.  Analysis of the structure of complex networks at different resolution levels , 2007, physics/0703218.

[30]  Aboul Ella Hassanien,et al.  Community Detection Algorithm Based on Artificial Fish Swarm Optimization , 2014, IEEE Conf. on Intelligent Systems.

[31]  Behrouz Minaei-Bidgoli,et al.  Topic-oriented community detection of rating-based social networks , 2016, J. King Saud Univ. Comput. Inf. Sci..

[32]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Vaidyanathan K. Jayaraman,et al.  Biogeography-based informative gene selection and cancer classification using SVM and Random Forests , 2012, 2012 IEEE Congress on Evolutionary Computation.

[34]  Julie Fournet,et al.  Data on face-to-face contacts in an office building suggest a low-cost vaccination strategy based on community linkers , 2014, Network Science.

[35]  Rolf T. Wigand,et al.  Community Detection in Complex Networks: Multi-objective Enhanced Firefly Algorithm , 2013, Knowl. Based Syst..

[36]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[37]  Babak Amiri,et al.  A hybrid artificial immune network for detecting communities in complex networks , 2014, Computing.

[38]  Toon De Pessemier,et al.  MovieTweetings: a movie rating dataset collected from twitter , 2013, RecSys 2013.

[39]  Zhao Yuxin,et al.  Overlapping community detection in complex networks using multi-objective evolutionary algorithm , 2015, Computational and Applied Mathematics.

[40]  Haiping Ma,et al.  An analysis of the equilibrium of migration models for biogeography-based optimization , 2010, Inf. Sci..

[41]  Chengcui Zhang,et al.  A fast parallel modularity optimization algorithm (FPMQA) for community detection in online social network , 2013, Knowl. Based Syst..

[42]  Behrouz Minaei-Bidgoli,et al.  Detecting communities in topical semantic networks , 2015, 2015 7th Conference on Information and Knowledge Technology (IKT).

[43]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[44]  Maoguo Gong,et al.  Complex Network Clustering by Multiobjective Discrete Particle Swarm Optimization Based on Decomposition , 2014, IEEE Transactions on Evolutionary Computation.

[45]  Qiang Wang,et al.  Topic oriented community detection through social objects and link analysis in social networks , 2012, Knowl. Based Syst..

[46]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[47]  Luigi Fortuna,et al.  Chaotic sequences to improve the performance of evolutionary algorithms , 2003, IEEE Trans. Evol. Comput..

[48]  Maoguo Gong,et al.  Greedy discrete particle swarm optimization for large-scale social network clustering , 2015, Inf. Sci..

[49]  Bin Wu,et al.  Multi-objective community detection in complex networks , 2012, Appl. Soft Comput..

[50]  Barbara Carminati,et al.  User similarities on social networks , 2013, Social Network Analysis and Mining.

[51]  Feng Zou,et al.  Multi-objective optimization of community detection using discrete teaching-learning-based optimization with decomposition , 2016, Inf. Sci..

[52]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[53]  Haluk Bingol,et al.  Community Detection in Complex Networks Using Genetic Algorithms , 2006, 0711.0491.

[54]  M. Newman Communities, modules and large-scale structure in networks , 2011, Nature Physics.

[55]  Donald E. Knuth,et al.  The Stanford GraphBase - a platform for combinatorial computing , 1993 .

[56]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[57]  Gary B. Lamont,et al.  Evolutionary Algorithms for Solving Multi-Objective Problems , 2002, Genetic Algorithms and Evolutionary Computation.

[58]  Chris Hankin,et al.  Multi-scale Community Detection using Stability Optimisation within Greedy Algorithms , 2012, ArXiv.

[59]  Joshua D. Knowles,et al.  An Evolutionary Approach to Multiobjective Clustering , 2007, IEEE Transactions on Evolutionary Computation.

[60]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.