Multi-objective community detection method by integrating users' behavior attributes

Social networks usually have abundant attributes associated with users to describe their features. Behavior attribute is one of the most important types of attribute which can better reflect users' intrinsic interests. In practice, many network applications prefer communities that not only are densely intra-connected, but also have homogeneous attribute value on specific behavior attributes. Structure clustering and attribute categorization are two types of method which can take full advantage of structure information and attribute information to partition the network, respectively. In this paper, we propose a novel community detection method by realizing structure clustering technology and attribute categorization technology simultaneously. Specifically, structure clustering is realized by optimizing modularity which captures densely intra-connected nature of communities. As for attribute categorization, a new metric named as homogeneity is defined to achieve the goal that nodes within each community have homogeneous attribute value, while in different communities have diverse attribute values. A multi-objective optimization evolutionary mechanism is adopted to optimize modularity and homogeneity simultaneously. Extensive experiments on several real-world networks demonstrate that our method can get a set of community structures corresponding to different trade-offs between structure clustering and attribute categorization.

[1]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[2]  Aristides Gionis,et al.  Overlapping community detection in labeled graphs , 2014, Data Mining and Knowledge Discovery.

[3]  François Poulet,et al.  Entropy based community detection in augmented social networks , 2011, 2011 International Conference on Computational Aspects of Social Networks (CASoN).

[4]  Francesco Bonchi,et al.  Description-Driven Community Detection , 2014, TIST.

[5]  Bin Wu,et al.  Multi-objective community detection in complex networks , 2012, Appl. Soft Comput..

[6]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[7]  Yue Gao,et al.  When Amazon Meets Google: Product Visualization by Exploring Multiple Web Sources , 2013, TOIT.

[8]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[9]  Zhoujun Li,et al.  Video recommendation over multiple information sources , 2012, Multimedia Systems.

[10]  Peng Wu,et al.  Multi-Objective Community Detection Based on Memetic Algorithm , 2015, PloS one.

[11]  Thomas Seidl,et al.  GAMer: a synthesis of subspace clustering and dense subgraph mining , 2013, Knowledge and Information Systems.

[12]  Meng Wang,et al.  Image clustering based on sparse patch alignment framework , 2014, Pattern Recognit..

[13]  Joshua D. Knowles,et al.  An Evolutionary Approach to Multiobjective Clustering , 2007, IEEE Transactions on Evolutionary Computation.

[14]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Gilbert Syswerda,et al.  Uniform Crossover in Genetic Algorithms , 1989, ICGA.

[16]  Clara Pizzuti,et al.  A Multiobjective Genetic Algorithm to Find Communities in Complex Networks , 2012, IEEE Transactions on Evolutionary Computation.

[17]  Hong Cheng,et al.  Graph Clustering Based on Structural/Attribute Similarities , 2009, Proc. VLDB Endow..

[18]  R. K. Ursem Multi-objective Optimization using Evolutionary Algorithms , 2009 .

[19]  Meng Wang,et al.  Semi-supervised distance metric learning based on local linear regression for data clustering , 2012, Neurocomputing.

[20]  Meng Wang,et al.  Visual Classification by ℓ1-Hypergraph Modeling , 2015, IEEE Trans. Knowl. Data Eng..

[21]  Yun Chi,et al.  Combining link and content for community detection: a discriminative approach , 2009, KDD.

[22]  Hong Cheng,et al.  A model-based approach to attributed graph clustering , 2012, SIGMOD Conference.

[23]  Mason A. Porter,et al.  Comparing Community Structure to Characteristics in Online Collegiate Social Networks , 2008, SIAM Rev..

[24]  Mason A. Porter,et al.  Social Structure of Facebook Networks , 2011, ArXiv.

[25]  DebK.,et al.  A fast and elitist multiobjective genetic algorithm , 2002 .

[26]  Tao Li,et al.  Entropy-based criterion in categorical clustering , 2004, ICML.

[27]  Maoguo Gong,et al.  Multiobjective Immune Algorithm with Nondominated Neighbor-Based Selection , 2008, Evolutionary Computation.

[28]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Martin Ester,et al.  Mining Cohesive Patterns from Graphs with Feature Vectors , 2009, SDM.

[30]  Lada A. Adamic,et al.  The political blogosphere and the 2004 U.S. election: divided they blog , 2005, LinkKDD '05.

[31]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[32]  Hong Cheng,et al.  Clustering Large Attributed Graphs: An Efficient Incremental Approach , 2010, 2010 IEEE International Conference on Data Mining.

[33]  Junhua Tang,et al.  Detecting community structure via synchronous label propagation , 2015, Neurocomputing.

[34]  Srinivasan Parthasarathy,et al.  Efficient community detection in large networks using content and links , 2012, WWW.

[35]  Jun Sese,et al.  Mining networks with shared items , 2010, CIKM.