Hierarchical Fuzzy Spectral Clustering in Social Networks using Spectral Characterization

An important aspect of community analysis is not only determining the communities within the network, but also sub-communities and hierarchies. We present an approach for finding hierarchies in social networks that uses work from random matrix theory to estimate the number of clusters. The method analyzes the spectral fingerprint of the network to determine the level of hierarchy in the network. Using this information to inform the choice of clusters, the network is broken into successively smaller communities that are attached to their parents via Jaccard similarity. The efficacy of the approach is examined on two well known real world social networks as well as a political social network derived from campaign finance data. Introduction Many real world networks are characterized by dense subnetworks that are commonly referred to as communities and are generally composed of groups of nodes that have elements in common with each other. Examples of networks that have community structure can be drawn from social (Fortunato 2009), biological (Power et al. 2011), gene expression (Zhang and Horvath 2005), and many other types of networks. Since the communities can represent fundamental properties of the network, their discovery is important for understanding the nature of the networks (Newman and Girvan 2004),(Flake et al. 2002). The primary focus of this paper is on social networks. To best represent the communities, a classification of the nodes into clusters should satisfy two important realities of many social networks: overlap and hierarchy. For the first, nodes within the network may belong to multiple communities. Much like in human social groups, an individual may belong to more than one community or have multiple affiliations (Zhang, Wang, and Zhang 2007). Hierarchy is another important aspect of some social networks wherein smaller communities together make up larger ones. Military, business, and political hierarchies are all examples where individual smaller groups combine into a larger group. ∗also affiliated with the National Institute on Money in State Politics Copyright c © 2015, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. There is already a wealth of research on finding communities within networks. Some initial work focused on crisp splits of the network into non-overlapping, non-hierarchical communities (Newman and Girvan 2004), (Newman 2006). As part of this, a method for evaluating the quality of a partitioning of the data into clusters was developed called modularity. The idea behind modularity is to determine how well a community split describes the likelihood of the community as it relates to a null model, where each node keeps the same degree but is connected at random to other nodes. This is defined by Q = ∑

[1]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[2]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[3]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[4]  E. Ott,et al.  Spectral properties of networks with community structure. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Somwrita Sarkar,et al.  Spectral Characterization of Hierarchical Network Modularity and Limits of Modularity Detection , 2013, PloS one.

[6]  Bart Selman,et al.  Tracking evolving communities in large linked networks , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Vicenç Torra,et al.  Fuzzy c-means for Fuzzy Hierarchical Clustering , 2005, The 14th IEEE International Conference on Fuzzy Systems, 2005. FUZZ '05..

[8]  Charles R. MacCluer,et al.  The Many Proofs and Applications of Perron's Theorem , 2000, SIAM Rev..

[9]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  S. Horvath,et al.  Statistical Applications in Genetics and Molecular Biology , 2011 .

[11]  Jian Liu,et al.  Fuzzy modularity and fuzzy community structure in networks , 2010 .

[12]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[13]  PothenAlex,et al.  Partitioning sparse matrices with eigenvectors of graphs , 1990 .

[14]  M. Newman,et al.  Identifying the role that animals play in their social networks , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[15]  Somwrita Sarkar,et al.  Community detection in graphs using singular value decomposition. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[17]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Shihua Zhang,et al.  Identification of overlapping community structure in complex networks using fuzzy c-means clustering , 2007 .

[19]  Myra Spiliopoulou,et al.  MONIC: modeling and monitoring cluster transitions , 2006, KDD '06.

[20]  Sanghamitra Bandyopadhyay Automatic Determination of the Number of Fuzzy Clusters Using Simulated Annealing with Variable Representation , 2005, ISMIS.

[21]  Marimuthu Palaniswami,et al.  A Soft Modularity Function For Detecting Fuzzy Communities in Social Networks , 2013, IEEE Transactions on Fuzzy Systems.

[22]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[23]  Jean-Loup Guillaume,et al.  Static community detection algorithms for evolving networks , 2010, 8th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks.

[24]  Aviv Segev,et al.  Analyzing future communities in growing citation networks , 2013, Knowl. Based Syst..

[25]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[26]  Timothy O. Laumann,et al.  Functional Network Organization of the Human Brain , 2011, Neuron.

[27]  Adam Bonica,et al.  Mapping the Ideological Marketplace , 2013 .

[28]  Xiaoming Liu,et al.  SLPA: Uncovering Overlapping Communities in Social Networks via a Speaker-Listener Interaction Dynamic Process , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[29]  Arnaud Devillez,et al.  A fuzzy hybrid hierarchical clustering method with a new criterion able to find the optimal partition , 2002, Fuzzy Sets Syst..

[30]  Alex Pothen,et al.  PARTITIONING SPARSE MATRICES WITH EIGENVECTORS OF GRAPHS* , 1990 .