Knowledge Graph Enhanced Community Detection and Characterization

Recent studies show that by combining network topology and node attributes, we can better understand community structures in complex networks. However, existing algorithms do not explore "contextually" similar node attribute values, and therefore may miss communities defined with abstract concepts. We propose a community detection and characterization algorithm that incorporates the contextual information of node attributes described by multiple domain-specific hierarchical concept graphs. The core problem is to find the context that can best summarize the nodes in communities, while also discovering communities aligned with the context summarizing communities. We formulate the two intertwined problems, optimal community-context computation, and community discovery, with a coordinate-ascent based algorithm that iteratively updates the nodes' community label assignment with a community-context and computes the best context summarizing nodes of each community. Our unique contributions include (1) a composite metric on Informativeness and Purity criteria in searching for the best context summarizing nodes of a community; (2) a node similarity measure that incorporates the context-level similarity on multiple node attributes; and (3) an integrated algorithm that drives community structure discovery by appropriately weighing edges. Experimental results on public datasets show nearly 20 percent improvement on F-measure and Jaccard for discovering underlying community structure over the current state-of-the-art of community detection methods. Community structure characterization was also accurate to find appropriate community types for four datasets.

[1]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[2]  Zhen Wang,et al.  Community Detection Based on Structure and Content: A Content Propagation Perspective , 2015, 2015 IEEE International Conference on Data Mining.

[3]  Pengfei Wang,et al.  Modeling Retail Transaction Data for Personalized Shopping Recommendation , 2014, CIKM.

[4]  Ronaldo Menezes,et al.  Assessing the suitability of network community detection to available meta-data using rank stability , 2017, WI.

[5]  Amit P. Sheth,et al.  Enhancing crowd wisdom using measures of diversity computed from social media data , 2017, WI.

[6]  Neil M. Ferguson,et al.  Understanding Peace and Conflict Through Social Identity Theory: Contemporary Global Perspectives , 2016 .

[7]  Olivier Corby,et al.  Analysis of a Real Online Social Network Using Semantic Web Frameworks , 2009, SEMWEB.

[8]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Barbora Micenková,et al.  Clustering attributed graphs: Models, measures and methods , 2015, Network Science.

[10]  Sergey Ioffe,et al.  Improved Consistent Sampling, Weighted Minhash and L1 Sketching , 2010, 2010 IEEE International Conference on Data Mining.

[11]  Xiaochun Cao,et al.  Semantic Community Identification in Large Attribute Networks , 2016, AAAI.

[12]  Yuan Zhang,et al.  Community Detection in Networks with Node Features , 2015, Electronic Journal of Statistics.

[13]  Yunming Ye,et al.  The Author-Topic-Community model for author interest profiling and community discovery , 2014, Knowledge and Information Systems.

[14]  Michael Vitale,et al.  The Wisdom of Crowds , 2015, Cell.

[15]  Dan Roth,et al.  Incorporating World Knowledge to Document Clustering via Heterogeneous Information Networks , 2015, KDD.

[16]  Guillermo Palma,et al.  Drug-Target Interaction Prediction Using Semantic Similarity and Edge Partitioning , 2014, SEMWEB.

[17]  Christian Bizer,et al.  DBpedia spotlight: shedding light on the web of documents , 2011, I-Semantics '11.

[18]  Emilio Ferrara,et al.  Latent Space Model for Multi-Modal Social Data , 2015, WWW.

[19]  Phuc Do,et al.  Discovering Communities of Users on Social Networks Based on Topic Model Combined with Kohonen Network , 2015, 2015 Seventh International Conference on Knowledge and Systems Engineering (KSE).

[20]  Mark E. J. Newman,et al.  Structure and inference in annotated networks , 2015, Nature Communications.

[21]  Amit P. Sheth,et al.  Harnessing relationships for domain-specific subgraph extraction: A recommendation use case , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[22]  Christian Böhm,et al.  Attributed Graph Clustering with Unimodal Normalized Cut , 2017, ECML/PKDD.

[23]  R. Preston McAfee,et al.  The wisdom of smaller, smarter crowds , 2014, EC.

[24]  David Sánchez,et al.  A New Model to Compute the Information Content of Concepts from Taxonomic Knowledge , 2012, Int. J. Semantic Web Inf. Syst..

[25]  Jian Yu,et al.  Node Attribute-enhanced Community Detection in Complex Networks , 2017, Scientific Reports.

[26]  Fatima-Zahra Belouadha,et al.  Towards a unified semantic model for online social networks analysis and interoperability , 2015, 2015 10th International Conference on Intelligent Systems: Theories and Applications (SITA).

[27]  Ben Shneiderman,et al.  A dual-view approach to interactive network visualization , 2007, CIKM '07.

[28]  Weixiong Zhang,et al.  Joint Identification of Network Communities and Semantics via Integrative Modeling of Network Topologies and Node Contents , 2017, AAAI.

[29]  Francesco Bonchi,et al.  Description-Driven Community Detection , 2014, TIST.

[30]  Richard J. Hazler,et al.  Social Isolation of Youth at Risk: Conceptualizations and Practical Implications , 2002 .

[31]  Stanford,et al.  Learning to Discover Social Circles in Ego Networks , 2012 .

[32]  Jure Leskovec,et al.  Learning to Discover Social Circles in Ego Networks , 2012, NIPS.

[33]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.