$k$ -Context Technique: A Method for Identifying Dense Subgraphs in a Heterogeneous Information Network

Most real-life problems can be modeled using heterogeneous networks, as they consist of several interconnected entities. However, researchers often study most of these problems as projected homogeneous networks. In such networks, different entities can be merged, some entities can be removed, or interactions among some entities can be overlooked, which results in a significant information loss. Moreover, these networks often overlook different structural possibilities in heterogeneous networks. Different structural possibilities may include a hierarchical network topology or a centralized network topology. Mining on different structural topologies of a heterogeneous information network may lead to a better understanding of the behavior and characteristics of the individual entities and the network as a whole. This article proposes a novel approach named $k$ -context technique for extracting dense subgraphs from different structural topologies of the heterogeneous information network. The approach has been evaluated using both synthetic and real-life data sets, and it compared the results with an existing algorithm for finding dense subgraphs from a homogeneous information network.

[1]  Adrian Perrig,et al.  Distillation Codes and Applications to DoS Resistant Multicast Authentication , 2004, NDSS.

[2]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[3]  TanKian-Lee,et al.  On triangulation-based dense neighborhood graph discovery , 2010, VLDB 2010.

[4]  Yizhou Sun,et al.  Distant Meta-Path Similarities for Text-Based Heterogeneous Information Networks , 2017, CIKM.

[5]  Ruixuan Li,et al.  Incremental K-clique clustering in dynamic social networks , 2012, Artificial Intelligence Review.

[6]  Yizhou Sun,et al.  Recommendation in heterogeneous information networks with implicit user feedback , 2013, RecSys.

[7]  Ran Canetti,et al.  Efficient and Secure Source Authentication for Multicast , 2001, NDSS.

[8]  Yu Wang,et al.  Semantic Preference-Based Personalized Recommendation on Heterogeneous Information Network , 2017, IEEE Access.

[9]  Philip S. Yu,et al.  Integrating meta-path selection with user-guided object clustering in heterogeneous information networks , 2012, KDD.

[10]  Yizhou Sun,et al.  RankClus: integrating clustering with ranking for heterogeneous information network analysis , 2009, EDBT '09.

[11]  Christopher C. Yang,et al.  Enriching User Experience in Online Health Communities Through Thread Recommendations and Heterogeneous Information Network Mining , 2018, IEEE Transactions on Computational Social Systems.

[12]  Sasikanth Avancha,et al.  Security for Sensor Networks , 2004 .

[13]  Philip S. Yu,et al.  Integrating Clustering and Ranking on Hybrid Heterogeneous Information Network , 2013, PAKDD.

[14]  Hal Daumé,et al.  Co-regularized Multi-view Spectral Clustering , 2011, NIPS.

[15]  Nitin Agarwal,et al.  Focal structures analysis: identifying influential sets of individuals in a social network , 2016, Social Network Analysis and Mining.

[16]  Charu C. Aggarwal,et al.  On clustering heterogeneous social media objects with outlier links , 2012, WSDM '12.

[17]  Refael Hassin,et al.  Complexity of finding dense subgraphs , 2002, Discret. Appl. Math..

[18]  András Faragó,et al.  Finding Dense Subgraphs Efficiently , 2005, FCS.

[19]  Chen Luo,et al.  Semi-supervised Clustering on Heterogeneous Information Networks , 2014, PAKDD.

[20]  Jiawei Han,et al.  Mining Heterogeneous Information Networks by Exploring the Power of Links , 2009, ALT.

[21]  Wang-Cheol Song,et al.  Clustering Objects in Heterogeneous Information Network Using Fuzzy C-Mean , 2013, MUSIC.

[22]  Jiawei Han,et al.  On detecting Association-Based Clique Outliers in heterogeneous information networks , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[23]  Su Deng,et al.  Multi-way Clustering for Heterogeneous Information Networks with General Network Schema , 2016, 2016 IEEE International Conference on Computer and Information Technology (CIT).

[24]  Yuanzhuo Wang,et al.  Link Inference in Dynamic Heterogeneous Information Network: A Knapsack-Based Approach , 2017, IEEE Transactions on Computational Social Systems.

[25]  Sean W. Smith,et al.  SAM: a flexible and secure auction architecture using trusted hardware , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[26]  S. Horvath,et al.  A General Framework for Weighted Gene Co-Expression Network Analysis , 2005, Statistical applications in genetics and molecular biology.

[27]  Li Li,et al.  Mining Dual Networks , 2016, ACM Trans. Knowl. Discov. Data.

[28]  Dawn Xiaodong Song,et al.  ELK, a new protocol for efficient large-group key distribution , 2001, Proceedings 2001 IEEE Symposium on Security and Privacy. S&P 2001.

[29]  Yizhou Sun,et al.  Ranking-based clustering of heterogeneous information networks with star network schema , 2009, KDD.

[30]  R. Luce,et al.  Connectivity and generalized cliques in sociometric group structure , 1950, Psychometrika.

[31]  Sandra Sudarsky,et al.  Massive Quasi-Clique Detection , 2002, LATIN.

[32]  Ravi Kumar,et al.  Discovering Large Dense Subgraphs in Massive Graphs , 2005, VLDB.

[33]  Hinrich Schütze,et al.  Scoring , term weighting and thevector space model , 2015 .

[34]  Aristides Gionis,et al.  Piggybacking on Social Networks , 2013, Proc. VLDB Endow..

[35]  Thomas Seidl,et al.  Density-Based Subspace Clustering in Heterogeneous Networks , 2014, ECML/PKDD.

[36]  Xiaoran Xu,et al.  A novel probabilistic clustering model for heterogeneous networks , 2016, Machine Learning.

[37]  Marco Pellegrini,et al.  Extraction and classification of dense communities in the web , 2007, WWW '07.

[38]  Muriel Médard,et al.  XORs in the Air: Practical Wireless Network Coding , 2006, IEEE/ACM Transactions on Networking.

[39]  Philip S. Yu,et al.  PathSim , 2011, Proc. VLDB Endow..

[40]  Jiebo Luo,et al.  Multi-type Co-clustering of General Heterogeneous Information Networks via Nonnegative Matrix Tri-Factorization , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[41]  Yizhou Sun,et al.  Mining heterogeneous information networks: a structural analysis approach , 2013, SKDD.

[42]  Philip S. Yu,et al.  Reducing uncertainty of dynamic heterogeneous information networks: a fusing reconstructing approach , 2017, Data Mining and Knowledge Discovery.

[43]  Wei Cheng,et al.  CGC: A Flexible and Robust Approach to Integrating Co-Regularized Multi-Domain Graph for Clustering , 2016, ACM Trans. Knowl. Discov. Data.

[44]  Mianxiong Dong,et al.  MultiSpectralNet: Spectral Clustering Using Deep Neural Network for Multi-View Data , 2019, IEEE Transactions on Computational Social Systems.

[45]  Chris Volinsky,et al.  Network-Based Marketing: Identifying Likely Adopters Via Consumer Networks , 2006, math/0606278.

[46]  Hinrich Schütze,et al.  Introduction to Information Retrieval: Scoring, term weighting, and the vector space model , 2008 .

[47]  Ran Canetti,et al.  Efficient authentication and signing of multicast streams over lossy channels , 2000, Proceeding 2000 IEEE Symposium on Security and Privacy. S&P 2000.

[48]  Xiang Li,et al.  Multiplex Influence Maximization in Online Social Networks With Heterogeneous Diffusion Models , 2018, IEEE Transactions on Computational Social Systems.