Key Nodes Cluster Augmented Embedding for Heterogeneous Information Networks

Heterogeneous Information Networks (HINs), composed of multiple types of node and relation, usually have more expressive ability for complex relational data. Recently, network embedding aiming to project the network into a low-dimensional vector space has received much attention. Most existing embedding methods for HINs utilize meta-path to capture the proximity of node. However, these methods usually ignore the inequivalence of different types of node and clustering structure of network, which are important characteristics of HINs. Hence, we propose a key node based heterogeneous network embedding method enhanced by the clustering information. In our method, we first utilize a meta-path guided random walk to obtain general node representations in terms of rich heterogeneous semantic features in HINs. To indicate different equivalence of nodes, we define key nodes which are usually in the essential location in HINs, such as paper in the bibliographic HINs. Afterwards, we incorporate the clustering structure of the key nodes into network embedding learning via Gaussian Mixture Model to further enhance the representations of nodes. Lastly, we design a unified objective function to mutually learn the two parts effectively. Extensive experiments are conducted and the results validate the effectiveness of our model.

[1]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[2]  Minyi Guo,et al.  SHINE: Signed Heterogeneous Information Network Embedding for Sentiment Link Prediction , 2017, WSDM.

[3]  Chuan Shi,et al.  Local and Global Information Fusion for Top-N Recommendation in Heterogeneous Information Network , 2018, CIKM.

[4]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[5]  Philip S. Yu,et al.  PathSim , 2011, Proc. VLDB Endow..

[6]  Wenwu Zhu,et al.  Structural Deep Network Embedding , 2016, KDD.

[7]  Huaming Wu,et al.  Variational autoencoder based bipartite network embedding by integrating local and global structure , 2020, Inf. Sci..

[8]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.

[9]  Kevin Chen-Chuan Chang,et al.  Learning Community Embedding with Community Detection and Node Embedding on Graphs , 2017, CIKM.

[10]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[11]  Qiaozhu Mei,et al.  PTE: Predictive Text Embedding through Large-scale Heterogeneous Text Networks , 2015, KDD.

[12]  Wang-Chien Lee,et al.  HIN2Vec: Explore Meta-paths in Heterogeneous Information Networks for Representation Learning , 2017, CIKM.

[13]  Yaping Wang,et al.  SSNE: Status Signed Network Embedding , 2019, PAKDD.

[14]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[15]  Nitesh V. Chawla,et al.  metapath2vec: Scalable Representation Learning for Heterogeneous Networks , 2017, KDD.

[16]  Pengfei Jiao,et al.  NE-FLGC: Network Embedding Based on Fusing Local (First-Order) and Global (Second-Order) Network Structure with Node Content , 2018, PAKDD.

[17]  Ge Zhang,et al.  Detecting Communities with Multiplex Semantics by Distinguishing Background, General, and Specialized Topics , 2020, IEEE Transactions on Knowledge and Data Engineering.

[18]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .