AHNA: Adaptive representation learning for attributed heterogeneous networks

Meta‐path‐based random walk strategy has attracted tremendous attention in heterogeneous network representation, which can capture network semantics with heterogeneous neighborhoods of nodes. Despite the success of meta‐path‐based random walk strategy in plain heterogeneous networks which contain no attributes, it remains unexplored how meta‐path‐based random walk strategy could be utilized on attributed heterogeneous networks to simultaneously capture structural heterogeneity and attribute proximity. Moreover, the importance of node attributes and structural relations generally varies across data sets, thus requiring careful considerations when they are incorporated into representations. To tackle these problems, we propose a novel method, Attributed Heterogeneous Network embedding based on Aggregate‐path (AHNA), which generates aggregate‐path‐based random walks on attributed heterogeneous networks and adaptively fuses topological structures and node attributes based on the learned importance. Specifically, AHNA first converts node attributes to additional links in the network to deal with the heterogeneity of structures and attributes, which is followed by an adaptive random walk strategy to strike the importance balance between node attributes and topological structures, thereby generating high‐quality representations. Extensive experiments are conducted on three real‐world data sets, where AHNA outperforms state‐of‐the‐art approaches by up to 22.7%, 2.6%, and 2.3% on link prediction, community detection, and node classification, respectively. Moreover, our qualitative analysis indicates that AHNA can capture different balances of topological structures and node attributes on various data sets and thus boost the quality of node representations.

[1]  Michael Granitzer,et al.  Signed heterogeneous network embedding in social media , 2020, SAC.

[2]  Philip S. Yu,et al.  HeteSim: A General Framework for Relevance Measure in Heterogeneous Networks , 2013, IEEE Transactions on Knowledge and Data Engineering.

[3]  Xiangnan He,et al.  Attributed Social Network Embedding , 2017, IEEE Transactions on Knowledge and Data Engineering.

[4]  Heli Sun,et al.  CMG2Vec: A composite meta-graph based heterogeneous information network embedding approach , 2021, Knowl. Based Syst..

[5]  R. Kronmal,et al.  On the Alias Method for Generating Random Variables From a Discrete Distribution , 1979 .

[6]  Joachim H. Ahrens,et al.  An alias method for sampling from the normal distribution , 1989, Computing.

[7]  L. Farrand THE AMERICAN PSYCHOLOGICAL ASSOCIATION. , 1897, Science.

[8]  Wei Wang,et al.  Predicting Disease-related Associations by Heterogeneous Network Embedding , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[9]  Nitesh V. Chawla,et al.  metapath2vec: Scalable Representation Learning for Heterogeneous Networks , 2017, KDD.

[10]  Yizhou Sun,et al.  Heterogeneous Graph Transformer , 2020, WWW.

[11]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[12]  Jiliang Tang,et al.  Node Similarity Preserving Graph Convolutional Networks , 2020, WSDM.

[13]  Nikos A. Vlassis,et al.  The global k-means clustering algorithm , 2003, Pattern Recognit..

[14]  Philip S. Yu,et al.  Heterogeneous Information Network Embedding for Recommendation , 2017, IEEE Transactions on Knowledge and Data Engineering.

[15]  Philip S. Yu,et al.  A Comprehensive Survey on Graph Neural Networks , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[16]  Guohui Ling,et al.  SINE: Side Information Network Embedding , 2019, DASFAA.

[17]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[18]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[19]  Jian Liu,et al.  Dual Similarity Regularization for Recommendation , 2016, PAKDD.

[20]  Philip S. Yu,et al.  BL-MNE: Emerging Heterogeneous Social Network Embedding Through Broad Learning with Aligned Autoencoder , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[21]  Zhiyuan Liu,et al.  Graph Neural Networks: A Review of Methods and Applications , 2018, AI Open.

[22]  Jie Zhou,et al.  Measuring and Relieving the Over-smoothing Problem for Graph Neural Networks from the Topological View , 2020, AAAI.

[23]  Nikos Mamoulis,et al.  Heterogeneous Information Network Embedding for Meta Path based Proximity , 2017, ArXiv.

[24]  Ke Zhang,et al.  Energy-Efficient Offloading for Mobile Edge Computing in 5G Heterogeneous Networks , 2016, IEEE Access.

[25]  Alex Graves,et al.  Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.

[26]  Yang Li,et al.  Network Embedding for Community Detection in Attributed Networks , 2020, ACM Trans. Knowl. Discov. Data.

[27]  Fang Wu,et al.  Finding communities in linear time: a physics approach , 2003, ArXiv.

[28]  Heng Huang,et al.  Deep Attributed Network Embedding , 2018, IJCAI.

[29]  Yanfang Ye,et al.  Heterogeneous Graph Attention Network , 2019, WWW.

[30]  Jieping Ye,et al.  An Attention-based Graph Neural Network for Heterogeneous Structural Learning , 2019, AAAI.

[31]  Irwin King,et al.  MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding , 2020, WWW.

[32]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[33]  Xiao Huang,et al.  Accelerated Attributed Network Embedding , 2017, SDM.

[34]  Wenwu Zhu,et al.  Structural Deep Network Embedding , 2016, KDD.

[35]  Chunyan Feng,et al.  User identity linkage across social networks via linked heterogeneous network embedding , 2018, World Wide Web.

[36]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[37]  Jiajun Bu,et al.  ANRL: Attributed Network Representation Learning via Deep Neural Networks , 2018, IJCAI.

[38]  Shuiwang Ji,et al.  Towards Deeper Graph Neural Networks , 2020, KDD.

[39]  Nitesh V. Chawla,et al.  Heterogeneous Graph Neural Network , 2019, KDD.

[40]  J. Hilbe Logistic Regression Models , 2009 .

[41]  Kristin L. Sainani,et al.  Logistic Regression , 2014, PM & R : the journal of injury, function, and rehabilitation.

[42]  Min Wu,et al.  mg2vec: Learning Relationship-Preserving Heterogeneous Graph Representations via Metagraph Embedding , 2020, IEEE Transactions on Knowledge and Data Engineering.

[43]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[44]  Chuan Shi,et al.  Local and Global Information Fusion for Top-N Recommendation in Heterogeneous Information Network , 2018, CIKM.

[45]  Omer Levy,et al.  word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method , 2014, ArXiv.

[46]  Min Li,et al.  HNEDTI: Prediction of drug-target interaction based on heterogeneous network embedding , 2019, 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).