论文信息 - Asymmetric Transitivity Preserving Graph Embedding

Asymmetric Transitivity Preserving Graph Embedding

Graph embedding algorithms embed a graph into a vector space where the structure and the inherent properties of the graph are preserved. The existing graph embedding methods cannot preserve the asymmetric transitivity well, which is a critical property of directed graphs. Asymmetric transitivity depicts the correlation among directed edges, that is, if there is a directed path from u to v, then there is likely a directed edge from u to v. Asymmetric transitivity can help in capturing structures of graphs and recovering from partially observed graphs. To tackle this challenge, we propose the idea of preserving asymmetric transitivity by approximating high-order proximity which are based on asymmetric transitivity. In particular, we develop a novel graph embedding algorithm, High-Order Proximity preserved Embedding (HOPE for short), which is scalable to preserve high-order proximities of large scale graphs and capable of capturing the asymmetric transitivity. More specifically, we first derive a general formulation that cover multiple popular high-order proximity measurements, then propose a scalable embedding algorithm to approximate the high-order proximity measurements based on their general formulation. Moreover, we provide a theoretical upper bound on the RMSE (Root Mean Squared Error) of the approximation. Our empirical experiments on a synthetic dataset and three real-world datasets demonstrate that HOPE can approximate the high-order proximities significantly better than the state-of-art algorithms and outperform the state-of-art algorithms in tasks of reconstruction, link prediction and vertex recommendation.

[1] K. Selçuk Candan,et al. How Does the Data Sampling Strategy Impact the Discovery of Information Diffusion in Social Media? , 2010, ICWSM.

[2] Yihong Gong,et al. Combining content and link for classification using matrix factorization , 2007, SIGIR.

[3] P. Pattison,et al. New Specifications for Exponential Random Graph Models , 2006 .

[4] Stephen Lin,et al. Graph Embedding and Extensions: A General Framework for Dimensionality Reduction , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5] B. Scholkopf,et al. Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[6] Thomas L. Griffiths,et al. Nonparametric Latent Feature Models for Link Prediction , 2009, NIPS.

[7] Mikhail Belkin,et al. Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[8] Israel Cohen,et al. Embedding and function extension on directed graph , 2015, Signal Process..

[9] Jieping Ye,et al. Two-Dimensional Linear Discriminant Analysis , 2004, NIPS.

[10] Yuxiao Hu,et al. Face recognition using Laplacianfaces , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11] Marko Bajec,et al. Model of complex networks based on citation dynamics , 2013, WWW.

[12] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[13] Yin Zhang,et al. Scalable proximity estimation and link prediction in online social networks , 2009, IMC '09.

[14] Heng Tao Shen,et al. Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[15] Charu C. Aggarwal,et al. Factorized Similarity Learning in Networks , 2014, 2014 IEEE International Conference on Data Mining.

[16] Peter D. Hoff,et al. Multiplicative latent factor models for description and prediction of social networks , 2009, Comput. Math. Organ. Theory.

[17] Steven Skiena,et al. DeepWalk: online learning of social representations , 2014, KDD.