Improving network embedding with partially available vertex and edge content

Abstract Network embedding aims to learn a low-dimensional representation for each vertex in a network, which has recently shown its power in many graph mining problems such as vertex classification and link prediction. Most existing methods learn such representations according to network structure information, and some methods further consider vertex content in a network. Unlike prior works, we study the problem of network embedding with two distinctive properties: (1) content information exists on both vertices and edges; (2) only a part of vertices and edges have content information. To solve this problem, we propose a novel Partially available Vertex and Edge Content Boosted network embedding method, namely PVECB, which uses available vertex and edge content information to fine-tune structure-only representations through two hand-designed mechanisms respectively. Empirical results on four real-world datasets demonstrate that our method can effectively boost structure-only representations to capture more accurate proximities between vertices.

[1]  Deli Zhao,et al.  Network Representation Learning with Rich Text Information , 2015, IJCAI.

[2]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[3]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[4]  Jie Tang,et al.  A Combination Approach to Web User Profiling , 2010, TKDD.

[5]  Jian Pei,et al.  Community Preserving Network Embedding , 2017, AAAI.

[6]  Steven Skiena,et al.  Enhanced Network Embeddings via Exploiting Edge Labels , 2018, CIKM.

[7]  Chengqi Zhang,et al.  SINE: Scalable Incomplete Network Embedding , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[8]  Michael D. Vose,et al.  A Linear Algorithm For Generating Random Numbers With a Given Distribution , 1991, IEEE Trans. Software Eng..

[9]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[10]  Li Wang,et al.  How Noisy Social Media Text, How Diffrnt Social Media Sources? , 2013, IJCNLP.

[11]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[12]  Daniel R. Figueiredo,et al.  struc2vec: Learning Node Representations from Structural Identity , 2017, KDD.

[13]  Nitesh V. Chawla,et al.  Link Prediction and Recommendation across Heterogeneous Social Networks , 2012, 2012 IEEE 12th International Conference on Data Mining.

[14]  Geoffrey E. Hinton,et al.  Semantic hashing , 2009, Int. J. Approx. Reason..

[15]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[16]  Roger M. Needham The changing environment for security protocols , 1997 .

[17]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[18]  Zhiyuan Liu,et al.  TransNet: Translation-Based Network Representation Learning for Social Relation Extraction , 2017, IJCAI.

[19]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.

[20]  Lei Wang,et al.  Learning with multi-resolution overlapping communities , 2013, Knowledge and Information Systems.

[21]  Philip S. Yu,et al.  Meta-path based multi-network collective link prediction , 2014, KDD.

[22]  Qiongkai Xu,et al.  GraRep: Learning Graph Representations with Global Structural Information , 2015, CIKM.

[23]  Hang Li,et al.  Variation Autoencoder Based Network Representation Learning for Classification , 2017, ACL.

[24]  Wenwu Zhu,et al.  Structural Deep Network Embedding , 2016, KDD.

[25]  Jian Pei,et al.  Asymmetric Transitivity Preserving Graph Embedding , 2016, KDD.

[26]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[27]  Omer Levy,et al.  Neural Word Embedding as Implicit Matrix Factorization , 2014, NIPS.

[28]  Chengqi Zhang,et al.  Tri-Party Deep Network Representation , 2016, IJCAI.

[29]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[30]  Diederik P. Kingma,et al.  Stochastic Gradient VB and the Variational Auto-Encoder , 2013 .

[31]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[32]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[33]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.