DONE: Enhancing network embedding via greedy vertex domination

Abstract In this paper, we present DONE, a novel framework for learning networks representations, which fundamentally supports numerous network analytic tasks such as node classification, clustering, and visualization. Most existing network embedding methods are unable to efficiently scale for large networks and usually suffer from performance issues. In addition, these methods cannot efficiently leverage the vertex labels that are usually very scarce in real-world networks. Our framework provides a powerful way to generate representations for network vertices without relying on user-defined heuristics for manual feature extraction. We present a deep autoencoder model to generate low-dimensional feature representations by learning network reconstruction and semi-supervised classification tasks. We propose a novel greedy algorithm based on vertex domination and centrality concepts to simplify networks while preserving network topology and community structure, namely GreedyNet. In addition, we propose a novel sampling approach to estimate the labels of unlabeled vertices and adopt it to learn superior embedding. Using network dominating vertices, our approach enhances the generalization and scalability of network embedding through simplifying the underlying network. To the best of our knowledge, we are the first to propose the idea of using network dominating vertices to enhance network embedding. The experimental results show that our method outperforms the state-of-the-art methods.

[1]  Andrew McCallum,et al.  Automating the Construction of Internet Portals with Machine Learning , 2000, Information Retrieval.

[2]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[3]  Vasek Chvátal,et al.  A Greedy Heuristic for the Set-Covering Problem , 1979, Math. Oper. Res..

[4]  Mathias Niepert,et al.  Learning Convolutional Neural Networks for Graphs , 2016, ICML.

[5]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[6]  Lise Getoor,et al.  Collective Classification in Network Data , 2008, AI Mag..

[7]  Yumin Fu Dominating Set and Converse Dominating Set of a Directed Graph , 1968 .

[8]  Chengqi Zhang,et al.  Tri-Party Deep Network Representation , 2016, IJCAI.

[9]  Zhengyang Wang,et al.  Large-Scale Learnable Graph Convolutional Networks , 2018, KDD.

[10]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[11]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[12]  Alberto Montresor,et al.  gat2vec: representation learning for attributed graphs , 2018, Computing.

[13]  Qiongkai Xu,et al.  GraRep: Learning Graph Representations with Global Structural Information , 2015, CIKM.

[14]  David S. Johnson,et al.  Approximation algorithms for combinatorial problems , 1973, STOC.

[15]  Wenwu Zhu,et al.  Structural Deep Network Embedding , 2016, KDD.

[16]  Xiao-Ming Wu,et al.  Deeper Insights into Graph Convolutional Networks for Semi-Supervised Learning , 2018, AAAI.

[17]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[18]  Roger M. Needham The changing environment for security protocols , 1997 .

[19]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[20]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[21]  Huan Liu,et al.  Relational learning via latent social dimensions , 2009, KDD.

[22]  Weili Wu,et al.  A greedy approximation for minimum connected dominating sets , 2004, Theor. Comput. Sci..

[23]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Roger Wattenhofer,et al.  Constant-time distributed dominating set approximation , 2003, PODC '03.

[25]  Steven Skiena,et al.  Enhanced Network Embeddings via Exploiting Edge Labels , 2018, CIKM.

[26]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.