Semi-supervised Classification-based Local Vertex Ranking via Dual Generative Adversarial Nets

Real-world graphs are usually very sparse in terms of inadequate edges and labels as well as have poor quality due to a large amount of noisy data. In this paper, we propose a classification-based local vertex ranking architecture through dual generative adversarial networks in the semi-supervised setting, DQGAN, for analyzing sparse noisy graphs with rarely labeled data. First, we develop a quadruple generative adversarial ClassNet model to address the noisy data and data sparsity issues as well as to classify each vertex into K classes by automatically creating imaginary/real-looking supplementary labeled vertices with the quite different/similar distributions as real vertices, without the high cost of multi-step graph propagation, heterogeneous graph mining, and iterative weight learning. In addition, the vertex label vicinity is incorporated into the classification model to capture the pairwise vertex closeness based on the labeling and align the vertex label vicinity with the well-known vertex homophily for preserving the original structural semantics in the classification space. Second, we present a quintuple generative adversarial RankNet framework to locally rank each vertex on each of K classes by designing the game of multiple competitors utilizing the mix of real and noisy data to fight against each other, for improving the robustness of local vertex ranking to noisy data with few help from human efforts. The cycle ranking consistency strategy is designed to make the ranking quality verifiable through the bidirectional information-lossless translations between the original features and the ranking features. We propose to utilize the relaxed local PageRank property to produce high-quality local vertex ranking results in the context of information networks. Third but last, extensive evaluation on real graph datasets demonstrates that DQGAN outperforms existing representative methods in terms of both classification and ranking in the semi-supervised setting.

[1]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[2]  Philip S. Yu,et al.  Integrating meta-path selection with user-guided object clustering in heterogeneous information networks , 2012, KDD.

[3]  Yizhou Sun,et al.  RankClus: integrating clustering with ranking for heterogeneous information network analysis , 2009, EDBT '09.

[4]  Calton Pu,et al.  Scaling iterative graph computations with GraphMap , 2015, SC15: International Conference for High Performance Computing, Networking, Storage and Analysis.

[5]  Ruoming Jin,et al.  Density-Adaptive Local Edge Representation Learning with Generative Adversarial Network Multi-label Edge Classification , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[6]  Charu C. Aggarwal,et al.  On Node Classification in Dynamic Content-based Networks , 2011, SDM.

[7]  Wei Cheng,et al.  Flexible and robust co-regularized multi-domain graph clustering , 2013, KDD.

[8]  Shou-De Lin,et al.  Unsupervised Ranking using Graph Structures and Node Attributes , 2017, WSDM.

[9]  Hong Cheng,et al.  Graph Clustering Based on Structural/Attribute Similarities , 2009, Proc. VLDB Endow..

[10]  Jiawei Han,et al.  Ranking-based classification of heterogeneous information networks , 2011, KDD.

[11]  Michael K. Ng,et al.  Tensor Based Relations Ranking for Multi-relational Collective Classification , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[12]  Yixin Chen,et al.  An End-to-End Deep Learning Architecture for Graph Classification , 2018, AAAI.

[13]  Qi Zhang,et al.  GraphTwist: Fast Iterative Graph Computation with Two-tier Optimizations , 2015, Proc. VLDB Endow..

[14]  Philip S. Yu,et al.  Predicting Social Links for New Users across Aligned Heterogeneous Social Networks , 2013, 2013 IEEE 13th International Conference on Data Mining.

[15]  Charu C. Aggarwal,et al.  The Troll-Trust Model for Ranking in Signed Networks , 2016, WSDM.

[16]  Philip S. Yu,et al.  Multi-View Multi-Graph Embedding for Brain Network Clustering Analysis , 2018, AAAI.

[17]  Mingchu Li,et al.  Reliable and Resilient Trust Management in Distributed Service Provision Networks , 2015, ACM Trans. Web.

[18]  Christos Faloutsos,et al.  Fast Random Walk with Restart and Its Applications , 2006, Sixth International Conference on Data Mining (ICDM'06).

[19]  Philip S. Yu,et al.  Multi-view Graph Embedding with Hub Detection for Brain Network Analysis , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[20]  Yizhou Sun,et al.  Ranking-based clustering of heterogeneous information networks with star network schema , 2009, KDD.

[21]  Calton Pu,et al.  Fast Iterative Graph Computation with Resource Aware Graph Parallel Abstractions , 2015, HPDC.

[22]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[23]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[24]  Jingrui He,et al.  Diversified ranking on large graphs: an optimization viewpoint , 2011, KDD.

[25]  Tie-Yan Liu,et al.  Semi-supervised ranking on very large graphs with rich metadata , 2011, KDD.

[26]  Shanshan Li,et al.  Deep Collective Classification in Heterogeneous Information Networks , 2018, WWW.

[27]  Philip S. Yu,et al.  Multi-view Clustering with Graph Embedding for Connectome Analysis , 2017, CIKM.

[28]  Hong Cheng,et al.  Clustering Large Attributed Graphs: An Efficient Incremental Approach , 2010, 2010 IEEE International Conference on Data Mining.

[29]  Jiawei Han,et al.  A Feature-Enhanced Ranking-Based Classifier for Multimodal Data and Heterogeneous Information Networks , 2013, 2013 IEEE 13th International Conference on Data Mining.

[30]  Hui Fang,et al.  Supervised User Ranking in Signed Social Networks , 2019, AAAI.

[31]  Peng Yang,et al.  An Aggressive Graph-Based Selective Sampling Algorithm for Classification , 2015, 2015 IEEE International Conference on Data Mining.

[32]  Ling Liu,et al.  Integrating Vertex-centric Clustering with Edge-centric Clustering for Meta Path Graph Analysis , 2015, KDD.

[33]  M. Narasimha Murty,et al.  Structural Neighborhood Based Classification of Nodes in a Network , 2016, KDD.

[34]  Jiebo Luo,et al.  RankCompete: Simultaneous ranking and clustering of information networks , 2012, Neurocomputing.

[35]  Wenpu Xing,et al.  Weighted PageRank algorithm , 2004, Proceedings. Second Annual Conference on Communication Networks and Services Research, 2004..

[36]  Wojciech Zaremba,et al.  Improved Techniques for Training GANs , 2016, NIPS.

[37]  Nitesh V. Chawla,et al.  Task-Guided and Semantic-Aware Ranking for Academic Author-Paper Correlation Inference , 2018, IJCAI.

[38]  Ling Liu,et al.  Activity-edge centric multi-label classification for mining heterogeneous information networks , 2014, KDD.

[39]  Charu C. Aggarwal,et al.  Selective sampling on graphs for classification , 2013, KDD.

[40]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[41]  Ruiqi Hu,et al.  Graph Ladder Networks for Network Classification , 2017, CIKM.

[42]  Huan Liu,et al.  Scalable learning of collective behavior based on sparse social dimensions , 2009, CIKM.

[43]  Charu C. Aggarwal,et al.  Relation Strength-Aware Clustering of Heterogeneous Information Networks with Incomplete Attributes , 2012, Proc. VLDB Endow..

[44]  Foster Provost,et al.  A Simple Relational Classifier , 2003 .

[45]  Ming-Syan Chen,et al.  Personalized Ranking on Poisson Factorization , 2018, SDM.

[46]  Diane J. Cook,et al.  Monitoring Influenza Trends through Mining Social Media , 2009, BIOCOMP.

[47]  Jiawei Han,et al.  Curriculum Learning for Heterogeneous Star Network Embedding via Deep Reinforcement Learning , 2018, WSDM.

[48]  John D. Lafferty,et al.  Diffusion Kernels on Graphs and Other Discrete Input Spaces , 2002, ICML.

[49]  Michael R. Lyu,et al.  Mining social networks using heat diffusion processes for marketing candidates selection , 2008, CIKM '08.

[50]  Cho-Jui Hsieh,et al.  Large-scale Collaborative Ranking in Near-Linear Time , 2017, KDD.

[51]  M. Narasimha Murty,et al.  Overlap-Robust Decision Boundary Learning for Within-Network Classification , 2018, AAAI.

[52]  Zhao Chen,et al.  Ranking Users in Social Networks With Higher-Order Structures , 2018, AAAI.

[53]  Yang Zhou,et al.  Enhancing Collaborative Filtering with Multi-label Classification , 2019, CSoNet.

[54]  Philip S. Yu,et al.  PathSim , 2011, Proc. VLDB Endow..

[55]  Zoubin Ghahramani,et al.  Learning from labeled and unlabeled data with label propagation , 2002 .

[56]  L. Getoor,et al.  Link-Based Classification , 2003, Encyclopedia of Machine Learning and Data Mining.

[57]  Ling Liu,et al.  Social influence based clustering of heterogeneous information networks , 2013, KDD.

[58]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[59]  Shuigeng Zhou,et al.  Group and Graph Joint Sparsity for Linked Data Classification , 2016, AAAI.

[60]  Ling Liu,et al.  Social Influence Based Clustering and Optimization over Heterogeneous Information Networks , 2015, ACM Trans. Knowl. Discov. Data.

[61]  Huan Liu,et al.  Exploiting homophily effect for trust prediction , 2013, WSDM.

[62]  Mizuki Morita,et al.  Twitter Catches The Flu: Detecting Influenza Epidemics using Twitter , 2011, EMNLP.

[63]  Philip S. Yu,et al.  Meta path-based collective classification in heterogeneous information networks , 2012, CIKM.

[64]  Huan Liu,et al.  Leveraging social media networks for classification , 2011, Data Mining and Knowledge Discovery.

[65]  Tsuyoshi Murata,et al.  Transductive Classification on Heterogeneous Information Networks with Edge Betweenness-based Normalization , 2016, WSDM.

[66]  Susan T. Dumais,et al.  Classification-enhanced ranking , 2010, WWW '10.

[67]  Charu C. Aggarwal,et al.  Link Prediction with Spatial and Temporal Consistency in Dynamic Networks , 2017, IJCAI.

[68]  Gita Reese Sukthankar,et al.  Multi-label relational neighbor classification using social context features , 2013, KDD.

[69]  Jiawei Han,et al.  An Attention-based Collaboration Framework for Multi-View Network Representation Learning , 2017, CIKM.

[70]  Yang Zhou,et al.  Dual Adversarial Learning Based Network Alignment , 2019, 2019 IEEE International Conference on Data Mining (ICDM).

[71]  Ruoming Jin,et al.  Density-aware Local Siamese Autoencoder Network Embedding with Autoencoder Graph Clustering , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[72]  Charu C. Aggarwal,et al.  Node Classification in Signed Social Networks , 2016, SDM.