TrGraph: Cross-Network Transfer Learning via Common Signature Subgraphs

In this paper, we present a novel transfer learning framework for network node classification. Our objective is to accurately predict the labels of nodes in a target network by leveraging information from an auxiliary source network. Such a transfer learning framework is potentially useful for broader areas of network classification, where emerging new networks might not have sufficient labeled information because node labels are either costly to obtain or simply not available, whereas many established networks from related domains are available to benefit the learning. In reality, the source and the target networks may not share common nodes or connections, so the major challenge of cross-network transfer learning is to identify knowledge/patterns transferable between networks and potentially useful to support cross-network learning. In this work, we propose to learn common signature subgraphs between networks, and use them to construct new structure features for the target network. By combining the original node content features and the new structure features, we develop an iterative classification algorithm, TrGraph, that utilizes label dependency to jointly classify nodes in the target network. Experiments on real-world networks demonstrate that TrGraph achieves the superior performance compared to the state-of-the-art baseline methods, and transferring generalizable structure information can indeed improve the node classification accuracy.

[1]  Jie Yin,et al.  Transfer Learning across Networks for Collective Classification , 2013, 2013 IEEE 13th International Conference on Data Mining.

[2]  Jennifer Neville,et al.  Iterative Classification in Relational Data , 2000 .

[3]  Lawrence B. Holder,et al.  Discovering Structural Anomalies in Graph-Based Data , 2007 .

[4]  Oded Maron,et al.  Learning from Ambiguity , 1998 .

[5]  Philip S. Yu,et al.  Graph indexing: a frequent structure-based approach , 2004, SIGMOD '04.

[6]  Jiawei Han,et al.  Discriminative Frequent Pattern Analysis for Effective Classification , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[7]  Mario Vento,et al.  An Improved Algorithm for Matching Large Graphs , 2001 .

[8]  Pedro M. Domingos,et al.  Deep transfer via second-order Markov logic , 2009, ICML '09.

[9]  Horst Bunke,et al.  Efficient Graph Matching for Video Indexing , 1997, GbRPR.

[10]  Qiang Yang,et al.  Cross-domain sentiment classification via spectral feature alignment , 2010, WWW '10.

[11]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[12]  Horst Bunke,et al.  A Comparison of Algorithms for Maximum Common Subgraph on Randomly Connected Graphs , 2002, SSPR/SPR.

[13]  J. Laurie Snell,et al.  Markov Random Fields and Their Applications , 1980 .

[14]  Panos M. Pardalos,et al.  The maximum clique problem , 1994, J. Glob. Optim..

[15]  Jiawei Han,et al.  Knowledge transfer via multiple model local structure mapping , 2008, KDD.

[16]  Qiang Yang,et al.  Heterogeneous Transfer Learning for Image Clustering via the SocialWeb , 2009, ACL.

[17]  Ana-Maria Popescu,et al.  A Machine Learning Approach to Twitter User Classification , 2011, ICWSM.

[18]  John Blitzer,et al.  Domain Adaptation with Structural Correspondence Learning , 2006, EMNLP.

[19]  Lise Getoor,et al.  Collective Classification in Network Data , 2008, AI Mag..

[20]  Christos Faloutsos,et al.  Patterns of Cascading Behavior in Large Blog Graphs , 2007, SDM.

[21]  Rajat Raina,et al.  Self-taught learning: transfer learning from unlabeled data , 2007, ICML '07.

[22]  Jure Leskovec,et al.  Patterns of Influence in a Recommendation Network , 2006, PAKDD.

[23]  Donald Ervin Knuth,et al.  The Art of Computer Programming , 1968 .

[24]  Qiang Yang,et al.  Heterogeneous Transfer Learning for Image Classification , 2011, AAAI.

[25]  Rong Jin,et al.  Unsupervised transfer classification: application to text categorization , 2010, KDD.

[26]  Ben Taskar,et al.  Discriminative Probabilistic Models for Relational Data , 2002, UAI.

[27]  Massimiliano Pontil,et al.  Multi-Task Feature Learning , 2006, NIPS.

[28]  Kalyan Moy Gupta,et al.  Cautious Collective Classification , 2009, J. Mach. Learn. Res..

[29]  Tao Mei,et al.  SocialTransfer: cross-domain transfer learning from social streams for media applications , 2012, ACM Multimedia.

[30]  Graham Cormode,et al.  Node Classification in Social Networks , 2011, Social Network Data Analytics.

[31]  Charu C. Aggarwal,et al.  On Node Classification in Dynamic Content-based Networks , 2011, SDM.

[32]  Qiang Yang,et al.  Topic-bridged PLSA for cross-domain text classification , 2008, SIGIR '08.

[33]  Przemyslaw Kazienko,et al.  Label-dependent node classification in the network , 2012, Neurocomputing.

[34]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[35]  Lise Getoor,et al.  Combining Collective Classification and Link Prediction , 2007 .

[36]  Jennifer Neville,et al.  Across-Model Collective Ensemble Classification , 2011, AAAI.

[37]  Kalyan Moy Gupta,et al.  Case-Based Collective Classification , 2007, FLAIRS.

[38]  Steven C. H. Hoi,et al.  OTL: A Framework of Online Transfer Learning , 2010, ICML.

[39]  Feiping Nie,et al.  Cross-language web page classification via dual knowledge transfer using nonnegative matrix tri-factorization , 2011, SIGIR.

[40]  Yujiu Yang,et al.  Node Classification in Social Network via a Factor Graph Model , 2013, PAKDD.

[41]  Mario Vento,et al.  A (sub)graph isomorphism algorithm for matching large graphs , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Qiang Yang,et al.  Boosting for transfer learning , 2007, ICML '07.

[43]  Shirui Pan,et al.  Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Graph Classification with Imbalanced Class Distributions and Noise ∗ , 2022 .

[44]  Minghua Chen,et al.  Predicting positive and negative links in signed social networks by transfer learning , 2013, WWW.

[45]  Thomas Seidl,et al.  Subgraph Mining on Directed and Weighted Graphs , 2010, PAKDD.

[46]  Jure Leskovec,et al.  The Network Completion Problem: Inferring Missing Nodes and Edges in Networks , 2011, SDM.

[47]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[48]  Qiang Yang,et al.  Discriminative Factor Alignment across Heterogeneous Feature Space , 2012, ECML/PKDD.

[49]  Qiang Yang,et al.  Transferring Naive Bayes Classifiers for Text Classification , 2007, AAAI.