TrGraph: Cross-Network Transfer Learning via Common Signature Subgraphs

In this paper, we present a novel transfer learning framework for network node classification. Our objective is to accurately predict the labels of nodes in a target network by leveraging information from an auxiliary source network. Such a transfer learning framework is potentially useful for broader areas of network classification, where emerging new networks might not have sufficient labeled information because node labels are either costly to obtain or simply not available, whereas many established networks from related domains are available to benefit the learning. In reality, the source and the target networks may not share common nodes or connections, so the major challenge of cross-network transfer learning is to identify knowledge/patterns transferable between networks and potentially useful to support cross-network learning. In this work, we propose to learn common signature subgraphs between networks, and use them to construct new structure features for the target network. By combining the original node content features and the new structure features, we develop an iterative classification algorithm, TrGraph, that utilizes label dependency to jointly classify nodes in the target network. Experiments on real-world networks demonstrate that TrGraph achieves the superior performance compared to the state-of-the-art baseline methods, and transferring generalizable structure information can indeed improve the node classification accuracy.

[1]  Qiang Yang,et al.  Transferring Naive Bayes Classifiers for Text Classification , 2007, AAAI.

[2]  Philip S. Yu,et al.  Graph indexing: a frequent structure-based approach , 2004, SIGMOD '04.

[3]  Qiang Yang,et al.  Heterogeneous Transfer Learning for Image Clustering via the SocialWeb , 2009, ACL.

[4]  Jiawei Han,et al.  Discriminative Frequent Pattern Analysis for Effective Classification , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[5]  Qiang Yang,et al.  Cross-domain sentiment classification via spectral feature alignment , 2010, WWW '10.

[6]  Lise Getoor,et al.  Collective Classification in Network Data , 2008, AI Mag..

[7]  Christos Faloutsos,et al.  Patterns of Cascading Behavior in Large Blog Graphs , 2007, SDM.

[8]  Massimiliano Pontil,et al.  Multi-Task Feature Learning , 2006, NIPS.

[9]  Ana-Maria Popescu,et al.  A Machine Learning Approach to Twitter User Classification , 2011, ICWSM.

[10]  John Blitzer,et al.  Domain Adaptation with Structural Correspondence Learning , 2006, EMNLP.

[11]  Kalyan Moy Gupta,et al.  Case-Based Collective Classification , 2007, FLAIRS.

[12]  Steven C. H. Hoi,et al.  OTL: A Framework of Online Transfer Learning , 2010, ICML.

[13]  Qiang Yang,et al.  Heterogeneous Transfer Learning for Image Classification , 2011, AAAI.

[14]  J. Laurie Snell,et al.  Markov Random Fields and Their Applications , 1980 .

[15]  Jure Leskovec,et al.  The Network Completion Problem: Inferring Missing Nodes and Edges in Networks , 2011, SDM.

[16]  Ben Taskar,et al.  Discriminative Probabilistic Models for Relational Data , 2002, UAI.

[17]  Feiping Nie,et al.  Cross-language web page classification via dual knowledge transfer using nonnegative matrix tri-factorization , 2011, SIGIR.

[18]  Jennifer Neville,et al.  Iterative Classification in Relational Data , 2000 .

[19]  Oded Maron,et al.  Learning from Ambiguity , 1998 .

[20]  Jie Yin,et al.  Transfer Learning across Networks for Collective Classification , 2013, 2013 IEEE 13th International Conference on Data Mining.

[21]  Mario Vento,et al.  An Improved Algorithm for Matching Large Graphs , 2001 .

[22]  Pedro M. Domingos,et al.  Deep transfer via second-order Markov logic , 2009, ICML '09.

[23]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[24]  Horst Bunke,et al.  A Comparison of Algorithms for Maximum Common Subgraph on Randomly Connected Graphs , 2002, SSPR/SPR.

[25]  Jiawei Han,et al.  Knowledge transfer via multiple model local structure mapping , 2008, KDD.

[26]  Przemyslaw Kazienko,et al.  Label-dependent node classification in the network , 2012, Neurocomputing.

[27]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[28]  Rajat Raina,et al.  Self-taught learning: transfer learning from unlabeled data , 2007, ICML '07.

[29]  Jure Leskovec,et al.  Patterns of Influence in a Recommendation Network , 2006, PAKDD.

[30]  Rong Jin,et al.  Unsupervised transfer classification: application to text categorization , 2010, KDD.

[31]  Yujiu Yang,et al.  Node Classification in Social Network via a Factor Graph Model , 2013, PAKDD.

[32]  Mario Vento,et al.  A (sub)graph isomorphism algorithm for matching large graphs , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Qiang Yang,et al.  Boosting for transfer learning , 2007, ICML '07.

[34]  David Thomas,et al.  The Art in Computer Programming , 2001 .

[35]  J. Jeffry Howbert,et al.  The Maximum Clique Problem , 2007 .

[36]  Lise Getoor,et al.  Combining Collective Classification and Link Prediction , 2007, Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007).

[37]  Chengqi Zhang,et al.  TrGraph: Cross-Network Transfer Learning via Common Signature Subgraphs , 2015, IEEE Trans. Knowl. Data Eng..

[38]  Jennifer Neville,et al.  Across-Model Collective Ensemble Classification , 2011, AAAI.

[39]  Shirui Pan,et al.  Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Graph Classification with Imbalanced Class Distributions and Noise ∗ , 2022 .

[40]  Minghua Chen,et al.  Predicting positive and negative links in signed social networks by transfer learning , 2013, WWW.

[41]  Thomas Seidl,et al.  Subgraph Mining on Directed and Weighted Graphs , 2010, PAKDD.

[42]  Horst Bunke,et al.  Efficient Graph Matching for Video Indexing , 1997, GbRPR.

[43]  Qiang Yang,et al.  Discriminative Factor Alignment across Heterogeneous Feature Space , 2012, ECML/PKDD.

[44]  Kalyan Moy Gupta,et al.  Cautious Collective Classification , 2009, J. Mach. Learn. Res..

[45]  Graham Cormode,et al.  Node Classification in Social Networks , 2011, Social Network Data Analytics.

[46]  Charu C. Aggarwal,et al.  On Node Classification in Dynamic Content-based Networks , 2011, SDM.

[47]  Qiang Yang,et al.  Topic-bridged PLSA for cross-domain text classification , 2008, SIGIR '08.

[48]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[49]  Raymond J. Mooney,et al.  Mapping and Revising Markov Logic Networks for Transfer Learning , 2007, AAAI.