Label Propagation on K-Partite Graphs with Heterophily

In this paper, we study, for the first time, label propagation in heterogeneous graphs under a heterophily assumption. Homophily-based label propagation (i.e., two connected nodes tend to share similar labels) in homogeneous graphs (with a single type of vertex and relation) has been extensively studied. However, real-life networks are heterogeneous: they contain different types of vertices (e.g., users, images, texts) and relations (e.g., friendships, co-tagging), and they allow each node to propagate both the same and the opposite copy of its labels to its neighbors. We propose a $\mathcal{K}$-partite label propagation model to handle the challenging combination of heterogeneous nodes/relations and heterophily propagation. Based on this model, we develop a novel label inference algorithm framework whose update rules run in near-linear time. Since real networks change over time, we devise an incremental approach that supports fast updates for both new data and new evidence (e.g., ground truth labels) with guaranteed efficiency. We further provide a utility function to automatically determine whether an incremental or a re-modeling approach is favored. Extensive experiments on real datasets verify the effectiveness and efficiency of our approach, and its superiority over state-of-the-art label propagation methods.
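To make the idea of heterophily propagation between two vertex types concrete, the following is a minimal sketch on a single bipartite slice of a K-partite graph. It is not the inference algorithm of this paper: it is a plain iterative scheme chosen for illustration, and the names `propagate`, `row_normalize`, `G`, `Y_u`, and `H` are all hypothetical. The assumption is that a compatibility matrix `H` states how strongly class `i` on one side tends to link to class `j` on the other side, so an anti-diagonal `H` encodes "opposite labels attract".

```python
import numpy as np

def row_normalize(M):
    """Scale each row to sum to 1; all-zero rows stay zero."""
    M = np.asarray(M, dtype=float)
    s = M.sum(axis=1, keepdims=True)
    return np.divide(M, s, out=np.zeros_like(M), where=s > 0)

def propagate(G, Y_u, H, n_iter=10):
    """Illustrative label propagation on a bipartite graph (assumed setup).

    G   : (n_u, n_v) adjacency between type-U and type-V vertices
    Y_u : (n_u, k) seed labels for U; an all-zero row means "unlabeled"
    H   : (k, k) compatibility; H[i, j] is the affinity between a
          class-i U vertex and a class-j V vertex
    """
    G = np.asarray(G, dtype=float)
    Y_u = np.asarray(Y_u, dtype=float)
    H = np.asarray(H, dtype=float)
    seeded = Y_u.sum(axis=1) > 0
    B_u = Y_u.copy()
    for _ in range(n_iter):
        # V vertices collect beliefs from U neighbors, mapped through H.
        B_v = row_normalize(G.T @ B_u @ H)
        # U vertices collect beliefs from V neighbors, mapped back through H^T.
        B_u = row_normalize(G @ B_v @ H.T)
        # Keep the known (seed) labels fixed.
        B_u[seeded] = Y_u[seeded]
    return B_u, B_v

# Three users, two items; users 0 and 1 share item 0, user 2 uses item 1.
G = [[1, 0],
     [1, 0],
     [0, 1]]
Y_u = [[1, 0],   # user 0: known class 0
       [0, 0],   # user 1: unlabeled
       [0, 1]]   # user 2: known class 1
H = [[0, 1],     # heterophily: class 0 links to class 1 and vice versa
     [1, 0]]

B_u, B_v = propagate(G, Y_u, H)
print(B_u.argmax(axis=1), B_v.argmax(axis=1))
```

In this toy setup the unlabeled user inherits the class of the labeled user who shares its item, while the item itself receives the opposite class, which is the behavior a pure homophily scheme (with `H` set to the identity) could not express.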
