论文信息 - ZooBP: Belief Propagation for Heterogeneous Networks

ZooBP: Belief Propagation for Heterogeneous Networks

Given a heterogeneous network, with nodes of different types - e.g., products, users and sellers from an online recommendation site like Amazon - and labels for a few nodes ('honest', 'suspicious', etc), can we find a closed formula for Belief Propagation (BP), exact or approximate? Can we say whether it will converge? BP, traditionally an inference algorithm for graphical models, exploits so-called "network effects" to perform graph classification tasks when labels for a subset of nodes are provided; and it has been successful in numerous settings like fraudulent entity detection in online retailers and classification in social networks. However, it does not have a closed-form nor does it provide convergence guarantees in general. We propose ZooBP, a method to perform fast BP on undirected heterogeneous graphs with provable convergence guarantees. ZooBP has the following advantages: (1) Generality: It works on heterogeneous graphs with multiple types of nodes and edges; (2) Closed-form solution: ZooBP gives a closed-form solution as well as convergence guarantees; (3) Scalability: ZooBP is linear on the graph size and is up to 600× faster than BP, running on graphs with 3.3 million edges in a few seconds. (4) Effectiveness: Applied on real data (a Flipkart e-commerce network with users, products and sellers), ZooBP identifies fraudulent users with a near-perfect precision of 92.3 % over the top 300 results.

[1] Brendan J. Frey,et al. Probability Propagation and Iterative Decoding , 1996 .

[2] Danai Koutra,et al. Unifying Guilt-by-Association Approaches: Theorems and Fast Algorithms , 2011, ECML/PKDD.

[3] Aniket Kittur,et al. Apolo: making sense of large network data by combining rich user interaction and machine learning , 2011, CHI.

[4] Hilbert J. Kappen,et al. Sufficient Conditions for Convergence of the Sum–Product Algorithm , 2005, IEEE Transactions on Information Theory.

[5] Christos Faloutsos,et al. Opinion Fraud Detection in Online Reviews by Network Effects , 2013, ICWSM.

[6] Judea Pearl,et al. Reverend Bayes on Inference Engines: A Distributed Hierarchical Approach , 1982, AAAI.

[7] William T. Freeman,et al. Understanding belief propagation and its generalizations , 2003 .

[8] Michael I. Jordan,et al. Loopy Belief Propagation for Approximate Inference: An Empirical Study , 1999, UAI.

[9] Vladimir Vapnik. Transductive Inference and Semi-Supervised Learning , 2006, Semi-Supervised Learning.

[10] Christos Faloutsos,et al. Netprobe: a fast and scalable system for fraud detection in online auction networks , 2007, WWW '07.

[11] Zoubin Ghahramani,et al. Learning from labeled and unlabeled data with label propagation , 2002 .

[12] Christos Faloutsos,et al. Fast Random Walk with Restart and Its Applications , 2006, Sixth International Conference on Data Mining (ICDM'06).

[13] Christos Faloutsos,et al. CAMLP: Confidence-Aware Modulated Label Propagation , 2016, SDM.

[14] Jennifer Neville,et al. Iterative Classification in Relational Data , 2000 .

[15] Xiaojin Zhu,et al. Semi-Supervised Learning , 2010, Encyclopedia of Machine Learning.

[16] Nanning Zheng,et al. Stereo Matching Using Belief Propagation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[17] Yousef Saad,et al. Iterative methods for sparse linear systems , 2003 .

[18] Judea Pearl,et al. Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[19] S. R. Searle,et al. The Vec-Permutation Matrix, the Vec Operator and Kronecker Products: A Review , 1981 .

[20] F. Ayres. Schaum's outline of theory and problems of matrices , 1952 .

[21] Jennifer Neville,et al. Why collective inference improves relational classification , 2004, KDD.

[22] Danai Koutra,et al. Linearized and Single-Pass Belief Propagation , 2014, Proc. VLDB Endow..

[23] Ian McGraw,et al. Residual Belief Propagation: Informed Scheduling for Asynchronous Message Passing , 2006, UAI.

[24] Daniel P. Huttenlocher,et al. Efficient Belief Propagation for Early Vision , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[25] Hideki Imai,et al. Reduced complexity iterative decoding of low-density parity check codes based on belief propagation , 1999, IEEE Trans. Commun..

[26] Wolfgang Gatterbauer,et al. The Linearization of Pairwise Markov Networks , 2015, ArXiv.