Graph Matching with Partially-Correct Seeds

The graph matching problem aims to find the latent vertex correspondence between two edge-correlated graphs and has many practical applications. In this work, we study a version of the seeded graph matching problem, which assumes that a set of seeds, i.e., pre-mapped vertex-pairs, is given in advance. Specifically, consider two correlated graphs whose edges are sampled independently with probability $s$ from a parent \ER graph $\mathcal{G}(n,p)$. Furthermore, a mapping between the vertices of the two graphs is provided as seeds, of which an unknown $\beta$ fraction is correct. This problem was first studied in \cite{lubars2018correcting} where an algorithm is proposed and shown to perfectly recover the correct vertex mapping with high probability if $\beta\geq\max\left\{\frac{8}{3}p,\frac{16\log{n}}{nps^2}\right\}$. We improve their condition to $\beta\geq\max\left\{30\sqrt{\frac{\log n}{n(1-p)^2s^2}},\frac{45\log{n}}{np(1-p)^2s^2}\right)$. However, when $p=O\left( \sqrt{{\log n}/{ns^2}}\right)$, our improved condition still requires that $\beta$ must increase inversely proportional to $np$. In order to improve the matching performance for sparse graphs, we propose a new algorithm that uses "witnesses" in the 2-hop neighborhood, instead of only 1-hop neighborhood as in \cite{lubars2018correcting}. We show that when $np^2\leq\frac{1}{135\log n}$, our new algorithm can achieve perfect recovery with high probability if $\beta\geq\max\left\{900\sqrt{\frac{np^3(1-s)\log n}{s}},600\sqrt{\frac{\log n}{ns^4}}, \frac{1200\log n}{n^2p^2s^4}\right\}$ and $nps^2\geq 128\log n$. Numerical experiments on both synthetic and real graphs corroborate our theoretical findings and show that our 2-hop algorithm significantly outperforms the 1-hop algorithm when the graphs are relatively sparse.

[1]  Osman Emre Dai,et al.  Analysis of a Canonical Labeling Algorithm for the Alignment of Correlated Erdős-Rényi Graphs , 2018, Abstracts of the 2019 SIGMETRICS/Performance Joint International Conference on Measurement and Modeling of Computer Systems.

[2]  Christoph Schnörr,et al.  Probabilistic Subgraph Matching Based on Convex Relaxation , 2005, EMMCVPR.

[3]  Svante Janson,et al.  Large deviations for sums of partly dependent random variables , 2004, Random Struct. Algorithms.

[4]  Panos M. Pardalos,et al.  The Quadratic Assignment Problem: A Survey and Recent Developments , 1993, Quadratic Assignment and Related Problems.

[5]  Andrew Y. Ng,et al.  Robust Textual Inference via Graph Matching , 2005, HLT.

[6]  Yaron Lipman,et al.  DS++ , 2017, ACM Trans. Graph..

[7]  Carey E. Priebe,et al.  Fast Approximate Quadratic Programming for Graph Matching , 2015, PloS one.

[8]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[9]  Bonnie Berger,et al.  Global alignment of multiple protein interaction networks with application to functional orthology detection , 2008, Proceedings of the National Academy of Sciences.

[10]  Jiaming Xu,et al.  Efficient random graph matching via degree profiles , 2018, Probability Theory and Related Fields.

[11]  Carey E. Priebe,et al.  Seeded graph matching for correlated Erdös-Rényi graphs , 2014, J. Mach. Learn. Res..

[12]  Elchanan Mossel,et al.  Seeded graph matching via large neighborhood statistics , 2018, SODA.

[13]  Alessandro Panconesi,et al.  Concentration of Measure for the Analysis of Randomized Algorithms , 2009 .

[14]  R. Srikant,et al.  Correcting the Output of Approximate Graph Matching Algorithms , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[15]  Guillermo Sapiro,et al.  Graph Matching: Relax at Your Own Risk , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Jiaming Xu,et al.  Spectral Graph Matching and Regularized Quadratic Relaxations II: Erdős-Rényi Graphs and Universality , 2019, Foundations of Computational Mathematics.

[17]  Jiaming Xu,et al.  Spectral Graph Matching and Regularized Quadratic Relaxations I: The Gaussian Model , 2019, ArXiv.

[18]  Carey E. Priebe,et al.  Seeded graph matching , 2012, Pattern Recognit..

[19]  Laurent Massouli'e,et al.  From tree matching to sparse graph alignment , 2020, COLT.

[20]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[21]  Y. Aflalo,et al.  On convex relaxation of graph isomorphism , 2015, Proceedings of the National Academy of Sciences.

[22]  Mario Vento,et al.  Thirty Years Of Graph Matching In Pattern Recognition , 2004, Int. J. Pattern Recognit. Artif. Intell..

[23]  Marco Gori,et al.  Exact and approximate graph matching using random walks , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Matthias Grossglauser,et al.  PROPER: global protein interaction network alignment through percolation matching , 2016, BMC Bioinformatics.

[25]  Daniel Cullina,et al.  On the Performance of a Canonical Labeling for Matching Correlated Erdős-Rényi Graphs , 2018, ArXiv.

[26]  Elza Erkip,et al.  Seeded graph matching: Efficient algorithms and theoretical guarantees , 2017, 2017 51st Asilomar Conference on Signals, Systems, and Computers.

[27]  Vitaly Shmatikov,et al.  De-anonymizing Social Networks , 2009, 2009 30th IEEE Symposium on Security and Privacy.

[28]  Maxim Sviridenko,et al.  Maximum Quadratic Assignment Problem: Reduction from Maximum Label Cover and LP-based Approximation Algorithm , 2010, TALG.

[29]  Shinji Umeyama,et al.  An Eigendecomposition Approach to Weighted Graph Matching Problems , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Matthias Grossglauser,et al.  On the performance of percolation graph matching , 2013, COSN '13.

[31]  Silvio Lattanzi,et al.  An efficient reconciliation algorithm for social networks , 2013, Proc. VLDB Endow..

[32]  Matthias Grossglauser,et al.  On the privacy of anonymized networks , 2011, KDD.

[33]  Maxim Sviridenko,et al.  Maximum Quadratic Assignment Problem: Reduction from Maximum Label Cover and LP-Based Approximation Algorithm , 2010, ICALP.

[34]  Muriel Medard,et al.  Spectral Alignment of Graphs , 2016, IEEE Transactions on Network Science and Engineering.

[35]  François Le Gall,et al.  Powers of tensors and fast matrix multiplication , 2014, ISSAC.

[36]  Tselil Schramm,et al.  (Nearly) Efficient Algorithms for the Graph Matching Problem on Correlated Random Graphs , 2018, NeurIPS.

[37]  Panos M. Pardalos,et al.  Quadratic Assignment Problem , 1997, Encyclopedia of Optimization.