Interplay Between Graph Isomorphism and Earth Mover's Distance in the Query and Communication Worlds

The graph isomorphism distance between two graphs Gu and Gk is the fraction of entries in the adjacency matrix that has to be changed to make Gu isomorphic to Gk. We study the problem of estimating, up to a constant additive factor, the graph isomorphism distance between two graphs in the query model. In other words, if Gk is a known graph and Gu is an unknown graph whose adjacency matrix has to be accessed by querying the entries, what is the query complexity for testing whether the graph isomorphism distance between Gu and Gk is less than γ1 or more than γ2, where γ1 and γ2 are two constants with 0 ≤ γ1 < γ2 ≤ 1. It is also called the tolerant property testing of graph isomorphism in the dense graph model. The non-tolerant version (where γ1 is 0) has been studied by Fischer and Matsliah (SICOMP’08). In this paper, we prove a (interesting) connection between tolerant graph isomorphism testing and tolerant testing of the well studied Earth Mover’s Distance (EMD). We prove that deciding tolerant graph isomorphism is equivalent to deciding tolerant EMD testing between multi-sets in the query setting. Moreover, the reductions between tolerant graph isomorphism and tolerant EMD testing (in query setting) can also be extended directly to work in the two party Alice-Bob communication model (where Alice and Bob have one graph each and they want to solve tolerant graph isomorphism problem by communicating bits), and possibly in other sublinear models as well. Testing tolerant EMD between two probability distributions is equivalent to testing EMD between two multi-sets, where the multiplicity of each element is taken appropriately, and we sample elements from the unknown multi-set with replacement. In this paper, our (main) contribution is to introduce the problem of (tolerant) EMD testing between multi-sets (over Hamming cube) when we get samples from the unknown multi-set without replacement and to show that this variant of tolerant testing of EMD is as hard as tolerant testing of graph isomorphism between two graphs. Thus, while testing of equivalence between distributions is at the heart of the non-tolerant testing of graph isomorphism, we are showing that the estimation of the EMD over a Hamming cube (when we are allowed to sample without replacement) is at the heart of tolerant graph isomorphism. We believe that the introduction of the problem of testing EMD between multi-sets (when we get samples without replacement) opens an entirely new direction in the world of testing properties of distributions. 2012 ACM Subject Classification Theory of computation → Streaming, sublinear and near linear time algorithms

[1]  David P. Woodruff,et al.  Efficient Sketches for Earth-Mover Distance, with Applications , 2009, 2009 50th Annual IEEE Symposium on Foundations of Computer Science.

[2]  Chih-Long Lin,et al.  Hardness of Approximating Graph Transformation Problem , 1994, ISAAC.

[3]  Gregory Valiant,et al.  An Automatic Inequality Prover and Instance Optimal Identity Testing , 2014, 2014 IEEE 55th Annual Symposium on Foundations of Computer Science.

[4]  Luc Devroye,et al.  Combinatorial methods in density estimation , 2001, Springer series in statistics.

[5]  Xiaorui Sun On the isomorphism testing of graphs , 2016 .

[6]  László Babai,et al.  Graph isomorphism in quasipolynomial time [extended abstract] , 2015, STOC.

[7]  Ronitt Rubinfeld,et al.  Tolerant property testing and distance approximation , 2006, J. Comput. Syst. Sci..

[8]  Paul Valiant Testing symmetric properties of distributions , 2008, STOC '08.

[9]  Gregory Valiant,et al.  The Power of Linear Estimators , 2011, 2011 IEEE 52nd Annual Symposium on Foundations of Computer Science.

[10]  Subhash Khot,et al.  Nonembeddability theorems via Fourier analysis , 2005, 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS'05).

[11]  Ilan Newman,et al.  Private vs. Common Random Bits in Communication Complexity , 1991, Inf. Process. Lett..

[12]  Dana Ron,et al.  Strong Lower Bounds for Approximating Distribution Support Size and the Distinct Elements Problem , 2007, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).

[13]  Constantinos Daskalakis,et al.  Optimal Testing for Properties of Distributions , 2015, NIPS.

[14]  David A. Freedman,et al.  A Remark on the Difference between Sampling with and without Replacement , 1977 .

[15]  Ronitt Rubinfeld,et al.  Testing random variables for independence and identity , 2001, Proceedings 2001 IEEE International Conference on Cluster Computing.

[16]  Andrew Chi-Chih Yao,et al.  Some complexity questions related to distributive computing(Preliminary Report) , 1979, STOC.

[17]  Krzysztof Onak,et al.  The query complexity of graph isomorphism: bypassing distribution testing lower bounds , 2018, STOC.

[18]  László Babai,et al.  Property Testing of Equivalence under a Permutation Group Action , 2008, Electron. Colloquium Comput. Complex..

[19]  Ronitt Rubinfeld,et al.  Sublinear Time Algorithms for Earth Mover’s Distance , 2009, Theory of Computing Systems.

[20]  Alessandro Panconesi,et al.  Concentration of Measure for the Analysis of Randomized Algorithms , 2009 .

[21]  Aravind Srinivasan,et al.  Lower Bounds on the Deterministic and Quantum Communication Complexity of Hamming-Distance Problems , 2004, ACM Trans. Comput. Theory.

[22]  Oded Goldreich Testing Isomorphism in the Bounded-Degree Graph Model , 2019, Electron. Colloquium Comput. Complex..

[23]  Clément L. Canonne,et al.  A Survey on Distribution Testing: Your Data is Big. But is it Blue? , 2020, Electron. Colloquium Comput. Complex..

[24]  Alexandr Andoni,et al.  Earth mover distance over high-dimensional spaces , 2008, SODA '08.

[25]  Jacobo Torán,et al.  The Graph Isomorphism Problem (Dagstuhl Seminar 15511) , 2015, Dagstuhl Reports.

[26]  Alexandr Andoni,et al.  Sketching and Embedding are Equivalent for Norms , 2014, STOC.

[27]  Barnabás Póczos,et al.  Minimax Distribution Estimation in Wasserstein Distance , 2018, ArXiv.

[28]  Moti Medina,et al.  Distributed Testing of Graph Isomorphism in the CONGEST model , 2020, APPROX/RANDOM.

[29]  Eldar Fischer,et al.  Testing graph isomorphism , 2006, SODA '06.

[30]  Liam Paninski,et al.  A Coincidence-Based Test for Uniformity Given Very Sparsely Sampled Discrete Data , 2008, IEEE Transactions on Information Theory.