Deterministic massively parallel connectivity

We consider the problem of designing fundamental graph algorithms on the model of Massive Parallel Computation (MPC). The input to the problem is an undirected graph G with n vertices and m edges, and with D being the maximum diameter of any connected component in G. We consider the MPC with low local space, allowing each machine to store only Θ(nδ) words for an arbitrary constant δ>0, and with linear global space (which is the number of machines times the local space available), that is, with optimal utilization. In a recent breakthrough, Andoni et al. (FOCS’18) and Behnezhad et al. (FOCS’19) designed parallel randomized algorithms that in O(logD + loglogn) rounds on an MPC with low local space determine all connected components of a graph, improving on the classic bound of O(logn) derived from earlier works on PRAM algorithms. In this paper, we show that asymptotically identical bounds can be also achieved for deterministic algorithms: we present a deterministic MPC low local space algorithm that in O(logD + loglogn) rounds determines connected components of the input graph. Our result matches the complexity of state of the art randomized algorithms for this task. The techniques developed in our paper can be also applied to several related problems, giving new deterministic MPC algorithms for problems like finding a spanning forest, minimum spanning forest, etc. We complement our upper bounds by extending a recent lower bound for connectivity on an MPC conditioned on the 1-vs-2-cycles conjecture (which requires D ≥ log1+Ω(1)n), by showing a related conditional hardness of Ω(logD) MPC rounds for the entire spectrum of D, covering a particularly interesting range when D ≤ O(logn).

[1]  Michele Scquizzato,et al.  Equivalence classes and conditional hardness in massively parallel computations , 2020, Distributed Computing.

[2]  Moses Charikar,et al.  Brief Announcement: A Randomness-efficient Massively Parallel Algorithm for Connectivity , 2021, PODC.

[3]  A. Czumaj,et al.  Improved Deterministic (Δ+1) Coloring in Low-Space MPC , 2021, PODC.

[4]  Peter Davies,et al.  Component Stability in Low-Space Massively Parallel Computation , 2021, PODC.

[5]  A deterministic algorithm for the MST problem in constant rounds of congested clique , 2019, STOC.

[6]  Sebastian Brandt,et al.  Brief Announcement: Memory Efficient Massively Parallel Algorithms for LCL Problems on Trees , 2021, DISC.

[7]  Peter Davies,et al.  Simple, Deterministic, Constant-Round Coloring in the Congested Clique , 2020, PODC.

[8]  Robert E. Tarjan,et al.  Connected Components on a PRAM in Log Diameter Time , 2020, SPAA.

[9]  A. Czumaj,et al.  Graph Sparsification for Derandomizing Massively Parallel Computation with Low Space , 2019, SPAA.

[10]  Fabian Kuhn,et al.  Efficient Deterministic Distributed Coloring with Small Bandwidth , 2020, PODC.

[11]  Alexandr Andoni,et al.  Parallel approximate undirected shortest paths via low hop emulators , 2019, STOC.

[12]  Krzysztof Onak,et al.  Walking randomly, massively, and efficiently , 2019, STOC.

[13]  Gregory Schwartzman,et al.  Derandomizing local distributed algorithms under bandwidth restrictions , 2016, Distributed Computing.

[14]  Fabian Kuhn,et al.  Conditional Hardness Results for Massively Parallel Computation from Distributed Lower Bounds , 2019, 2019 IEEE 60th Annual Symposium on Foundations of Computer Science (FOCS).

[15]  Vahab S. Mirrokni,et al.  Near-Optimal Massively Parallel Graph Connectivity , 2019, 2019 IEEE 60th Annual Symposium on Foundations of Computer Science (FOCS).

[16]  Richard M. Karp,et al.  Massively Parallel Computation of Matching and MIS in Sparse Graphs , 2019, PODC.

[17]  Fabian Kuhn,et al.  Deterministic Distributed Dominating Set Approximation in the CONGEST Model , 2019, PODC.

[18]  Vahab S. Mirrokni,et al.  Massively Parallel Computation via Remote Memory Access , 2019, SPAA.

[19]  Mohammad Taghi Hajiaghayi,et al.  Exponentially Faster Massively Parallel Maximal Matching , 2019, 2019 IEEE 60th Annual Symposium on Foundations of Computer Science (FOCS).

[20]  Yufan Zheng,et al.  The Complexity of (Δ+1) Coloring in Congested Clique, Massively Parallel Computation, and Centralized Local Computation , 2018, PODC.

[21]  Mohsen Ghaffari,et al.  Sparsifying Distributed Algorithms with Ramifications in Massively Parallel Computation and Centralized Local Computation , 2018, SODA.

[22]  Sepehr Assadi,et al.  Massively Parallel Algorithms for Finding Well-Connected Components in Sparse Graphs , 2018, PODC.

[23]  Vahab S. Mirrokni,et al.  Coresets Meet EDCS: Algorithms for Matching and Vertex Cover on Massive Graphs , 2017, SODA.

[24]  Sergei Vassilvitskii,et al.  Shuffles and Circuits (On Lower Bounds for Modern Parallel Computation) , 2018, J. ACM.

[25]  Vahab S. Mirrokni,et al.  Connected Components at Scale via Local Contractions , 2018, ArXiv.

[26]  Krzysztof Onak Round Compression for Parallel Graph Algorithms in Strongly Sublinear Space , 2018, ArXiv.

[27]  Alexandr Andoni,et al.  Parallel Graph Connectivity in Log Diameter Rounds , 2018, 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS).

[28]  Mohammad Taghi Hajiaghayi,et al.  Brief Announcement: Semi-MapReduce Meets Congested Clique , 2018, ArXiv.

[29]  Ronitt Rubinfeld,et al.  Improved Massively Parallel Computation Algorithms for MIS, Matching, and Vertex Cover , 2018, PODC.

[30]  Tomasz Jurdzinski,et al.  MST in O(1) Rounds of Congested Clique , 2018, SODA.

[31]  Krzysztof Onak,et al.  Round compression for parallel matching algorithms , 2017, STOC.

[32]  Fabian Kuhn,et al.  Derandomizing Distributed Algorithms with Small Messages: Spanners and Dominating Set , 2018, DISC.

[33]  Dan Suciu,et al.  Communication Steps for Parallel Query Processing , 2017, J. ACM.

[34]  Alexandr Andoni,et al.  Parallel algorithms for geometric graph problems , 2013, STOC.

[35]  Paraschos Koutris,et al.  Communication steps for parallel query processing , 2013, PODS '13.

[36]  Jukka Suomela,et al.  Survey of local algorithms , 2013, CSUR.

[37]  M. Forcinito,et al.  Wiley‐Interscience Series in Discrete Mathematics and Optimization , 2011 .

[38]  Silvio Lattanzi,et al.  Filtering: a method for solving graph problems in MapReduce , 2011, SPAA '11.

[39]  Qin Zhang,et al.  Sorting, Searching, and Simulation in the MapReduce Framework , 2011, ISAAC.

[40]  Tom White,et al.  Hadoop - The Definitive Guide: Storage and Analysis at Internet Scale (2. ed.) , 2011 .

[41]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[42]  Sergei Vassilvitskii,et al.  A model of computation for MapReduce , 2010, SODA '10.

[43]  Andrzej Czygrinow,et al.  Fast Distributed Approximations in Planar Graphs , 2008, DISC.

[44]  Christoph Lenzen,et al.  Leveraging Linial's Locality Limit , 2008, DISC.

[45]  N. Alon,et al.  The Probabilistic Method: Alon/Probabilistic , 2008 .

[46]  Sanjay Ghemawat,et al.  MapReduce: simplified data processing on large clusters , 2008, CACM.

[47]  Yuan Yu,et al.  Dryad: distributed data-parallel programs from sequential building blocks , 2007, EuroSys '07.

[48]  Avi Wigderson,et al.  Pairwise Independence and Derandomization , 2006, Found. Trends Theor. Comput. Sci..

[49]  Noga Alon,et al.  Almost k-wise independence versus k-wise independence , 2003, Information Processing Letters.

[50]  Oded Goldreich,et al.  Simple Constructions of Almost k -wise Independent Random Variables , 2002 .

[51]  Yijie Han,et al.  Concurrent threads and optimal parallel minimum spanning trees algorithm , 2001, JACM.

[52]  Kaoru Kurosawa,et al.  Almost k -Wise Independent Sample Spaces and Their Cryptologic Applications , 2001, Journal of Cryptology.

[53]  Aravind Srinivasan,et al.  Improved Algorithms via Approximations of Probability Distributions , 2000, J. Comput. Syst. Sci..

[54]  Russ Bubley,et al.  Randomized algorithms , 1995, CSUR.

[55]  Noam Nisan,et al.  Efficient approximation of product distributions , 1998, Random Struct. Algorithms.

[56]  Mihir Bellare,et al.  Randomness-efficient oblivious sampling , 1994, Proceedings 35th Annual Symposium on Foundations of Computer Science.

[57]  Aravind Srinivasan,et al.  Chernoff-Hoeffding bounds for applications with limited independence , 1995, SODA '93.

[58]  Noam Nisan,et al.  Approximations of general independent distributions , 1992, STOC '92.

[59]  Moni Naor,et al.  Small-bias probability spaces: efficient constructions and applications , 1990, STOC '90.

[60]  Bonnie Berger,et al.  Efficient NC algorithms for set cover with applications to learning and geometry , 1989, 30th Annual Symposium on Foundations of Computer Science.

[61]  Moni Naor,et al.  The probabilistic method yields deterministic parallel algorithms , 1989, 30th Annual Symposium on Foundations of Computer Science.

[62]  Oded Goldreich,et al.  On the power of two-point based sampling , 1989, J. Complex..

[63]  M. Luby Removing randomness in parallel computation without a processor penalty , 1988, [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science.

[64]  Prabhakar Raghavan,et al.  Probabilistic construction of deterministic algorithms: Approximating packing integer programs , 1986, 27th Annual Symposium on Foundations of Computer Science (sfcs 1986).

[65]  Noga Alon,et al.  A Fast and Simple Randomized Parallel Algorithm for the Maximal Independent Set Problem , 1985, J. Algorithms.

[66]  Michael Luby,et al.  A simple parallel algorithm for the maximal independent set problem , 1985, STOC '85.

[67]  Larry Carter,et al.  New classes and applications of hash functions , 1979, 20th Annual Symposium on Foundations of Computer Science (sfcs 1979).

[68]  Larry Carter,et al.  Universal Classes of Hash Functions , 1979, J. Comput. Syst. Sci..

[69]  Paul Erdös,et al.  On a Combinatorial Game , 1973, J. Comb. Theory A.