Sparsifying Distributed Algorithms with Ramifications in Massively Parallel Computation and Centralized Local Computation

We introduce a method for sparsifying distributed algorithms and exhibit how it leads to improvements that go past known barriers in two algorithmic settings of large-scale graph processing: Massively Parallel Computation (MPC), and Local Computation Algorithms (LCA). - MPC with Strongly Sublinear Memory: Recently, there has been growing interest in obtaining MPC algorithms that are faster than their classic $O(\log n)$-round parallel counterparts for problems such as MIS, Maximal Matching, 2-Approximation of Minimum Vertex Cover, and $(1+\epsilon)$-Approximation of Maximum Matching. Currently, all such MPC algorithms require $\tilde{\Omega}(n)$ memory per machine. Czumaj et al. [STOC'18] were the first to handle $\tilde{\Omega}(n)$ memory, running in $O((\log\log n)^2)$ rounds. We obtain $\tilde{O}(\sqrt{\log \Delta})$-round MPC algorithms for all these four problems that work even when each machine has memory $n^{\alpha}$ for any constant $\alpha\in (0, 1)$. Here, $\Delta$ denotes the maximum degree. These are the first sublogarithmic-time algorithms for these problems that break the linear memory barrier. - LCAs with Query Complexity Below the Parnas-Ron Paradigm: Currently, the best known LCA for MIS has query complexity $\Delta^{O(\log \Delta)} poly(\log n)$, by Ghaffari [SODA'16]. As pointed out by Rubinfeld, obtaining a query complexity of $poly(\Delta\log n)$ remains a central open question. Ghaffari's bound almost reaches a $\Delta^{\Omega\left(\frac{\log \Delta}{\log\log \Delta}\right)}$ barrier common to all known MIS LCAs, which simulate distributed algorithms by learning the local topology, a la Parnas-Ron [TCS'07]. This barrier follows from the $\Omega(\frac{\log \Delta}{\log\log \Delta})$ distributed lower bound of Kuhn, et al. [JACM'16]. We break this barrier and obtain an MIS LCA with query complexity $\Delta^{O(\log\log \Delta)} poly(\log n)$.

[1]  Krzysztof Onak,et al.  Maintaining a large matching and a small vertex cover , 2010, STOC '10.

[2]  Dana Ron,et al.  Deterministic Stateless Centralized Local Algorithms for Bounded Degree Graphs , 2014, ESA.

[3]  Nicholas J. A. Harvey,et al.  Greedy and Local Ratio Algorithms in the MapReduce Model , 2018, SPAA.

[4]  Sebastian Brandt,et al.  Breaking the Linear-Memory Barrier in MPC: Fast MIS on Trees with Strongly Sublinear Memory , 2019, SIROCCO.

[5]  Alexandr Andoni,et al.  Parallel Graph Connectivity in Log Diameter Rounds , 2018, 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS).

[6]  Ronitt Rubinfeld,et al.  Improved Massively Parallel Computation Algorithms for MIS, Matching, and Vertex Cover , 2018, PODC.

[7]  Noga Alon,et al.  Space-efficient local computation algorithms , 2011, SODA.

[8]  Benjamin Moseley,et al.  Efficient massively parallel methods for dynamic programming , 2017, STOC.

[9]  Krzysztof Onak,et al.  Round compression for parallel matching algorithms , 2017, STOC.

[10]  Alon Itai,et al.  A Fast and Simple Randomized Parallel Algorithm for Maximal Matching , 1986, Inf. Process. Lett..

[11]  Sergei Vassilvitskii,et al.  A model of computation for MapReduce , 2010, SODA '10.

[12]  Noga Alon,et al.  A Fast and Simple Randomized Parallel Algorithm for the Maximal Independent Set Problem , 1985, J. Algorithms.

[13]  Sebastian Brandt,et al.  Matching and MIS for Uniformly Sparse Graphs in the Low-Memory MPC Model , 2018, ArXiv.

[14]  Roger Wattenhofer,et al.  Local Computation , 2010, J. ACM.

[15]  Mohsen Ghaffari Distributed MIS via All-to-All Communication , 2017, PODC.

[16]  Sriram V. Pemmaraju,et al.  Equitable colorings extend Chernoff-Hoeffding bounds , 2001, SODA '01.

[17]  Andrew McGregor,et al.  Finding Graph Matchings in Data Streams , 2005, APPROX-RANDOM.

[18]  Ronitt Rubinfeld,et al.  Fast Local Computation Algorithms , 2011, ICS.

[19]  Silvio Lattanzi,et al.  Filtering: a method for solving graph problems in MapReduce , 2011, SPAA '11.

[20]  Alexandr Andoni,et al.  Parallel algorithms for geometric graph problems , 2013, STOC.

[21]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[22]  Moti Medina,et al.  A (Centralized) Local Guide , 2017, Bull. EATCS.

[23]  Nathan Linial,et al.  Distributive graph algorithms Global solutions from local data , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[24]  Qin Zhang,et al.  Sorting, Searching, and Simulation in the MapReduce Framework , 2011, ISAAC.

[25]  Omer Reingold,et al.  New techniques and tighter bounds for local computation algorithms , 2014, J. Comput. Syst. Sci..

[26]  Michael Luby,et al.  A simple parallel algorithm for the maximal independent set problem , 1985, STOC '85.

[27]  Sergei Vassilvitskii,et al.  Shuffles and Circuits (On Lower Bounds for Modern Parallel Computation) , 2018, J. ACM.

[28]  Sepehr Assadi,et al.  Massively Parallel Algorithms for Finding Well-Connected Components in Sparse Graphs , 2018, PODC.

[29]  Dana Ron,et al.  On Approximating the Minimum Vertex Cover in Sublinear Time and the Connection to Distributed Algorithms , 2007, Electron. Colloquium Comput. Complex..

[30]  Mohammad Ghodsi,et al.  Approximating Edit Distance in Truly Subquadratic Time: Quantum and MapReduce , 2018, SODA.

[31]  Christoph Lenzen,et al.  Brief announcement: exponential speed-up of local algorithms using non-local communication , 2010, PODC '10.

[32]  Sepehr Assadi Simple Round Compression for Parallel Vertex Cover , 2017, ArXiv.

[33]  Dan Suciu,et al.  Skew in parallel query processing , 2014, PODS.

[34]  Yuan Yu,et al.  Dryad: distributed data-parallel programs from sequential building blocks , 2007, EuroSys '07.

[35]  Sriram V. Pemmaraju,et al.  Lessons from the Congested Clique Applied to MapReduce , 2014, SIROCCO.

[36]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[37]  Vahab S. Mirrokni,et al.  Coresets Meet EDCS: Algorithms for Matching and Vertex Cover on Massive Graphs , 2017, SODA.

[38]  Mohsen Ghaffari,et al.  An Improved Distributed Algorithm for Maximal Independent Set , 2015, SODA.

[39]  Ronitt Rubinfeld,et al.  Local Computation Algorithms for Graphs of Non-constant Degrees , 2016, Algorithmica.