Fast distributed random walks

Performing random walks in networks is a fundamental primitive that has found applications in many areas of computer science, including distributed computing. In this paper, we focus on the problem of performing random walks efficiently in a distributed network. Given bandwidth constraints, the goal is to minimize the number of rounds required to obtain a random walk sample. All previous algorithms that compute a random walk sample of length ℓ as a subroutine always do so naively, i.e., in <i>O</i>(ℓ) rounds. The main contribution of this paper is a fast distributed algorithm for performing random walks. We show that a random walk sample of length ℓ can be computed in <i>Õ</i>(ℓ<sup>2/3</sup> <i>D</i><sup>1/3</sup>) rounds on an undirected unweighted network, where <i>D</i> is the diameter of the network.<sup>1</sup> When ℓ = Ω(<i>D</i> log <i>n</i>), this is an improvement over the naive <i>O</i>(ℓ) bound. (We show that Ω(min{<i>D</i>, ℓ}) is a lower bound and hence in general we cannot have a running time faster than the diameter of the graph.) We also show that our algorithm can be applied to speedup the more general Metropolis-Hastings sampling. We extend our algorithms to perform a large number, <i>k</i>, of random walks efficiently. We show how <i>k</i> destinations can be sampled in <i>Õ</i>((<i>k</i>ℓ)<sup>2/3</sup> <i>D</i><sup>1/3</sup>) rounds if <i>k</i> ≤ ℓ<sup>2</sup> and <i>Õ</i>((<i>k</i>ℓ)<sup>1/2</sup>) rounds otherwise. We also present faster algorithms for performing random walks of length larger than (or equal to) the mixing time of the underlying graph. Our techniques can be useful in speeding up distributed algorithms for a variety of applications that use random walks as a subroutine.

[1]  Fan Chung,et al.  Spectral Graph Theory , 1996 .

[2]  David Bruce Wilson,et al.  Generating random spanning trees more quickly than the cover time , 1996, STOC '96.

[3]  Srinivasan Seshan,et al.  Mercury: supporting scalable multi-attribute range queries , 2004, SIGCOMM '04.

[4]  Ming Zhong,et al.  Non-uniform random membership management in peer-to-peer networks , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[5]  Tarek A. El-Ghazawi,et al.  A self-stabilizing distributed algorithm for spanning tree construction in wireless ad hoc networks , 2003, J. Parallel Distributed Comput..

[6]  Edith Cohen,et al.  Search and replication in unstructured peer-to-peer networks , 2002, ICS '02.

[7]  Baruch Awerbuch,et al.  Cost-sensitive analysis of communication protocols , 1990, PODC '90.

[8]  Harald Niederreiter,et al.  Probability and computing: randomized algorithms and probabilistic analysis , 2006, Math. Comput..

[9]  Richard J. Lipton,et al.  Random walks, universal traversal sequences, and the complexity of maze problems , 1979, 20th Annual Symposium on Foundations of Computer Science (sfcs 1979).

[10]  Alain Bui,et al.  Random Walks in Distributed Computing: A Survey , 2004, IICS.

[11]  Judit Bar-Ilan,et al.  Random Leaders and Random Spanning Trees , 1989, WDAG.

[12]  Dmitri Loguinov,et al.  Graph-theoretic analysis of structured peer-to-peer systems: routing distances and fault resilience , 2003, IEEE/ACM Transactions on Networking.

[13]  Jon M. Kleinberg,et al.  Spatial gossip and resource location protocols , 2001, JACM.

[14]  Andrew Twigg,et al.  Lower bounds for distributed markov chain problems , 2008, ArXiv.

[15]  Anne-Marie Kermarrec,et al.  Peer-to-Peer Membership Management for Gossip-Based Protocols , 2003, IEEE Trans. Computers.

[16]  Andrei Z. Broder,et al.  Generating random spanning trees , 1989, 30th Annual Symposium on Foundations of Computer Science.

[17]  Sreenivas Gollapudi,et al.  Estimating PageRank on graph streams , 2008, PODS.

[18]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[19]  Indranil Gupta,et al.  AVMON: Optimal and Scalable Discovery of Consistent Availability Monitoring Overlays for Distributed Systems , 2009, IEEE Trans. Parallel Distributed Syst..

[20]  M. Karonski Collisions among Random Walks on a Graph , 1993 .

[21]  Navin Goyal,et al.  Expanders via random spanning trees , 2008, SODA.

[22]  Kai-Yeung Siu,et al.  Distributed construction of random expander networks , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[23]  Amos Israeli,et al.  Token management schemes and random walks yield self-stabilizing mutual exclusion , 1990, PODC '90.

[24]  Maleq Khan,et al.  Theory of communication networks , 2010 .

[25]  Alain Bui,et al.  Random Distributed Self-stabilizing Structures Maintenance , 2004, ISSADS.

[26]  Lada A. Adamic,et al.  Search in Power-Law Networks , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Brian F. Cooper Quickly Routing Searches Without Having to Move Content , 2005, IPTPS.

[28]  David Peleg,et al.  Distributed Computing: A Locality-Sensitive Approach , 1987 .

[29]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[30]  Ming Zhong,et al.  Random walk based node sampling in self-organizing networks , 2006, OPSR.

[31]  Noga Alon,et al.  Many Random Walks Are Faster Than One , 2011, Comb. Probab. Comput..

[32]  David R. Karger,et al.  Simple Efficient Load Balancing Algorithms for Peer-to-Peer Systems , 2004, IPTPS.

[33]  Jennifer L. Welch,et al.  Random Walk for Self-Stabilizing Group Communication in Ad Hoc Networks , 2006, IEEE Trans. Mob. Comput..

[34]  Jon M. Kleinberg,et al.  The small-world phenomenon: an algorithmic perspective , 2000, STOC '00.

[35]  Christos Gkantsidis,et al.  Hybrid search schemes for unstructured peer-to-peer networks , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[36]  Shlomi Dolev,et al.  Spanders: distributed spanning expanders , 2010, SAC '10.