Estimating graph parameters via random walks with restarts

In this paper we discuss the problem of estimating graph parameters from a random walk with restarts at a fixed vertex $x$. For regular graphs $G$, one can estimate the number of vertices $n_G$ and the $\ell^2$ mixing time of $G$ from $x$ in $\widetilde{O}(\sqrt{n_G}\,(t_{\rm unif}^G)^{3/4})$ steps, where $t_{\rm unif}^G$ is the uniform mixing time on $G$. The algorithm is based on the number of intersections of random walk paths $X,Y$, i.e. the number of times $(t,s)$ such that $X_t=Y_s$. Our method improves on previous methods by various authors which only consider collisions (i.e. times $t$ with $X_t=Y_t$). We also show that the time complexity of our algorithm is optimal (up to log factors) for $3$-regular graphs with prescribed mixing times. For general graphs, we adapt the intersections algorithm to compute the number of edges $m_G$ and the $\ell^2$ mixing time from the starting vertex $x$ in $\widetilde{O}(\sqrt{m_G}\,(t_{\rm unif}^G)^{3/4})$ steps. Under mild additional assumptions (which hold e.g. for sparse graphs) the number of vertices can also be estimated by this time. Finally, we show that these algorithms, which may take sublinear time, have a fundamental limitation: it is not possible to devise a sublinear stopping time at which one can be reasonably sure that our parameters are well estimated. On the other hand, we show that, given either $m_G$ or the mixing time of $G$, we can compute the "other parameter" with a self-stopping algorithm.

[1]  Allan Sly,et al.  Cutoff phenomena for random walks on random regular graphs , 2008, 0812.0060.

[2]  Béla Bollobás,et al.  A Probabilistic Proof of an Asymptotic Formula for the Number of Labelled Regular Graphs , 1980, Eur. J. Comb..

[3]  Andrei Z. Broder,et al.  On the second eigenvalue of random regular graphs , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[4]  Y. Peres,et al.  Estimating the Spectral Gap of a Reversible Markov Chain from a Short Trajectory , 2016, 1612.05330.

[5]  László Lovász,et al.  Waiting for a Bat to Fly By (in Polynomial Time) , 2006, Comb. Probab. Comput..

[6]  Elizabeth L. Wilmer,et al.  Markov Chains and Mixing Times , 2008 .

[7]  Jure Leskovec,et al.  Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters , 2008, Internet Math..

[8]  Y. Peres,et al.  Intersection and mixing times for reversible chains , 2014, 1412.8458.

[9]  Russell Lyons Asymptotic Enumeration of Spanning Trees , 2005, Comb. Probab. Comput..

[10]  J. Bunge,et al.  Estimating the Number of Species: A Review , 1993 .

[11]  Prasad Tetali,et al.  Distributed Random Walks , 2013, JACM.

[12]  Svante Janson The Probability That a Random Multigraph is Simple , 2009, Comb. Probab. Comput..

[13]  I. Benjamini,et al.  The birthday problem and Markov chain Monte Carlo , 2007, math/0701390.

[14]  Colin Cooper,et al.  Estimating network parameters using random walks , 2012, CASoN.

[15]  Edo Liberty,et al.  Estimating Sizes of Social Networks via Biased Sampling , 2014, Internet Math..