Approximating the Minimum Spanning Tree Weight in Sublinear Time

We present a probabilistic algorithm that, given a connected graph G (represented by adjacency lists) of average degree d, with edge weights in the set{1,...,w}, and given a parameter $0<\eps<1/2$, estimates in time $O( dw \varepsilon^{-2} \log{\frac{dw}\varepsilon})$ the weight of the minimum spanning tree (MST) of G with a relative error of at most $\eps$. Note that the running time does not depend on the number of vertices in G. We also prove a nearly matching lower bound of $\Omega( dw \varepsilon^{-2} )$ on the probe and time complexity of any approximation algorithm for MST weight. The essential component of our algorithm is a procedure for estimating in time $O(d\eps^{-2}\log \frac{d}\varepsilon)$ the number of connected components of an unweighted graph to within an additive error of $\varepsilon n$. (This becomes $O(\eps^{-2}\log \frac{1}\varepsilon)$ for $d=O(1)$.) The time bound is shown to be tight up to within the $\log \frac{d}\varepsilon$ factor. Our connected-components algorithm picks $O(1/\varepsilon^2)$ vertices in the graph and then grows "local spanning trees" whose sizes are specified by a stochastic process. From the local information collected in this way, the algorithm is able to infer, with high confidence, an estimate of the number of connected components. We then show how estimates on the number of components in various subgraphs of G can be used to estimate the weight of its MST.

[1]  Dana Ron,et al.  Property Testing in Bounded Degree Graphs , 2002, STOC '97.

[2]  D. Gale Optimal assignments in an ordered set: An application of matroid theory , 1968 .

[3]  Dana Ron,et al.  Property testing and its connection to learning and approximation , 1998, JACM.

[4]  Bernard Chazelle,et al.  A minimum spanning tree algorithm with inverse-Ackermann type complexity , 2000, JACM.

[5]  Leonard Pitt,et al.  Sublinear time approximate clustering , 2001, SODA '01.

[6]  Bernard Chazelle,et al.  The discrepancy method - randomness and complexity , 2000 .

[7]  D. Eppstein Representing all minimum spanning trees with applications to counting and generation , 1995 .

[8]  Ronitt Rubinfeld,et al.  Robust Characterizations of Polynomials with Applications to Program Testing , 1996, SIAM J. Comput..

[9]  Ronald L. Graham,et al.  On the History of the Minimum Spanning Tree Problem , 1985, Annals of the History of Computing.

[10]  Michael L. Fredman,et al.  Surpassing the Information Theoretic Bound with Fusion Trees , 1993, J. Comput. Syst. Sci..

[11]  Sanguthevar Rajasekaran Handbook of randomized computing , 2001 .

[12]  Noga Alon,et al.  Testing of Clustering , 2003, SIAM J. Discret. Math..

[13]  Ronitt Rubinfeld,et al.  Robust Chara terizations of Polynomials withAppli ations to Program Testing , 1996 .

[14]  Alan M. Frieze,et al.  Quick Approximation to Matrices and Applications , 1999, Comb..

[15]  Michael L. Fredman,et al.  Trans-dichotomous algorithms for minimum spanning trees and shortest paths , 1990, Proceedings [1990] 31st Annual Symposium on Foundations of Computer Science.

[16]  J. Nesetril A few remarks on the history of MST-problem , 1997 .

[17]  Seth Pettie,et al.  An optimal minimum spanning tree algorithm , 2000, JACM.

[18]  Philip N. Klein,et al.  A randomized linear-time algorithm to find minimum spanning trees , 1995, JACM.

[19]  Alan M. Frieze,et al.  Fast monte-carlo algorithms for finding low-rank approximations , 2004, JACM.