论文信息 - Efficient estimation algorithms for neighborhood variance and other moments

Efficient estimation algorithms for neighborhood variance and other moments

The neighborhood variance problem is as follows. Given a (directed or undirected) graph with values associated with each node, compute a data structure that for any given node v and r ≥ 0, would quickly produce an estimate of the variance of all values of nodes that lie within distance r from v. The problem can be generalized to other moment functions and to arbitrary distance-dependent decay.These problems are motivated by applications where the relevance of a measurement observed (or data present) at a certain location decreases with its distance, and thus the aggregate value varies by location. The centralized version of the problem is motivated by applications to query processing on graphical databases. The distributed version of the problem falls in a model we recently introduced for spatially decaying aggregation and is motivated by sensor or p2p networks.We present novel algorithms for the centralized and distributed versions of the problem. Our algorithms are nearly optimal, the centralized version requires Õ(m) time and the distributed version requires polylogarithmic communication per node or edge (depending on assumptions).

Edith Cohen | Haim Kaplan | Haim Kaplan | E. Cohen

[1] Edith Cohen,et al. Size-Estimation Framework with Applications to Transitive Closure and Reachability , 1997, J. Comput. Syst. Sci..

[2] Srikanta Tirthapura,et al. Distributed Streams Algorithms for Sliding Windows , 2002, SPAA '02.

[3] Jennifer Widom,et al. Models and issues in data stream systems , 2002, PODS.

[4] D. F. Hays,et al. Table of Integrals, Series, and Products , 1966 .

[5] John G. Proakis,et al. Probability, random variables and stochastic processes , 1985, IEEE Trans. Acoust. Speech Signal Process..

[6] Edith Cohen,et al. Maintaining time-decaying stream aggregates , 2006, J. Algorithms.

[7] Piotr Indyk,et al. Maintaining Stream Statistics over Sliding Windows , 2002, SIAM J. Comput..

[8] Edith Cohen,et al. Spatially-decaying aggregation over a network: model and algorithms , 2004, SIGMOD '04.

[9] I. S. Gradshteyn,et al. Table of Integrals, Series, and Products , 1976 .

[10] Rajeev Motwani,et al. Maintaining variance and k-medians over data stream windows , 2003, PODS.

[11] Rina Panigrahy,et al. Better streaming algorithms for clustering problems , 2003, STOC '03.