A Fast Algorithm to Find All High-Degree Vertices in Graphs with a Power-Law Degree Sequence

Abstract We develop a fast method for finding all high-degree vertices of a connected graph with a power-law degree sequence. The method uses a biased random walk, where the bias is a function of the power law c of the degree sequence. Let G(t) be a t-vertex graph, with degree sequence power law c ≥ 3 generated by a generalized preferential attachment process that adds m edges at each step. Let Sa be the set of all vertices of degree at least ta in G(t). We analyze a biased random walk that makes transitions along undirected edges {x, y} with probabilities proportional to (d(x)d(y))b, where d(x) is the degree of vertex x and b > 0 is a constant parameter. With parameter b = (c − 1)(c − 2)/(2c − 3), the random walk discovers the set Sa completely in steps with high probability. The error parameter ε depends on c, a, and m. The cover time of the entire graph G(t) by the biased walk is . Thus the expected time to discover all vertices by the biased walk is not much higher than the Θ(tlog t) cover time of a simple random walk. The standard preferential attachment process generates graphs with power law c = 3. The search parameter b = 2/3 is appropriate for such graphs. We conduct experimental tests on a preferential attachment graph and on a sample of the underlying graph of the World Wide Web with power law c ≈ 3 that support the claimed property.

[1]  Walter Willinger,et al.  On Unbiased Sampling for Unstructured Peer-to-Peer Networks , 2006, IEEE/ACM Transactions on Networking.

[2]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[3]  Michael Kearns,et al.  Local Algorithms for Finding Interesting Individuals in Large Networks , 2010, ICS.

[4]  Minas Gjoka,et al.  A Walk in Facebook: Uniform Sampling of Users in Online Social Networks , 2009, ArXiv.

[5]  Masafumi Yamashita,et al.  Impact of Local Topological Information on Random Walks on Finite Graphs , 2003, ICALP.

[6]  Ricardo A. Baeza-Yates,et al.  Crawling a country: better strategies than breadth-first for web page ordering , 2005, WWW '05.

[7]  Colin Cooper,et al.  Coalescing random walks and voting on graphs , 2012, PODC '12.

[8]  Colin Cooper,et al.  Distribution of Vertex Degree in Web-Graphs , 2006, Combinatorics, Probability and Computing.

[9]  David J. Aldous,et al.  Lower bounds for covering times for reversible Markov chains and random walks on graphs , 1989 .

[10]  Cristopher Moore,et al.  On the bias of traceroute sampling: Or, power-law degree distributions in regular graphs , 2005, JACM.

[11]  L. Asz Random Walks on Graphs: a Survey , 2022 .

[12]  Béla Bollobás,et al.  The Diameter of a Scale-Free Random Graph , 2004, Comb..

[13]  A. Barabasi,et al.  Scale-free characteristics of random networks: the topology of the world-wide web , 2000 .

[14]  Alan M. Frieze,et al.  The cover time of the preferential attachment graph , 2007, J. Comb. Theory, Ser. B.

[15]  Béla Bollobás,et al.  The degree sequence of a scale‐free random graph process , 2001, Random Struct. Algorithms.

[16]  Abraham D. Flaxman,et al.  Bias Reduction in Traceroute Sampling - Towards a More Accurate Map of the Internet , 2007, WAW.

[17]  Alan M. Frieze,et al.  A general model of web graphs , 2003, Random Struct. Algorithms.