Evolution of page popularity under random web graph models

The link structure of the Web can be viewed as a massive graph. The preferential attachment model and its variants are well-known random graph models that help explain the evolution of the web graph. However, those models assign more links to older pages without reference to the quality of web pages, which does not capture the real-world evolution of the web graph and renders the models inappropriate for studying the popularity evolution of new pages.We extend the preferential attachment model with page quality, where the probability of a page getting new links depends not only on its current degree but also on its quality. We study the distribution of degrees among different quality values, and prove that under discrete quality distributions, the degree sequence still follows a power law distribution. Then we use the model to study the evolution of page popularity. We show that for pages with the same quality, the older pages are more popular; if a younger page is better than an older page, then eventually the younger-and-better page will become more popular. We also use the model to study a randomized ranking scheme proposed earlier [18] and show that it accelerates popularity evolution of new pages.

[1]  G. B. A. Barab'asi Competition and multiscaling in evolving networks , 2000, cond-mat/0011029.

[2]  Junghoo Cho,et al.  Impact of search engines on page popularity , 2004, WWW '04.

[3]  Sandeep Pandey,et al.  Shuffling a Stacked Deck: The Case for Partially Randomized Ranking of Search Engine Results , 2005, VLDB.

[4]  Alan M. Frieze,et al.  The influence of search engines on preferential attachment , 2005, SODA '05.

[5]  Jon M. Kleinberg,et al.  The Web as a Graph: Measurements, Models, and Methods , 1999, COCOON.

[6]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[7]  Rajeev Motwani,et al.  Randomized Algorithms , 1995, SIGA.

[8]  Béla Bollobás,et al.  Directed scale-free graphs , 2003, SODA '03.

[9]  Béla Bollobás,et al.  Robustness and Vulnerability of Scale-Free Random Graphs , 2004, Internet Math..

[10]  Alan M. Frieze,et al.  A general model of web graphs , 2003, Random Struct. Algorithms.

[11]  Eli Upfal,et al.  Using PageRank to Characterize Web Structure , 2002, COCOON.

[12]  D. Kossmann,et al.  What can you do with a Web in your Pocket ? , 2007 .

[13]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[14]  Eli Upfal,et al.  Stochastic models for the Web graph , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[15]  Ginestra Bianconi,et al.  Competition and multiscaling in evolving networks , 2001 .

[16]  Rajeev Motwani,et al.  Randomized algorithms , 1996, CSUR.

[17]  Alan M. Frieze,et al.  Random Deletion in a Scale-Free Random Graph Process , 2004, Internet Math..

[18]  Junghoo Cho,et al.  Page quality: in search of an unbiased web ranking , 2005, SIGMOD '05.

[19]  Mihaela Enachescu,et al.  Variations on Random Graph Models for the Web , 2001 .

[20]  Alan M. Frieze,et al.  Balls and bins models with feedback , 2002, SODA '02.