Random surfers on a web encyclopedia

The random surfer model is a frequently used model for simulating user navigation behavior on the Web. Various algorithms, such as PageRank, are based on the assumption that the model represents a good approximation of users browsing a website. However, the way users browse the Web has been drastically altered over the last decade due to the rise of search engines. Hence, new adaptations for the established random surfer model might be required, which better capture and simulate this change in navigation behavior. In this article we compare the classical uniform random surfer to empirical navigation and page access data in a Web Encyclopedia. Our high level contributions are (i) a comparison of stationary distributions of different types of the random surfer to quantify the similarities and differences between those models as well as (ii) new insights into the impact of search engines on traditional user navigation. Our results suggest that the behavior of the random surfer is almost similar to those of users---as long as users do not use search engines. We also find that classical website navigation structures, such as navigation hierarchies or breadcrumbs, only exercise limited influence on user navigation anymore. Rather, a new kind of navigational tools (e.g., recommendation systems) might be needed to better reflect the changes in browsing behavior of existing users.

[1]  Jon M. Kleinberg,et al.  Small-World Phenomena and the Dynamics of Information , 2001, NIPS.

[2]  Joseph L. Gastwirth,et al.  A General Definition of the Lorenz Curve , 1971 .

[3]  Denis Helic,et al.  Detecting Memory and Structure in Human Navigation Patterns Using Markov Chain Models of Varying Order , 2014, PloS one.

[4]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[5]  Wolfgang Woess,et al.  Random Walks on Infinite Graphs and Groups — a Survey on Selected topics , 1994 .

[6]  Feng Qiu,et al.  Automatic identification of user interest for personalized search , 2006, WWW '06.

[7]  Reinhold Scherer,et al.  Models of human navigation in information networks based on decentralized search , 2013, HT.

[8]  A. Fronczak,et al.  Biased random walks in complex networks: the role of local navigation rules. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Lada A. Adamic,et al.  Search in Power-Law Networks , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Filippo Menczer,et al.  What's in a session: tracking individual behavior on the web , 2009, HT '09.

[11]  Gene H. Golub,et al.  Extrapolation methods for accelerating PageRank computations , 2003, WWW '03.

[12]  David F. Gleich,et al.  Tracking the random surfer: empirically measured teleportation parameters in PageRank , 2010, WWW '10.

[13]  Jure Leskovec,et al.  Automatic Versus Human Navigation in Information Networks , 2012, ICWSM.

[14]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[15]  Avrim Blum,et al.  A Random-Surfer Web-Graph Model , 2006, ANALCO.

[16]  Denis Helic,et al.  The Role of Homophily and Popularity in Informed Decentralized Search , 2014, DyNaK.

[17]  Kristina Lerman,et al.  Leveraging Position Bias to Improve Peer Recommendation , 2014, PloS one.

[18]  L. D. Costa,et al.  Exploring complex networks through random walks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[20]  Guido Caldarelli,et al.  Topologically biased random walk and community finding in networks. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  Larry Birnbaum,et al.  Measuring Semantic Similarity between Named Entities by Searching the Web Directory , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[22]  László Lovász,et al.  Random Walks on Graphs: A Survey , 1993 .

[23]  David D. Jensen,et al.  Decentralized Search in Networks Using Homophily and Degree Disparity , 2005, IJCAI.

[24]  Dietmar Plenz,et al.  powerlaw: A Python Package for Analysis of Heavy-Tailed Distributions , 2013, PloS one.

[25]  Goldhirsch,et al.  Biased random walk on networks. , 1987, Physical review. A, General physics.

[26]  Taher H. Haveliwala,et al.  The Second Eigenvalue of the Google Matrix , 2003 .

[27]  L. Asz Random Walks on Graphs: a Survey , 2022 .

[28]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[29]  Jure Leskovec,et al.  Human wayfinding in information networks , 2012, WWW.

[30]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[31]  S. Al-Saffar,et al.  Experimental Bounds on the Usefulness of Personalized and Topic-Sensitive PageRank , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).