On the Value of Wikipedia as a Gateway to the Web

By linking to external websites, Wikipedia can act as a gateway to the Web. To date, however, little is known about the amount of traffic generated by Wikipedia’s external links. We fill this gap in a detailed analysis of usage logs gathered from Wikipedia users’ client devices. Our analysis proceeds in three steps: First, we quantify the level of engagement with external links, finding that, in one month, English Wikipedia generated 43M clicks to external websites, in roughly even parts via links in infoboxes, cited references, and article bodies. Official links listed in infoboxes have by far the highest click-through rate (CTR), 2.47% on average. In particular, official links associated with articles about businesses, educational institutions, and websites have the highest CTR, whereas official links associated with articles about geographical content, television, and music have the lowest CTR. Second, we investigate patterns of engagement with external links, finding that Wikipedia frequently serves as a stepping stone between search engines and third-party websites, effectively fulfilling information needs that search engines do not meet. Third, we quantify the hypothetical economic value of the clicks received by external websites from English Wikipedia, by estimating that the respective website owners would need to pay a total of $7–13 million per month to obtain the same volume of traffic via sponsored search. Overall, these findings shed light on Wikipedia’s role not only as an important source of information, but also as a high-traffic gateway to the broader Web ecosystem.

[1]  Jeff Donahue,et al.  Visual Search at Pinterest , 2015, KDD.

[2]  Blagoj Mitrevski,et al.  WikiHist.html: English Wikipedia's Full Revision History in HTML Format , 2020, ICWSM.

[3]  Stefano Ermon,et al.  Predicting Economic Development using Geolocated Wikipedia Articles , 2019, KDD.

[4]  Alina Deshpande,et al.  Global Disease Monitoring and Forecasting with Wikipedia , 2014, PLoS Comput. Biol..

[5]  Kristofer Erickson,et al.  What is the Commons Worth?: Estimating the Value of Wikimedia Imagery by Observing Downstream Use , 2018, OpenSym.

[6]  Olga Vasileva,et al.  Dwelling on Wikipedia: investigating time spent by global encyclopedia readers , 2019, OpenSym.

[7]  Giovanni Colavizza,et al.  Quantifying Engagement with Citations on Wikipedia , 2020, WWW.

[8]  Brent J. Hecht,et al.  The Substantial Interdependence of Wikipedia and Google: A Case Study on the Relationship Between Peer Production Communities and Information Technologies , 2017, ICWSM.

[9]  Xiaoquan Zhang,et al.  Impact of Wikipedia on Market Information Environment: Evidence on Management Disclosure and Investor Reaction , 2013, MIS Q..

[10]  Aaron Halfaker,et al.  ORES: Lowering Barriers with Participatory Machine Learning in Wikipedia , 2020, Proc. ACM Hum. Comput. Interact..

[11]  Monika Henzinger,et al.  Purely URL-based topic classification , 2009, WWW '09.

[12]  James M. Hyman,et al.  Forecasting the 2013–2014 Influenza Season Using Wikipedia , 2014, PLoS Comput. Biol..

[13]  Jure Leskovec,et al.  Why We Read Wikipedia , 2017, WWW.

[14]  Fabrizio Silvestri,et al.  Improving Post-Click User Engagement on Native Ads via Survival Analysis , 2016, WWW.

[15]  Eric Gilbert,et al.  Faces engage us: photos with faces attract more likes and comments on Instagram , 2014, CHI.

[16]  Tobias Kretschmer,et al.  The Effects of Rewarding User Engagement – The Case of Facebook Apps , 2012, Inf. Syst. Res..

[17]  Fan Zhang,et al.  How Well do Offline and Online Evaluation Metrics Measure User Satisfaction in Web Image Search? , 2018, SIGIR.

[18]  Weiwei Deng,et al.  Model Ensemble for Click Prediction in Bing Search Ads , 2017, WWW.

[19]  Kartik Talamadupula,et al.  Predicting User Engagement on Twitter with Real-World Events , 2015, ICWSM.

[20]  L. Maggio,et al.  Reader engagement with medical content on Wikipedia , 2020, eLife.

[21]  Gabriella Kazai,et al.  Towards a science of user engagement (Position Paper) , 2011 .

[22]  H. Eugene Stanley,et al.  Quantifying Wikipedia Usage Patterns Before Stock Market Moves , 2013, Scientific Reports.

[23]  Neil Thompson,et al.  Science Is Shaped by Wikipedia: Evidence From a Randomized Control Trial , 2018 .

[24]  Florian Lemmerich,et al.  Why the World Reads Wikipedia: Beyond English Speakers , 2018, WSDM.

[25]  Brent J. Hecht,et al.  Examining Wikipedia With a Broader Lens: Quantifying the Value of Wikipedia's Relationships with Other Large-Scale Online Communities , 2018, CHI.

[26]  Yang Song,et al.  Evaluating and predicting user engagement change with degraded search relevance , 2013, WWW.

[27]  Michael Scholz,et al.  AKEGIS: automatic keyword generation for sponsored search advertising in online retailing , 2019, Decis. Support Syst..

[28]  H. Eugene Stanley,et al.  Provided for non-commercial research and education use . Not for reproduction , distribution or commercial use , 2009 .