Behind the scenes: a cross-country study into third-party website referencing and the online advertising ecosystem

The ubiquitous nature of the Internet provides an ideal platform for human communication, trade, information sharing and learning. Websites play a central role in these activities as they often act as a key point of interaction for individuals in navigating through cyberspace. In this article, we look beyond the visual interface of websites to consider exactly what occurs when a webpage is visited. In particular, we focus on the various web scripts that are often programmatically executed, to explore the extent to which third-party sites are referenced. Our aim is to study these references and the ecosystem that they create. To gain maximal impact while also allowing for a cross-country comparison, our study is scoped to an assessment of the top 250 sites in the UK, USA, Germany, Russia and Japan. From our analysis, there are various novel contributions of note. These include the empirical identification of a vast ecosystem of third-party information processing sites, especially advertisement networks, and the evidential discovery of a few significant players irrespective of country and locale. Through a user study, we also find that while individuals do have some knowledge of the prevalence of advertisements in websites, their understanding of the variety of activities that occur upon visiting websites, is not widely known. Going forward, we therefore advocate for increased transparency in such activities and the wider online advertisement ecosystem.

[1]  Daniel R. Horne,et al.  The Privacy Paradox: Personal Information Disclosure Intentions versus Behaviors , 2007 .

[2]  Leveraging “ big data ” to optimize digital marketing , .

[3]  Joey F. George,et al.  Website Credibility Assessment: An Empirical - Investigation of Prominence-interpretation Theory , 2016, AIS Trans. Hum. Comput. Interact..

[4]  Noah A. Smith,et al.  Crowdsourcing Annotations for Websites' Privacy Policies: Can It Really Work? , 2016, WWW.

[5]  Lujo Bauer,et al.  (Do Not) Track Me Sometimes: Users’ Contextual Preferences for Web Tracking , 2016, Proc. Priv. Enhancing Technol..

[6]  Sadie Creese,et al.  Future scenarios and challenges for security and privacy , 2016, 2016 IEEE 2nd International Forum on Research and Technologies for Society and Industry Leveraging a better tomorrow (RTSI).

[7]  Peng Liu,et al.  A Machine Learning Approach for Detecting Third-Party Trackers on the Web , 2016, ESORICS.

[8]  Marco Mellia,et al.  The Online Tracking Horde: A View from Passive Measurements , 2015, TMA.

[9]  Steve Uhlig,et al.  Tracking Personal Identifiers Across the Web , 2016, PAM.

[10]  Ashraf Matrawy,et al.  A classification of web browser fingerprinting techniques , 2015, 2015 7th International Conference on New Technologies, Mobility and Security (NTMS).

[11]  Yang Wang,et al.  What matters to users?: factors that affect users' willingness to share information with online advertisers , 2013, SOUPS.

[12]  Chiuhsiang Joe Lin,et al.  Exploring the design criteria of website interfaces for gender , 2016 .

[13]  Walter Rudametkin,et al.  Beauty and the Beast: Diverting Modern Web Browsers to Build Unique Browser Fingerprints , 2016, 2016 IEEE Symposium on Security and Privacy (SP).

[14]  Michael Trusov,et al.  Crumbs of the Cookie: User Profiling in Customer-Base Analysis and Behavioral Targeting , 2016, Mark. Sci..

[15]  H. Borko Information science: What is it? , 1968 .

[16]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[17]  Yuqing Zhang,et al.  TrackerDetector: A system to detect third-party trackers through machine learning , 2015, Comput. Networks.

[18]  坂本一仁 Do Not Embarrass: Re-Examining User Concerns for Online Tracking and Advertisingの紹介 , 2013 .

[19]  Sadie Creese,et al.  Inferring social relationships from technology-level device connections , 2014, 2014 Twelfth Annual International Conference on Privacy, Security and Trust.

[20]  Qiuzhen Wang,et al.  An eye-tracking study of website complexity from cognitive load perspective , 2014, Decis. Support Syst..

[21]  Wouter Joosen,et al.  PriVaricator: Deceiving Fingerprinters with Little White Lies , 2015, WWW.

[22]  Michalis Faloutsos,et al.  TrackAdvisor: Taking Back Browsing Privacy from Third-Party Trackers , 2015, PAM.