Advertising-based Measurement: A Platform of 7 Billion Mobile Devices

The most important step in an empirical computer scientist's research is gathering sufficient real-world data to validate a system. Unfortunately, it is also one of the most time-consuming and expensive tasks: placing measurement tools in remote networks or end-clients requires one to marshal resources from different administrative domains, devices, populations, and countries. Often such efforts culminate in a trace that is deficient in multiple ways: a small set of test subjects, a short time frame, missing ground truth for device IDs, networking environments lacking in diversity and geographic spread, or highly biased sampling. We present a method of addressing these challenges by leveraging the most open and globally accessible test and measurement platform: digital advertising. Digital advertising instantly provides a window into 7 billion devices spanning every county for an extremely low cost. We propose Advertising as a Platform (AaaP), an ad-based system to perform massive-scale mobile measurement studies. In contrast with measurements made by large media companies who own platforms, ad networks, and apps, we concentrate on the opportunities and challenges for researchers that are end-users of advertising systems. We evaluate a prototype system, discuss ethical guidelines, and demonstrate its use in four scenarios: IP2Geo databases, bandwidth measurement, energy management, and the identifiability of mobile users. We show the efficacy and ease-of-use of AaaP, and illuminate key challenges and the great promise of using AaaP to study a wide variety of mobile phenomena.

[1]  Clayton Shepard,et al.  LiveLab: measuring wireless networks and smartphone users in the field , 2011, SIGMETRICS Perform. Evaluation Rev..

[2]  Hui Chen,et al.  User-centric power management for mobile operating systems , 2015 .

[3]  Xuxian Jiang,et al.  Unsafe exposure analysis of mobile in-app advertisements , 2012, WISEC '12.

[4]  Brian Neil Levine,et al.  A retrospective look at the UMass DOME mobile testbed , 2012, MOCO.

[5]  Dipankar Raychaudhuri,et al.  The WINLAB Network Centric Cognitive Radio Hardware Platform—WiNC2R , 2008, 2007 2nd International Conference on Cognitive Radio Oriented Wireless Networks and Communications.

[6]  Peter G. Harrison,et al.  Collecting battery data with Open Battery , 2012, ICCSW.

[7]  Wentong Li,et al.  Estimating conversion rate in display advertising from past erformance data , 2012, KDD.

[8]  Eric Eide,et al.  PhantomNet: Research Infrastructure for Mobile Networking, Cloud Computing and Software-Defined Networking , 2015, GETMBL.

[9]  Ahmad Rahmati,et al.  Users and Batteries: Interactions and Adaptive Energy Management in Mobile Systems , 2007, UbiComp.

[10]  Arvind Narayanan,et al.  Online Tracking: A 1-million-site Measurement and Analysis , 2016, CCS.

[11]  Hovav Shacham,et al.  Fingerprinting Information in JavaScript Implementations , 2011 .

[12]  Weijia Jia,et al.  Mobility: A Double-Edged Sword for HSPA Networks: A Large-Scale Test on Hong Kong Mobile HSPA Networks , 2010, IEEE Transactions on Parallel and Distributed Systems.

[13]  Alastair R. Beresford,et al.  Device Analyzer: Understanding Smartphone Usage , 2013, MobiQuitous.

[14]  Narseo Vallina-Rodriguez,et al.  Breaking for commercials: characterizing mobile advertising , 2012, Internet Measurement Conference.

[15]  Michael Carl Tschantz,et al.  Automated Experiments on Ad Privacy Settings , 2014, Proc. Priv. Enhancing Technol..

[16]  Azeem J. Khan,et al.  CAMEO: a middleware for mobile advertisement delivery , 2013, MobiSys '13.

[17]  Peter Eckersley,et al.  How Unique Is Your Web Browser? , 2010, Privacy Enhancing Technologies.

[18]  Suman Nath,et al.  Prefetching mobile ads: can advertising systems afford it? , 2013, EuroSys '13.

[19]  Nikhil R. Devanur,et al.  Real-time bidding algorithms for performance-based display ad allocation , 2011, KDD.

[20]  David A. Wagner,et al.  AdDroid: privilege separation for applications and advertisers in Android , 2012, ASIACCS '12.

[21]  Sasu Tarkoma,et al.  Characterizing smartphone power management in the wild , 2016, UbiComp Adjunct.

[22]  Ahmed Elmokashfi,et al.  Measuring the Reliability of Mobile Broadband Networks , 2014, Internet Measurement Conference.

[23]  Steve Uhlig,et al.  IP geolocation databases: unreliable? , 2011, CCRV.

[24]  Ahmad Rahmati,et al.  Understanding human-battery interaction on mobile phones , 2007, Mobile HCI.

[25]  Brian Neil Levine,et al.  Turning Off GPS Is Not Enough: Cellular Location Leaks over the Internet , 2013, Privacy Enhancing Technologies.

[26]  Ayman Farahat How effective is targeted advertising? , 2013, ACC.

[27]  Samir Ranjan Das,et al.  Understanding traffic dynamics in cellular data networks , 2011, 2011 Proceedings IEEE INFOCOM.

[28]  Ayman Farahat,et al.  How effective is targeted advertising? , 2012, 2013 American Control Conference.

[29]  D. Dittrich,et al.  The Menlo Report: Ethical Principles Guiding Information and Communication Technology Research , 2012 .

[30]  Claude Castelluccia,et al.  Betrayed by Your Ads! - Reconstructing User Profiles from Targeted Ads , 2012, Privacy Enhancing Technologies.

[31]  Shashi Shekhar,et al.  AdSplit: Separating Smartphone Advertising from Applications , 2012, USENIX Security Symposium.

[32]  Hao Chen,et al.  Investigating User Privacy in Android Ad Libraries , 2012 .

[33]  Liviu Iftode,et al.  Context-aware Battery Management for Mobile Phones , 2008, 2008 Sixth Annual IEEE International Conference on Pervasive Computing and Communications (PerCom).

[34]  Suman Nath,et al.  MAdScope: Characterizing Mobile In-App Targeted Ads , 2015, MobiSys.

[35]  Nick Feamster,et al.  Encore: Lightweight Measurement of Web Censorship with Cross-Origin Requests , 2015, SIGCOMM.

[36]  Kyunghan Lee,et al.  Mobile Data Offloading: How Much Can WiFi Deliver? , 2013, IEEE/ACM Transactions on Networking.

[37]  Qiang Ma,et al.  Adscape: harvesting and analyzing online display ads , 2014, WWW.

[38]  Arvind Narayanan,et al.  No Encore for Encore? Ethical questions for web-based censorship measurement , 2015 .

[39]  Narseo Vallina-Rodriguez,et al.  Exhausting battery statistics: understanding the energy demands on mobile handsets , 2010, MobiHeld '10.

[40]  Mahesh Balakrishnan,et al.  Where's that phone?: geolocating IP addresses on 3G networks , 2009, IMC '09.

[41]  Weinan Zhang,et al.  Optimal real-time bidding for display advertising , 2014, KDD.

[42]  Saikat Guha,et al.  Challenges in measuring online advertising systems , 2010, IMC '10.

[43]  Deborah Estrin,et al.  Diversity in smartphone usage , 2010, MobiSys '10.

[44]  Hovav Shacham,et al.  Pixel Perfect : Fingerprinting Canvas in HTML 5 , 2012 .

[45]  Sasu Tarkoma,et al.  Carat: collaborative energy diagnosis for mobile devices , 2013, SenSys '13.

[46]  E. Weippl,et al.  Fast and Reliable Browser Identification with JavaScript Engine Fingerprinting , 2013 .

[47]  Cecilia Mascolo,et al.  Don't kill my ads!: balancing privacy in an ad-supported mobile application market , 2012, HotMobile '12.

[48]  Nick Feamster,et al.  Ethical Concerns for Censorship Measurement , 2015, NS Ethics@SIGCOMM.

[49]  Weijia Jia,et al.  Mobility: A Double-Edged Sword for HSPA Networks: A Large-Scale Test on Hong Kong Mobile HSPA Networks , 2012, IEEE Transactions on Parallel and Distributed Systems.

[50]  Brendan T. O'Connor,et al.  Censorship and deletion practices in Chinese social media , 2012, First Monday.

[51]  Edgar R. Weippl,et al.  SHPF: Enhancing HTTP(S) Session Security with Browser Fingerprinting , 2013, 2013 International Conference on Availability, Reliability and Security.

[52]  Anirban Mahanti,et al.  Geolocating IP Addresses in Cellular Data Networks , 2012, PAM.

[53]  Simson L. Garfinkel,et al.  IRBs and Security Research: Myths, Facts and Mission Creep , 2008, UPSEC.

[54]  Arun Venkataramani,et al.  Towards a quantitative comparison of location-independent network architectures , 2014, SIGCOMM.

[55]  Wen Zhang,et al.  How much can behavioral targeting help online advertising? , 2009, WWW '09.

[56]  David Wetherall,et al.  Detecting and Defending Against Third-Party Tracking on the Web , 2012, NSDI.

[57]  Chunming Qiao,et al.  PhoneLab: A Large Programmable Smartphone Testbed , 2013, SENSEMINE@SenSys.

[58]  D. Zucker The Belmont Report , 2014 .

[59]  Roksana Boreli,et al.  Characterising user targeting for in-App Mobile Ads , 2014, 2014 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).