You must have clicked on this ad by mistake! Data-driven identification of accidental clicks on mobile ads with applications to advertiser cost discounting and click-through rate prediction

In the cost per click pricing model, an advertiser pays an ad network only when a user clicks on an ad; in turn, the ad network gives a share of that revenue to the publisher where the ad was impressed. Still, advertisers may be unsatisfied with ad networks charging them for “valueless” clicks, or so-called accidental clicks. These happen when users click on an ad, are redirected to the advertiser website and bounce back without spending any time on the ad landing page. Charging advertisers for such clicks is detrimental in the long term as the advertiser may decide to run their campaigns on other ad networks. In addition, machine-learned click models trained to predict which ad will bring the highest revenue may overestimate an ad click-through rate, and as a consequence negatively impacting revenue for both the ad network and the publisher. In this work, we propose a data-driven method to detect accidental clicks from the perspective of the ad network. We collect observations of time spent by users on a large set of ad landing pages—i.e., dwell time. We notice that the majority of per-ad distributions of dwell time fit to a mixture of distributions, where each component may correspond to a particular type of clicks, the first one being accidental. We then estimate dwell time thresholds of accidental clicks from that component. Using our method to identify accidental clicks, we then propose a technique that smoothly discounts the advertiser’s cost of accidental clicks at billing time. Experiments conducted on a large dataset of ads served on Yahoo mobile apps confirm that our thresholds are stable over time, and revenue loss in the short term is marginal. We also compare the performance of an existing machine-learned click model trained on all ad clicks with that of the same model trained only on non-accidental clicks. There, we observe an increase in both ad click-through rate (+ 3.9%) and revenue (+ 0.2%) on ads served by the Yahoo Gemini network when using the latter. These two applications validate the need to consider accidental clicks for both billing advertisers and training ad click models.

[1]  Hema Raghavan,et al.  A relevance model based filter for improving ad quality , 2009, SIGIR.

[2]  Ryen W. White,et al.  Understanding web browsing behaviors through Weibull analysis of dwell time , 2010, SIGIR.

[3]  Matthew Goldman,et al.  Experiments as Instruments: Heterogeneous Position Effects in Sponsored Search Auctions , 2015, EAI Endorsed Trans. Serious Games.

[4]  Ram Akella,et al.  Dynamic effects of ad impressions on commercial actions in display advertising , 2012, CIKM.

[5]  Ricardo Baeza-Yates,et al.  Scalable Semantic Matching of Queries to Ads in Sponsored Search Advertising , 2016, ArXiv.

[6]  Peter Schlattmann,et al.  Medical Applications of Finite Mixture Models , 2009 .

[7]  Fabrizio Silvestri,et al.  Improving Post-Click User Engagement on Native Ads via Survival Analysis , 2016, WWW.

[8]  Athanasios Papoulis,et al.  Probability, Random Variables and Stochastic Processes , 1965 .

[9]  Ryen W. White,et al.  Modeling dwell time to predict click-level satisfaction , 2014, WSDM.

[10]  Daniela M. Witten,et al.  An Introduction to Statistical Learning: with Applications in R , 2013 .

[11]  Rómer Rosales,et al.  Post-click conversion modeling and analysis for non-guaranteed delivery display advertising , 2012, WSDM '12.

[12]  Dragomir Yankov,et al.  Categorization of display ads using image and landing page features , 2011, LDMTA '11.

[13]  I. Miller Probability, Random Variables, and Stochastic Processes , 1966 .

[14]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[15]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[16]  Eve E. Hoggan,et al.  An exploration of inadvertent variations in mobile pressure input , 2012, Mobile HCI.

[17]  Hila Becker,et al.  What happens after an ad click?: quantifying the impact of landing pages in web advertising , 2009, CIKM.

[18]  Dustin Hillard,et al.  A predictive model for advertiser value-per-click in sponsored search , 2013, WWW.

[19]  Xiaoli Z. Fern,et al.  Visual appearance of display ads and its effect on click through rate , 2012, CIKM.

[20]  Christopher Krügel,et al.  Understanding fraudulent activities in online ad exchanges , 2011, IMC '11.

[21]  Suju Rajan,et al.  Beyond clicks: dwell time for personalization , 2014, RecSys '14.

[22]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[23]  Peifeng Yin,et al.  Silence is also evidence: interpreting dwell time for recommendation from psychological perspective , 2013, KDD.

[24]  L. Brown,et al.  Interval Estimation for a Binomial Proportion , 2001 .

[25]  Fabrizio Silvestri,et al.  Promoting Positive Post-Click Experience for In-Stream Yahoo Gemini Users , 2015, KDD.

[26]  D. Sculley,et al.  Predicting bounce rates in sponsored search advertisements , 2009, KDD.

[27]  Neil Daswani,et al.  The Anatomy of Clickbot.A , 2007, HotBots.

[28]  B. Lindsay Mixture models : theory, geometry, and applications , 1995 .

[29]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[30]  P. Nurmi Mixture Models , 2008 .