Display Advertising with Real-Time Bidding (RTB) and Behavioural Targeting

Online advertising is now one of the fastest-advancing areas in the IT industry. In display and mobile advertising, the most significant technical development in recent years is the growth of Real-Time Bidding (RTB), which runs a real-time auction for each display opportunity. In essence, RTB allows an individual ad impression to be bought in real time, while it is still being generated from a user's visit. RTB not only scales up the buying process by aggregating a large volume of available inventory across publishers but, most importantly, enables direct targeting of individual users. As such, RTB has fundamentally changed the landscape of digital marketing. Scientifically, the demand for automation, integration and optimization in RTB also brings new research opportunities in information retrieval, data mining, machine learning and other related fields. Despite its rapid growth and huge potential, many aspects of RTB remain unknown to the research community for a variety of reasons. This monograph offers insight into real-world systems, aims to bridge the gap between industry and academia, and provides an overview of the fundamental infrastructure, algorithms, and technical and research challenges of this new frontier of computational advertising. The topics covered include user response prediction, bid landscape forecasting, bidding algorithms, revenue optimization, statistical arbitrage, dynamic pricing, and ad fraud detection. This text is valuable for researchers and practitioners alike: academic researchers will gain a better understanding of the real-time online advertising systems currently deployed in industry, while industry practitioners are introduced to the research challenges, the state-of-the-art algorithms, and potential future systems in this field.
