To swing or not to swing: learning when (not) to advertise

Web textual advertising can be interpreted as a search problem over the corpus of ads available for display in a particular context. In contrast to conventional information retrieval systems, which always return results if the corpus contains any documents lexically related to the query, in Web advertising it is acceptable, and occasionally even desirable, not to show any results. When no ads are relevant to the user's interests, then showing irrelevant ads should be avoided since they annoy the user and produce no economic benefit. In this paper we pose a decision problem "whether to swing", that is, whether or not to show any of the ads for the incoming request. We propose two methods for addressing this problem, a simple thresholding approach and a machine learning approach, which collectively analyzes the set of candidate ads augmented with external knowledge. Our experimental evaluation, based on over 28,000 editorial judgments, shows that we are able to predict, with high accuracy, when to "swing" for both content match and sponsored search advertising.

[1]  W. Bruce Croft,et al.  Predicting query performance , 2002, SIGIR '02.

[2]  Andrea Esuli,et al.  Best bets: thousands of queries in search of a client , 2004, WWW Alt. '04.

[3]  Javed A. Aslam,et al.  Relevance score normalization for metasearch , 2001, CIKM '01.

[4]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[5]  C. Yoo,et al.  Preattentive Processing of Web Advertising , 2007 .

[6]  Vassilis Plachouras,et al.  SEMANTIC ASSOCIATIONS FOR CONTEXTUAL ADVERTISING , 2008 .

[7]  W. Bruce Croft,et al.  Relevance-Based Language Models , 2001, SIGIR '01.

[8]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[9]  Ping Zhang,et al.  UNDERSTANDING CONSUMERS ATTITUDE TOWARD ADVERTISING , 2002 .

[10]  R. Manmatha,et al.  Modeling score distributions for combining the outputs of search engines , 2001, SIGIR '01.

[11]  Vassilis Plachouras,et al.  A noisy-channel approach to contextual advertising , 2007, ADKDD '07.

[12]  Weiguo Fan,et al.  Learning to advertise , 2006, SIGIR.

[13]  W. Bruce Croft,et al.  Query performance prediction in web search environments , 2007, SIGIR.

[14]  Leonid Zhukov,et al.  Clustering of bipartite advertiser-keyword graph , 2003 .

[15]  Katherine Gallagher,et al.  The Medium Is Not the Message: Advertising Effectiveness and Content Evaluation in Print and on the Web , 2001, Journal of Advertising Research.

[16]  P. Chatterjee,et al.  Modeling the Clickstream: Implications for Web-Based Advertising Efforts , 2003 .

[17]  Xin Jin,et al.  Sensitive webpage classification for content advertising , 2007, ADKDD '07.

[18]  Gareth Quin Best BETs , 2001 .

[19]  Daniel A. Keim,et al.  On Knowledge Discovery and Data Mining , 1997 .

[20]  Iadh Ounis,et al.  Inferring Query Performance Using Pre-retrieval Predictors , 2004, SPIRE.

[21]  Ellen M. Voorhees,et al.  Retrieval evaluation with incomplete information , 2004, SIGIR '04.

[22]  Hemant K. Bhargava,et al.  Implementing Sponsored Search in Web Search Engines: Computational Evaluation of Alternative Mechanisms , 2007, INFORMS J. Comput..

[23]  Vassilis Plachouras,et al.  Online learning from click data for sponsored search , 2008, WWW.

[24]  Philip Koehn,et al.  Statistical Machine Translation , 2010, EAMT.

[25]  Andrei Z. Broder,et al.  A semantic approach to contextual advertising , 2007, SIGIR.

[26]  David Yarowsky,et al.  Statistical Machine Translation: Final Report , 1999 .

[27]  Thorsten Joachims,et al.  Learning to classify text using support vector machines - methods, theory and algorithms , 2002, The Kluwer international series in engineering and computer science.

[28]  Mehran Sahami,et al.  A web-based kernel function for measuring the similarity of short text snippets , 2006, WWW '06.

[29]  Berthier A. Ribeiro-Neto,et al.  Impedance coupling in content-targeted advertising , 2005, SIGIR '05.

[30]  John Cocke,et al.  A Statistical Approach to Machine Translation , 1990, CL.

[31]  Susan T. Dumais,et al.  Similarity Measures for Short Segments of Text , 2007, ECIR.

[32]  Joshua Goodman,et al.  Finding advertising keywords on web pages , 2006, WWW '06.