Language Model Mixtures for Contextual Ad Placement in Personal Blogs

We introduce a method for content-based advertisement selection for personal blog pages, based on combining multiple representations of the blog. The core idea behind the method is that personal blogs represent individuals, whose interests can be modeled by the language used in the blog itself combined with the language used in related sources of information, such as comments posted to a blog post or the blogger’s community. An evaluation of our ad placement method shows improvement over state-of-the-art ad placement methods which were not designed for blog pages.

[1]  Joshua Goodman,et al.  Finding advertising keywords on web pages , 2006, WWW '06.

[2]  Eric Miraglia,et al.  Predicting the Future of Internet Advertising , 2005 .

[3]  Craig MacDonald,et al.  On the TREC Blog Track , 2021, ICWSM.

[4]  Lois Ann Scheidt,et al.  Bridging the gap: a genre analysis of Weblogs , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[5]  Bernardo A. Huberman,et al.  Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[6]  Ping Zhang,et al.  UNDERSTANDING CONSUMERS ATTITUDE TOWARD ADVERTISING , 2002 .

[7]  Djoerd Hiemstra,et al.  Using language models for information retrieval , 2001 .

[8]  Gilad Mishne,et al.  Predicting Movie Sales from Blogger Sentiment , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[9]  Gilad Mishne,et al.  Deriving wishlists from blogs show us your blog, and we'll tell you what books to buy , 2006, WWW '06.

[10]  Hemant K. Bhargava,et al.  Paid placement strategies for internet search engines , 2002, WWW '02.

[11]  Berthier A. Ribeiro-Neto,et al.  Impedance coupling in content-targeted advertising , 2005, SIGIR '05.

[12]  Bernardo A. Huberman,et al.  The Structure of Collaborative Tagging Systems , 2005, ArXiv.

[13]  Luc De Raedt,et al.  Machine Learning: ECML 2001 , 2001, Lecture Notes in Computer Science.

[14]  Naoki Abe,et al.  Unintrusive Customization Techniques for Web Advertising , 1999, Comput. Networks.

[15]  Tefko Saracevic,et al.  Information science: What is it? , 1968 .

[16]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[17]  Ravi Kumar,et al.  On the Bursty Evolution of Blogspace , 2003, WWW '03.

[18]  Adam Tauman Kalai,et al.  On-line algorithms for combining language models , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[19]  Victor Lavrenko,et al.  Optimal Mixture Models in IR , 2002, ECIR.

[20]  Michelle Gumbrecht,et al.  Blogs as “Protected Space” , 2004 .

[21]  M. de Rijke,et al.  Monolingual Document Retrieval for European Languages , 2004, Information Retrieval.

[22]  R. Rosenfeld,et al.  Two decades of statistical language modeling: where do we go from here? , 2000, Proceedings of the IEEE.

[23]  Ramanathan V. Guha,et al.  The predictive power of online chatter , 2005, KDD '05.

[24]  Ralf D. Brown,et al.  Applying Statistical English Language Modelling to Symbolic Machine Translation , 1995, TMI.

[25]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[26]  Donna L. Hoffman,et al.  New metrics for new media: toward the development of Web measurement standards , 1997, World Wide Web J..