Linking visual concept detection with viewer demographics

The estimation of demographic target groups for web videos -- with applications in ad targeting -- poses a challenging problem, as the textual description and view statistics available for many clips is extremely sparse. Therefore, the goal of this paper is to link a clip's popularity across different viewer ages and genders on the one hand with the video content on the other: Employing user comments and user profiles on YouTube, we show that there is a strong correlation between demographic target groups and semantic concepts appearing in the video (like "teenage male" and "skateboarding"). Based on this observation, we suggest two approaches: First, the demographic target group of a clip is predicted automatically via a content-based concept detection. Second, should sufficient view statistics already give a good impression of a video's audience, we show that this information can serve as a valuable additional signal to disambiguate concept detection. Our experimental results on a dataset of 14,000 YouTube clips commented by 1 mio. users show that -- though content-based viewership estimation is a challenging problem -- suitable demographic groups can be suggested by concept detection. Also, a combination with demographic information as an additional signal leads to relative improvements of concept detection accuracy by 47%.

[1]  Theodoros Bozios,et al.  Advanced Techniques for Personalized Advertising in a Digital TV Environment : The iMEDIA System , 2001 .

[2]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[3]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[4]  Thomas Hofmann,et al.  Unsupervised Learning by Probabilistic Latent Semantic Analysis , 2004, Machine Learning.

[5]  N. Hollis Ten Years of Learning on How Online Advertising Builds Brands , 2005, Journal of Advertising Research.

[6]  Alan F. Smeaton,et al.  Large Scale Evaluations of Multimedia Information Retrieval: The TRECVid Experience , 2005, CIVR.

[7]  Shih-Fu Chang,et al.  To search or to label?: predicting the performance of search-based automatic image classifiers , 2006, MIR '06.

[8]  Joshua Goodman,et al.  Finding advertising keywords on web pages , 2006, WWW '06.

[9]  Hua Li,et al.  Demographic prediction based on user's browsing behavior , 2007, WWW '07.

[10]  Hsuan-Tien Lin,et al.  A note on Platt’s probabilistic outputs for support vector machines , 2007, Machine Learning.

[11]  Tao Mei,et al.  VideoSense: a contextual video advertising system , 2007, ACM Multimedia.

[12]  Srinivasan H. Sengamedu,et al.  vADeo: video advertising system , 2007, ACM Multimedia.

[13]  Pietro Perona,et al.  A walk through the web’s video clips , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[14]  Marcel Worring,et al.  Learning tag relevance by neighbor voting for social image retrieval , 2008, MIR '08.

[15]  Bernard J. Jansen,et al.  Sponsored search: an overview of the concept, history, and technology , 2008, Int. J. Electron. Bus..

[16]  Tao Mei,et al.  Contextual in-image advertising , 2008, ACM Multimedia.

[17]  Jun Yang,et al.  (Un)Reliability of video concept detection , 2008, CIVR '08.

[18]  Hrishikesh B. Aradhye,et al.  Video2Text: Learning to Annotate Video Content , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[19]  Wei-Ying Ma,et al.  Argo: intelligent advertising by mining a user's interest from his photo collections , 2009, KDD Workshop on Data Mining and Audience Intelligence for Advertising.

[20]  Wen Zhang,et al.  How much can behavioral targeting help online advertising? , 2009, WWW '09.

[21]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[22]  Marcel Worring,et al.  Concept-Based Video Retrieval , 2009, Found. Trends Inf. Retr..

[23]  Markus Koch,et al.  TubeTagger - YouTube-based Concept Detection , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[24]  George Karypis,et al.  Content-Based Methods for Predicting Web-Site Demographic Attributes , 2010, 2010 IEEE International Conference on Data Mining.

[25]  Adrian Ulges,et al.  Visual Concept Learning from Weakly Labeled Web Videos , 2010, Video Search and Mining.

[26]  Mohan S. Kankanhalli,et al.  Multimodal fusion for multimedia analysis: a survey , 2010, Multimedia Systems.

[27]  Daniel P. W. Ellis,et al.  IBM Research and Columbia University TRECVID-2011 Multimedia Event Detection (MED) System , 2011, TRECVID.