Features with Feelings - Incorporating User Preferences in Video Categorization

Rapid growth of video content over internet has necessitated an immediate need to organize these large databases into meaningful categories. In this paper, we explore the benefits of leveraging social attitudes (beliefs, opinions, interests and evaluations of people) with machine learning concepts (audio/video features) in the challenging and pressing task of organization of online video databases. Through the analysis of view counts, we model social participation (people's choices) towards a video's contents. Observations reveal that viewership patterns are correlated with video genres. We propose logistic growth models to characterize videos based on usage and obtain a probability of video category. We then combine these subjectively assessed priors with likelihood of video class (as estimated from objective audio/video features) to establish the final category in a Bayesian framework. We provide a comparitive analysis of classification accuracies when a) categories are known a priori b) when they are not known a priori. Experimentally, we establish improvement in classification accuracy upon incorporating social attitudes with state-of-the-art audio/video features.

[1]  Diane J. Cook,et al.  Automatic Video Classification: A Survey of the Literature , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[2]  Jiebo Luo,et al.  Large-scale multimodal semantic concept detection for consumer video , 2007, MIR '07.

[3]  Hong-Jiang Zhang Relevance Feedback in Content-Based Image Retrieval , 2003 .

[4]  Yongdong Zhang,et al.  Google challenge: incremental-learning for web video categorization on robust semantic feature space , 2009, ACM Multimedia.

[5]  Thomas S. Huang,et al.  Relevance feedback in image retrieval: A comprehensive review , 2003, Multimedia Systems.

[6]  Keith B. Hall,et al.  Improved video categorization from text metadata and user comments , 2011, SIGIR '11.

[7]  Didier Sornette,et al.  Viral, Quality, and Junk Videos on YouTube: Separating Content from Noise in an Information-Rich Environment , 2008, AAAI Spring Symposium: Social Information Processing.

[8]  Maja Pantic,et al.  Social signal processing: Survey of an emerging domain , 2009, Image Vis. Comput..

[9]  A. Tsoularis,et al.  Analysis of logistic growth models. , 2002, Mathematical biosciences.

[10]  Hrishikesh B. Aradhye,et al.  Boosting video classification using cross-video signals , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  Baoxin Li,et al.  YouTubeCat: Learning to categorize wild web videos , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12]  Yang Song,et al.  Taxonomic classification for web-based videos , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.