Automated Coding of Political Video Ads for Political Science Research

With the advent of new media technology and the ability to identify more information about potential voters, political campaigns have aggressively changed their campaign strategies. Election campaigns increasingly rely on online video advertising to reach voters. Until now, the contents of these ads are manually coded for political science research to study campaign strategies. Manual coding is tremendously time consuming and not scalable to handle the expected increase in online ads. We make the first attempt to investigate automated coding of the content of political video ads for political science research. Specifically, we focus on the problem of classifying a political ad into one of these categories: attack ads, promoting ads, and contrast ads. Together with the domain expert, we introduce a concrete definition for each of these categories. We made available the ground truth labels of 773 political ads of the 2016 primary presidential election. We investigate the effectiveness of several classifiers using single modality and two modalities. The best average F1 score is 0.845 using text features from audio and embedded text in image frames.

[1]  Mohan S. Kankanhalli,et al.  SalAd: A Multimodal Approach for Contextual Video Advertising , 2015, 2015 IEEE International Symposium on Multimedia (ISM).

[2]  Jun Hu,et al.  Advertising object in web videos , 2013, Neurocomputing.

[3]  Andreas Jungherr,et al.  Hacking the electorate: How campaigns perceive voters , 2017 .

[4]  Srinivasan H. Sengamedu,et al.  vADeo: video advertising system , 2007, ACM Multimedia.

[5]  Matthew A. Shapiro,et al.  What's congress doing on twitter? , 2013, CSCW.

[6]  Daniel M. Butler,et al.  A Field Experiment on Legislators’ Home Styles: Service versus Policy , 2012 .

[7]  Hui Zhang,et al.  A sports video browsing and retrieval system based on multimodal analysis: SportsBR , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[8]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[9]  Winston H. Hsu,et al.  AdImage: video advertising by image matching and ad scheduling optimization , 2008, SIGIR '08.

[10]  Naci Karkin,et al.  Political use of twitter: the case of metropolitan mayor candidates in 2014 local elections in Turkey , 2014, ICEGOV.

[11]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying scene breaks , 1995, MULTIMEDIA '95.

[12]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[13]  Ling Guan,et al.  Recognizing Human Emotional State From Audiovisual Signals , 2008, IEEE Transactions on Multimedia.

[14]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[15]  Changsheng Xu,et al.  Using Webcast Text for Semantic Event Detection in Broadcast Sports Video , 2008, IEEE Transactions on Multimedia.

[16]  Raahat Devender Singh,et al.  Novel Research in the Field of Shot Boundary Detection - A Survey , 2014, ISI.

[17]  Theodoros Giannakopoulos pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis , 2015, PloS one.

[18]  Anthony Moretti The Persuadable Voter: Wedge Issues in Presidential Campaigns , 2010 .

[19]  Mohamed Hefeeda,et al.  SmartAd: A smart system for effective advertising in online videos , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[20]  Andreas Jungherr Twitter in Politics: A Comprehensive Literature Review , 2014 .

[21]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[22]  Qi Tian,et al.  A unified framework for semantic shot classification in sports video , 2005, IEEE Trans. Multim..

[23]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[24]  Tao Mei,et al.  VideoSense: towards effective online video advertising , 2007, ACM Multimedia.

[25]  Erik Cambria,et al.  SenticNet 3: A Common and Common-Sense Knowledge Base for Cognition-Driven Sentiment Analysis , 2014, AAAI.

[26]  Yi-Hsuan Yang,et al.  Machine Recognition of Music Emotion: A Review , 2012, TIST.

[27]  Chng Eng Siong,et al.  Soccer replay detection using scene transition structure analysis , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[28]  Libby Hemphill,et al.  Tweet acts: how constituents lobby congress via Twitter , 2014, CSCW.

[29]  Tao Mei,et al.  AdOn: an intelligent overlay video advertising system , 2009, SIGIR.

[30]  Uma Mudenagudi,et al.  A Study on Keyframe Extraction Methods for Video Summary , 2011, 2011 International Conference on Computational Intelligence and Communication Networks.

[31]  Steven Skiena,et al.  Lydia: A System for Large-Scale News Analysis , 2005, SPIRE.

[32]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[33]  Ling Guan,et al.  Kernel Cross-Modal Factor Analysis for Information Fusion With Application to Bimodal Emotion Recognition , 2012, IEEE Transactions on Multimedia.