Visual memes in social media: tracking real-world news in YouTube videos

We propose visual memes, or frequently reposted short video segments, for tracking large-scale video remix in social media. Visual memes are extracted by novel and highly scalable detection algorithms that we develop, with over 96% precision and 80% recall. We monitor real-world events on YouTube and model the interactions with a graph over memes, in which people and content are nodes and meme postings are links. This model lets us define several measures of influence. Applied to more than two million video shots from several large-scale event datasets, these abstractions let us quantify and efficiently extract several observations: over half of the videos contain remixed content, which appears rapidly; video view counts, particularly high ones, are poorly correlated with the virality of content; the influence of traditional news media versus citizen journalists varies from event to event; iconic single images of an event are easily extracted; and content that will have a long lifespan can be predicted within a day after it first appears. Visual memes can be applied to a number of social media scenarios, including brand monitoring, social buzz tracking, and ranking content and users.
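The graph model described in the abstract, with people and content as nodes and meme postings as links, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the sample postings, the function names `build_meme_graph` and `later_poster_counts`, and the influence score itself (the number of distinct other authors who posted the same meme later). It is not the influence measure defined in the paper.

```python
from collections import defaultdict

# Hypothetical postings: (author, meme_id, timestamp). Values are illustrative only.
postings = [
    ("alice", "m1", 1),
    ("bob", "m1", 2),
    ("carol", "m1", 3),
    ("bob", "m2", 1),
    ("alice", "m2", 4),
]

def build_meme_graph(postings):
    """Build author->meme and meme->author links, keeping each earliest posting time."""
    author_to_memes = defaultdict(dict)
    meme_to_authors = defaultdict(dict)
    for author, meme, t in postings:
        prev = author_to_memes[author].get(meme)
        if prev is None or t < prev:
            author_to_memes[author][meme] = t
            meme_to_authors[meme][author] = t
    return author_to_memes, meme_to_authors

def later_poster_counts(postings):
    """Assumed influence proxy: distinct other authors who posted a shared meme later."""
    author_to_memes, meme_to_authors = build_meme_graph(postings)
    scores = {}
    for author, memes in author_to_memes.items():
        later = set()
        for meme, t in memes.items():
            for other, t_other in meme_to_authors[meme].items():
                if other != author and t_other > t:
                    later.add(other)
        scores[author] = len(later)
    return scores

scores = later_poster_counts(postings)
```

In this toy data, an author scores higher when more distinct people post the same meme after them, which is one simple way a posting graph can separate early sources from late reposters.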
