Misleading Metadata Detection on YouTube

YouTube is the leading social media platform for sharing videos. As a result, it is plagued with misleading content, including staged videos presented as real footage of an incident, videos with misrepresented context, and videos whose audio or video content has been morphed. We tackle the problem of detecting such misleading videos as a supervised classification task. We develop UCNet, a deep network for detecting fake videos, and run our experiments on two datasets: VAVD, which we created, and the publicly available FVC [8]. We achieve a macro-averaged F-score of 0.82 when training and testing on a 70:30 split of FVC, whereas the baseline model scores 0.36. We also find that the proposed model generalizes well when trained on one dataset and tested on the other.
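For readers unfamiliar with the metric, the macro-averaged F-score reported above is the unweighted mean of the per-class F1 scores, so the minority class counts as much as the majority class. The following is a minimal sketch of that computation, not the authors' evaluation code; the function name `macro_f1` and the toy labels are assumptions made purely for illustration.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (hypothetical helper)."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1          # correct prediction for class t
        else:
            fp[p] += 1          # p was predicted but is wrong
            fn[t] += 1          # t was the true class but was missed
    scores = []
    for c in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs
        denom = 2 * tp[c] + fp[c] + fn[c]
        scores.append(2 * tp[c] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy labels, invented for illustration: 1 = misleading, 0 = genuine
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 1]
score = macro_f1(y_true, y_pred)
print(round(score, 4))
```

Because each class contributes equally, a model that labels every video as genuine scores poorly on this metric even when genuine videos dominate the data.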

[1] Alex Hai Wang, et al. Don't follow me: Spam detection in Twitter, 2010, 2010 International Conference on Security and Cryptography (SECRYPT).

[2] Yiannis Kompatsiaris, et al. Web Video Verification using Contextual Cues, 2017, MFSec@ICMR.

[3] Qiaozhu Mei, et al. Enquiring Minds: Early Detection of Rumors in Social Media from Enquiry Posts, 2015, WWW.

[4] Cristina Radulescu, et al. Identification of spam comments using natural language processing techniques, 2014, 2014 IEEE 10th International Conference on Intelligent Computer Communication and Processing (ICCP).

[5] Vania Dimitrova, et al. Identifying Relevant YouTube Comments to Derive Socially Augmented User Models: A Semantically Enriched Machine Learning Approach, 2011, UMAP Workshops.

[6] Hila Becker, et al. Learning similarity metrics for event identification in social media, 2010, WSDM '10.

[7] Hector Garcia-Molina, et al. Combating spam in tagging systems, 2008.

[8] Georgia Koutrika, et al. Combating spam in tagging systems, 2007, AIRWeb '07.

[9] Jeffrey Dean, et al. Distributed Representations of Words and Phrases and their Compositionality, 2013, NIPS.

[10] Krishna P. Gummadi, et al. Towards Detecting Anomalous User Behavior in Online Social Networks, 2014, USENIX Security Symposium.

[11] Claire Cardie, et al. Finding Deceptive Opinion Spam by Any Stretch of the Imagination, 2011, ACL.

[12] V. Dimitrova, et al. Semantically Enriched Machine Learning Approach to Filter YouTube Comments for Socially Augmented User Models, 2011.