论文信息 - CHECKER: Detecting Clickbait Thumbnails with Weak Supervision and Co-teaching

CHECKER: Detecting Clickbait Thumbnails with Weak Supervision and Co-teaching

Clickbait thumbnails on video-sharing platforms (e.g., YouTube, Dailymotion) are small catchy images that are designed to entice users to click to view the linked videos. Despite their usefulness, the landing videos after click are often inconsistent with what the thumbnails have advertised, causing poor user experience and undermining the reputation of the platforms. In this work, therefore, we aim to develop a computational solution, named as CHECKER, to detect clickbait thumbnails with high accuracy. Due to the fuzziness in the definition of clickbait thumbnails and subsequent challenges in creating high-quality labeled samples, the industry has not coped with clickbait thumbnails adequately. To address this challenge, CHECKER shares a novel clickbait thumbnail dataset and codebase with the industry, and exploits: (1) the weak supervision framework to generate many noisy-but-useful labels, and (2) the co-teaching framework to learn robustly using such noisy labels.Moreover, we also investigate how to detect clickbaits on video-sharing platforms with both thumbnails and titles, and exploit recent advances in vision-language models. In the empirical validation, CHECKER outperforms five baselines by at least 6.4% in F1-score and 4.2% in AUC-ROC. The codebase and dataset from our paper are available at: https://github.com/XPandora/CHECKER.

[1] Lanyu Shang,et al. Towards Reliable Online Clickbait Video Detection: A Content-Agnostic Approach , 2019, Knowl. Based Syst..

[2] Matthieu Cord,et al. MUTAN: Multimodal Tucker Fusion for Visual Question Answering , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[3] Zhou Yu,et al. Beyond Bilinear: Generalized Multimodal Factorized High-Order Pooling for Visual Question Answering , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[4] Rami Puzis,et al. Detecting Clickbait in Online Social Media: You Won't Believe How We Did It , 2017, ArXiv.

[5] Yimin Chen,et al. Misleading Online Content: Recognizing Clickbait as "False News" , 2015, WMDD@ICMI.

[6] Christopher R'e,et al. Fast and Three-rious: Speeding Up Weak Supervision with Triplet Methods , 2020, ICML.

[7] Prakhar Biyani,et al. "8 Amazing Secrets for Getting More Clicks": Detecting Clickbaits in News Streams Using Article Informality , 2016, AAAI.

[8] Naeemul Hassan,et al. Diving Deep into Clickbaits: Who Use Them to What Extents in Which Topics with What Effects? , 2017, ASONAM.

[9] Christopher Ré,et al. Snorkel: Rapid Training Data Creation with Weak Supervision , 2017, Proc. VLDB Endow..

[10] Ahmed El Kholy,et al. UNITER: Learning UNiversal Image-TExt Representations , 2019, ECCV 2020.

[11] Peng Xu,et al. Clickbait? Sensational Headline Generation with Auto-tuned Reinforcement Learning , 2019, EMNLP.

[12] Christopher Ré,et al. Snorkel: Fast Training Set Generation for Information Extraction , 2017, SIGMOD Conference.

[13] S. Shyam Sundar,et al. Does Clickbait Actually Attract More Clicks? Three Clickbait Studies You Must Read , 2021, CHI.

[14] Sen Wu,et al. Train and You'll Miss It: Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings , 2020, ArXiv.

[15] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16] Cho-Jui Hsieh,et al. VisualBERT: A Simple and Performant Baseline for Vision and Language , 2019, ArXiv.

[17] S. Shyam Sundar,et al. 5 Sources of Clickbaits You Should Know! Using Synthetic Clickbaits to Improve Prediction and Distinguish between Bot-Generated and Human-Written Headlines , 2019, 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[18] Mohit Bansal,et al. LXMERT: Learning Cross-Modality Encoder Representations from Transformers , 2019, EMNLP.

[19] Xingrui Yu,et al. Co-teaching: Robust training of deep neural networks with extremely noisy labels , 2018, NeurIPS.

[20] Savvas Zannettou,et al. The Good, the Bad and the Bait: Detecting and Characterizing Clickbait on YouTube , 2018, 2018 IEEE Security and Privacy Workshops (SPW).

[21] Matthieu Cord,et al. BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection , 2019, AAAI.

[22] Niloy Ganguly,et al. Stop Clickbait: Detecting and preventing clickbaits in online news media , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[23] Amol Agrawal,et al. Clickbait detection using deep learning , 2016, 2016 2nd International Conference on Next Generation Computing Technologies (NGCT).

[24] Yu Cheng,et al. UNITER: UNiversal Image-TExt Representation Learning , 2019, ECCV.

[25] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[26] Yoshua Bengio,et al. A Closer Look at Memorization in Deep Networks , 2017, ICML.

[27] Huan Liu,et al. Deep Headline Generation for Clickbait Detection , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[28] Trevor Darrell,et al. Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding , 2016, EMNLP.

[29] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30] Martin Potthast,et al. Towards Crowdsourcing Clickbait Labels for YouTube Videos , 2018, HCOMP.