Information spreading on social media contributes to the formation of collective opinions. Millions of social media users are exposed every day to popular memes — some generated organically by grassroots activity, others sustained by advertising, information campaigns or more or less transparent coordinated efforts. While most information campaigns are benign, some may have nefarious purposes, including terrorist propaganda, political astroturf, and financial market manipulation. This poses a crucial technological challenge with deep social implications: can we detect whether the spreading of a viral meme is being sustained by a promotional campaign? Here we study trending memes that attract attention either organically, or by means of advertisement. We designed a machine learning framework capable to detect promoted campaigns and separate them from organic ones in their early stages. Using a dataset of millions of posts associated with trending Twitter hashtags, we prove that remarkably accurate early detection is possible, achieving 95% AUC score. Feature selection analysis reveals that network diffusion patterns and content cues are powerful early detection signals.
[1]
Jacob Ratkiewicz,et al.
Detecting and Tracking Political Abuse in Social Media
,
2011,
ICWSM.
[2]
Christopher M. Danforth,et al.
Positivity of the English Language
,
2011,
PloS one.
[3]
Janyce Wiebe,et al.
Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis
,
2005,
HLT.
[4]
Johan Bollen,et al.
Computational Fact Checking from Knowledge Networks
,
2015,
PloS one.
[5]
Owen Rambow,et al.
Sentiment Analysis of Twitter Data
,
2011
.