Cashtag Piggybacking

Microblogs are increasingly exploited for predicting prices and traded volumes of stocks in financial markets. However, it has been demonstrated that much of the content shared in microblogging platforms is created and publicized by bots and spammers. Yet, the presence (or lack thereof) and the impact of fake stock microblogs has never been systematically investigated before. Here, we study 9M tweets related to stocks of the five main financial markets in the US. By comparing tweets with financial data from Google Finance, we highlight important characteristics of Twitter stock microblogs. More importantly, we uncover a malicious practice—referred to as cashtag piggybacking—perpetrated by coordinated groups of bots and likely aimed at promoting low-value stocks by exploiting the popularity of high-value ones. Among the findings of our study is that as much as 71% of the authors of suspicious financial tweets are classified as bots by a state-of-the-art spambot-detection algorithm. Furthermore, 37% of them were suspended by Twitter a few months after our investigation. Our results call for the adoption of spam- and bot-detection techniques in all studies and applications that exploit user-generated content for predicting the stock market.

[1]  Igor Mozetic,et al.  Twitter sentiment around the Earnings Announcement events , 2016, PloS one.

[2]  Christopher M. Danforth,et al.  Vaporous Marketing: Uncovering Pervasive Electronic Cigarette Advertisements on Twitter , 2015, PloS one.

[3]  Ronen Feldman,et al.  Techniques and applications for sentiment analysis , 2013, CACM.

[4]  Emilio Ferrara,et al.  Disinformation and Social Bot Operations in the Run Up to the 2017 French Presidential Election , 2017, First Monday.

[5]  Maurizio Tesconi,et al.  Nowcasting of Earthquake Consequences Using Big Social Data , 2017, IEEE Internet Computing.

[6]  Philip S. Yu,et al.  Detecting deception in Online Social Networks , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[7]  Roberto Di Pietro,et al.  Exploiting Digital DNA for the Analysis of Similarities in Twitter Behaviours , 2017, 2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[8]  Emilio Ferrara,et al.  Social Bots Distort the 2016 US Presidential Election Online Discussion , 2016, First Monday.

[9]  Filippo Menczer,et al.  Hoaxy: A Platform for Tracking Online Misinformation , 2016, WWW.

[10]  David Wetherall,et al.  Computer networks, 5th Edition , 2011 .

[11]  Hossein Hamooni,et al.  On-Demand Bot Detection and Archival System , 2017, WWW.

[12]  Christos Faloutsos,et al.  Inferring Strange Behavior from Connectivity Pattern in Social Networks , 2014, PAKDD.

[13]  Chao Yang,et al.  Empirical Evaluation and New Design for Fighting Evolving Twitter Spammers , 2013, IEEE Trans. Inf. Forensics Secur..

[14]  Ben Y. Zhao,et al.  Echo Chambers in Investment Discussion Boards , 2017, ICWSM.

[15]  Lu Hong,et al.  Groups of diverse problem solvers can outperform groups of high-ability problem solvers. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[16]  André Freitas,et al.  SemEval-2017 Task 5: Fine-Grained Sentiment Analysis on Financial Microblogs and News , 2017, *SEMEVAL.

[17]  Fangzhao Wu,et al.  Social Spammer and Spam Message Co-Detection in Microblogging with Social Context Regularization , 2015, CIKM.

[18]  Rose Yu,et al.  GLAD: group anomaly detection in social media analysis , 2014, ACM Trans. Knowl. Discov. Data.

[19]  Chao Yang,et al.  Empirical Evaluation and New Design for Fighting Evolving Twitter Spammers , 2011, IEEE Transactions on Information Forensics and Security.

[20]  Cazzoli Lorenzo,et al.  A Large Scale Study to Understand the Relation between Twitter and Financial Market , 2016 .

[21]  S. Pokharel Wisdom of Crowds: The Value of Stock Opinions Transmitted through Social Media , 2014 .

[22]  Gang Wang,et al.  Value and Misinformation in Collaborative Investing Platforms , 2017, ACM Trans. Web.

[23]  Christos Faloutsos,et al.  Retweeting Activity on Twitter: Signs of Deception , 2015, PAKDD.

[24]  Christos Faloutsos,et al.  Inferring lockstep behavior from connectivity pattern in large graphs , 2016, Knowledge and Information Systems.

[25]  Max Nanis,et al.  Socialbots: voices from the fronts , 2012, INTR.

[26]  Emilio Ferrara,et al.  Manipulation and Abuse on Social Media , 2015, ArXiv.

[27]  Roberto Di Pietro,et al.  Fame for sale: Efficient detection of fake Twitter followers , 2015, Decis. Support Syst..

[28]  Miles Osborne,et al.  Ranking Financial Tweets , 2016, SIGIR.

[29]  Fabrizio Lillo,et al.  $FAKE: Evidence of Spam and Bot Activity in Stock Microblogs on Twitter , 2018, ICWSM.

[30]  Jong Kim,et al.  Early filtering of ephemeral malicious accounts on Twitter , 2014, Comput. Commun..

[31]  Christos Faloutsos,et al.  oddball: Spotting Anomalies in Weighted Graphs , 2010, PAKDD.

[32]  Omar Alonso,et al.  Kondenzer: Exploration and visualization of archived social media , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[33]  Wei Wei,et al.  Correlating S&P 500 stocks with Twitter data , 2012, HotSocial '12.

[34]  Rossano Schifanella,et al.  People Are Strange When You're a Stranger: Impact and Influence of Bots on Social Networks , 2012, ICWSM.

[35]  Martin Hentschel,et al.  Follow the money: A study of cashtags on Twitter , 2014, First Monday.

[36]  Jie Jennifer Zhang,et al.  How Do Consumer Buzz and Traffic in Social Media Marketing Predict the Value of the Firm? , 2013, J. Manag. Inf. Syst..

[37]  Christos Faloutsos,et al.  Spotting Suspicious Behaviors in Multimodal Data: A General Metric and Algorithms , 2016, IEEE Transactions on Knowledge and Data Engineering.

[38]  Filippo Menczer,et al.  The rise of social bots , 2014, Commun. ACM.

[39]  Cameron D. Norman,et al.  Vape, quit, tweet? Electronic cigarettes and smoking cessation on Twitter , 2016, International Journal of Public Health.

[40]  Dan Mercea,et al.  The Brexit Botnet and User-Generated Hyperpartisan News , 2017 .

[41]  Jon Crowcroft,et al.  Do Bots impact Twitter activity? , 2017, WWW.

[42]  Jacob Ratkiewicz,et al.  Detecting and Tracking Political Abuse in Social Media , 2011, ICWSM.

[43]  Aristides Gionis,et al.  Correlating financial time series with micro-blogging activity , 2012, WSDM '12.

[44]  Krishna P. Gummadi,et al.  Strength in Numbers: Robust Tamper Detection in Crowd Computations , 2015, COSN.

[45]  Eric Gilbert,et al.  Widespread Worry and the Stock Market , 2010, ICWSM.

[46]  Stefan Evert,et al.  Japan's 2014 General Election: Political Bots, Right-Wing Internet Activism, and Prime Minister Shinzō Abe's Hidden Nationalist Agenda , 2017, Big Data.

[47]  Xianchao Zhang,et al.  Detecting Spam and Promoting Campaigns in Twitter , 2016, ACM Trans. Web.

[48]  Kai Kupferschmidt Bot-hunters eye mischief in German election. , 2017, Science.

[49]  Krishna P. Gummadi,et al.  Understanding and combating link farming in the twitter social network , 2012, WWW.

[50]  Roberto Di Pietro,et al.  DNA-Inspired Online Behavioral Modeling and Its Application to Spambot Detection , 2016, IEEE Intell. Syst..

[51]  Emilio Ferrara,et al.  "Manipulation and abuse on social media" by Emilio Ferrara with Ching-man Au Yeung as coordinator , 2015, SIGWEB Newsl..

[52]  Hossein Hamooni,et al.  Identifying Correlated Bots in Twitter , 2016, SocInfo.

[53]  Panagiotis Takis Metaxas,et al.  FINAL VERSION DO NOT MAKE MORE CHANGES The Power of Prediction with Social Media , 2013 .

[54]  David A. Broniatowski,et al.  Weaponized Health Communication: Twitter Bots and Russian Trolls Amplify the Vaccine Debate , 2018, American journal of public health.

[55]  Yang Yu,et al.  The impact of social and conventional media on firm equity value: A sentiment analysis approach , 2013, Decis. Support Syst..

[56]  Tomaso Aste,et al.  When Can Social Media Lead Financial Markets? , 2014, Scientific Reports.

[57]  Lisa Gandy,et al.  CashTagNN: Using sentiment of tweets with CashTags to predict stock market prices , 2016, 2016 11th International Conference on Intelligent Systems: Theories and Applications (SITA).

[58]  Angelo Spognardi,et al.  On the capability of evolved spambots to evade detection via genetic engineering , 2019, Online Soc. Networks Media.

[59]  Jie Jennifer Zhang,et al.  Social Media and Firm Equity Value , 2013, Inf. Syst. Res..

[60]  Albert-László Barabási,et al.  Universal features of correlated bursty behaviour , 2011, Scientific Reports.

[61]  Filippo Menczer,et al.  Detection of Promoted Social Media Campaigns , 2016, ICWSM.

[62]  Filippo Menczer,et al.  Online Human-Bot Interactions: Detection, Estimation, and Characterization , 2017, ICWSM.

[63]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[64]  Kalina Bontcheva,et al.  Overview of the Special Issue on Trust and Veracity of Information in Social Media , 2016, TOIS.

[65]  Venkatesan Guruswami,et al.  CopyCatch: stopping group attacks by spotting lockstep behavior in social networks , 2013, WWW.

[66]  Fabrizio Lillo,et al.  A Large Scale Study to Understand the Relation between Twitter and Financial Market , 2016, 2016 Third European Network Intelligence Conference (ENIC).

[67]  Nada Lavrac,et al.  Stream-based active learning for sentiment analysis in the financial domain , 2014, Inf. Sci..

[68]  Guido Caldarelli,et al.  S 1 Appendix , 2016 .

[69]  Angelo Spognardi,et al.  From Reaction to Proaction: Unexplored Ways to the Detection of Evolving Spambots , 2018, WWW.

[70]  Thanh Tran,et al.  Uncovering Fake Likers in Online Social Networks , 2016, CIKM.

[71]  Jon-Patrick Allem,et al.  Could Social Bots Pose a Threat to Public Health? , 2018, American journal of public health.

[72]  Timm Oliver Sprenger,et al.  TweetTrader.net: Leveraging Crowd Wisdom in a Stock Microblogging Forum , 2011, ICWSM.

[73]  Hossein Hamooni,et al.  DeBot: Twitter Bot Detection via Warped Correlation , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[74]  Roberto Di Pietro,et al.  The Paradigm-Shift of Social Spambots: Evidence, Theories, and Tools for the Arms Race , 2017, WWW.

[75]  Stefan Stieglitz,et al.  Strategies and Influence of Social Bots in a 2017 German state election - A case study on Twitter , 2017, ACIS.

[76]  Emilio Ferrara Disinformation and Social Bot Operations in the Run Up to the 2017 French Presidential Election , 2017 .

[77]  Feng Xiao,et al.  SybilLimit: A Near-Optimal Social Network Defense against Sybil Attacks , 2008, 2008 IEEE Symposium on Security and Privacy (sp 2008).

[78]  Sameena Shah,et al.  Learning Stock Market Sentiment Lexicon and Sentiment-Oriented Word Vector from StockTwits , 2017, CoNLL.

[79]  Marco Furini,et al.  On using cashtags to predict companies stock trends , 2017, 2017 14th IEEE Annual Consumer Communications & Networking Conference (CCNC).

[80]  Panagiotis Takis Metaxas,et al.  The power of prediction with social media , 2013, Internet Res..

[81]  Christos Faloutsos,et al.  Catching Synchronized Behaviors in Large Networks , 2016, ACM Trans. Knowl. Discov. Data.

[82]  Johan Bollen,et al.  Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena , 2009, ICWSM.

[83]  Roberto Di Pietro,et al.  Social Fingerprinting: Detection of Spambot Groups Through DNA-Inspired Behavioral Modeling , 2017, IEEE Transactions on Dependable and Secure Computing.

[84]  Maurizio Tesconi,et al.  Enriching Digital Libraries with Crowdsensed Data - Twitter Monitor and the SoBigData Ecosystem , 2019, IRCDL.

[85]  Jong Kim,et al.  WarningBird: A Near Real-Time Detection System for Suspicious URLs in Twitter Stream , 2013, IEEE Transactions on Dependable and Secure Computing.

[86]  Mark Coates,et al.  Weblog Analysis for Predicting Correlations in Stock Price Evolutions , 2012, ICWSM.

[87]  Christos Faloutsos,et al.  HoloScope: Topology-and-Spike Aware Fraud Detection , 2017, CIKM.