Paper information - BotPercent: Estimating Twitter Bot Populations from Groups to Crowds

Twitter bot detection has become increasingly important in combating misinformation, identifying malicious online campaigns, and protecting the integrity of social media discourse. While existing bot detection literature mostly focuses on identifying individual bots, it remains underexplored how to estimate the proportion of bots within specific communities and social networks, which has great implications for both content moderators and day-to-day users. In this work, we propose community-level bot detection, a novel approach to estimating the amount of malicious interference in online communities by estimating the percentage of bot accounts. Specifically, we introduce BotPercent, an amalgamation of Twitter bot-detection datasets and feature-, text-, and graph-based models that overcome generalization issues in existing individual-level models, resulting in a more accurate community-level bot estimation. Experiments demonstrate that BotPercent achieves state-of-the-art community-level bot detection performance on the TwiBot-22 benchmark while showing great robustness towards the tampering of specific user features. Armed with BotPercent, we analyze bot rates in different Twitter groups and communities, such as all active Twitter users, users that interact with partisan news media, users that participate in Elon Musk's content moderation votes, and the political communities in different countries and regions. Our experimental results demonstrate that the existence of Twitter bots is not homogeneous, but rather a spatial-temporal distribution whose heterogeneity should be taken into account for content moderation, social media policy making, and more. The BotPercent implementation is available at https://github.com/TamSiuhin/BotPercent
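The abstract describes community-level estimation only at a high level. The sketch below illustrates one plausible reading: average the bot probabilities produced by several per-user detectors, then average over the community to obtain a percentage. It is not the BotPercent implementation. The simple unweighted averaging, the function names, and the three toy detectors (stand-ins for feature-, text-, and graph-based models) are all assumptions made for illustration.

```python
from statistics import mean
from typing import Callable, Dict, Sequence

# Hypothetical types: a user is a dict of numeric signals, and a detector
# maps one user to a bot probability in [0, 1].
User = Dict[str, float]
Detector = Callable[[User], float]


def user_bot_probability(user: User, detectors: Sequence[Detector]) -> float:
    """Unweighted average of the detectors' probabilities (an assumption;
    the abstract does not say how the models are combined)."""
    return mean(detector(user) for detector in detectors)


def community_bot_percentage(users: Sequence[User],
                             detectors: Sequence[Detector]) -> float:
    """Estimated share of bots in a community, as a percentage.

    Averages per-user probabilities instead of counting hard 0/1 labels,
    so uncertain accounts contribute fractionally to the estimate.
    """
    if not users:
        raise ValueError("community must contain at least one user")
    return 100.0 * mean(user_bot_probability(u, detectors) for u in users)


# Toy stand-ins for feature-, text-, and graph-based models (hypothetical).
def feature_detector(user: User) -> float:
    return 0.9 if user["followers"] < 10 else 0.1


def text_detector(user: User) -> float:
    return user["duplicate_tweet_ratio"]


def graph_detector(user: User) -> float:
    return user["bot_neighbor_ratio"]


DETECTORS = [feature_detector, text_detector, graph_detector]

community = [
    {"followers": 5, "duplicate_tweet_ratio": 0.9, "bot_neighbor_ratio": 0.9},
    {"followers": 500, "duplicate_tweet_ratio": 0.1, "bot_neighbor_ratio": 0.1},
]
print(f"{community_bot_percentage(community, DETECTORS):.1f}% estimated bots")
```

Averaging probabilities rather than thresholded labels is a design choice of this sketch: it keeps the community estimate sensitive to how confident each detector is, which matters when individual-level predictions are noisy.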
