Analyzing and characterizing political discussions in WhatsApp public groups

We present a thorough characterization of what we believe to be the first significant analysis of the behavior of groups in WhatsApp in the scientific literature. Our characterization of over 270,000 messages and about 7,000 users spanning a 28-day period is done at three different layers. The message layer focuses on individual messages, each of which is the result of specific posts performed by a user. The user layer characterizes the user actions while interacting with a group. The group layer characterizes the aggregate message patterns of all users that participate in a group. We analyze 81 public groups in WhatsApp and classify them into two categories, political and non-political groups according to keywords associated with each group. Our contributions are two-fold. First, we introduce a framework and a number of metrics to characterize the behavior of communication groups in mobile messaging systems such as WhatsApp. Second, our analysis underscores a Zipf-like profile for user messages in political groups. Also, our analysis reveals that Whatsapp messages are multimedia, with a combination of different forms of content. Multimedia content (i.e., audio, image, and video) and emojis are present in 20% and 11.2% of all messages respectively. Political groups use more text messages than non-political groups. Second, we characterize novel features that represent the behavior of a public group, with multiple conversational turns between key members, with the participation of other members of the group.

[1]  Gaogang Xie,et al.  User Behavior Characterization of a Large-scale Mobile Live Streaming System , 2015, WWW.

[2]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[3]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[4]  Virgílio A. F. Almeida,et al.  A hierarchical characterization of a live streaming media workload , 2006, TNET.

[5]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[6]  Phuoc Tran-Gia,et al.  Group-based communication in WhatsApp , 2016, 2016 IFIP Networking Conference (IFIP Networking) and Workshops.

[7]  Jie Tang,et al.  Who to Invite Next? Predicting Invitees of Social Groups , 2017, IJCAI.

[8]  Jun Chen,et al.  Who Is Answering to Whom? Finding “Reply-To” Relations in Group Chats with Long Short-Term Memory Networks , 2018 .

[9]  Anja Feldmann,et al.  An analysis of Internet chat systems , 2003, IMC '03.

[10]  Jiangchuan Liu,et al.  Understanding the Characteristics of Internet Short Video Sharing: YouTube as a Case Study , 2007, ArXiv.

[11]  Patrick P. C. Lee,et al.  Fine-grained dissection of WeChat in cellular networks , 2015, 2015 IEEE 23rd International Symposium on Quality of Service (IWQoS).

[12]  Jörg Schwenk,et al.  More is Less: How Group Chats Weaken the Security of Instant Messengers Signal, WhatsApp, and Threema , 2017, IACR Cryptol. ePrint Arch..

[13]  Bin Chen,et al.  Patterns of cascading behavior in WeChat moments , 2017, 2017 4th International Conference on Information, Cybernetics and Computational Social Systems (ICCSS).

[14]  Aditya Mahajan,et al.  Forensic Analysis of Instant Messenger Applications on Android Devices , 2013, ArXiv.

[15]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[16]  Rodrigo de Oliveira,et al.  What's up with whatsapp?: comparing mobile instant messaging behaviors with traditional SMS , 2013, MobileHCI '13.

[17]  Karthik Bhat,et al.  WhatsApp for Monitoring and Response during Critical Events: Aggie in the Ghana 2016 Election , 2017, ISCRAM.

[18]  Virgílio A. F. Almeida,et al.  Characterizing user behavior in online social networks , 2009, IMC '09.