Unsupervised Belief Representation Learning in Polarized Networks: A Variational Graph Auto-Encoder Approach

This paper develops a novel unsupervised algorithm for belief representation learning in polarized networks that (i) uncovers the latent dimensions of the underlying belief space and (ii) jointly embeds users and content items (that they interact with) into that space in a manner that facilitates a number of downstream tasks, such as stance detection, stance prediction, and ideology mapping. Inspired by total correlation in information theory, we propose a novel Information-Theoretic Variational Graph Auto-Encoder (InfoVGAE) that learns to project both users and content items (e.g., posts that represent user views) into an appropriate disentangled latent space. In order to better disentangle orthogonal latent variables in that space, we develop total correlation regularization, PI control module, and adopt rectified Gaussian Distribution for the latent space. The latent representation of users and content can then be used to quantify their ideological leaning and detect/predict their stances on issues. We evaluate the performance of the proposed InfoVGAE on three real-world datasets, of which two are collected from Twitter and one from U.S. Congress voting records. The evaluation results show that our model outperforms state-of-the-art unsupervised models and produce comparable result with supervised models. We also discuss stance prediction and user ranking within ideological groups. The code and datasets are available at https://github.com/happydeadline/InfoVGAE.

[1]  Preslav Nakov,et al.  Unsupervised User Stance Detection on Twitter , 2019, ICWSM.

[2]  Ambuj K. Singh,et al.  The social media genome: Modeling individual topic-specific behavior in social media , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[3]  Osmar R. Zaïane,et al.  Unsupervised Model for Topic Viewpoint Discovery in Online Debates Leveraging Author Interactions , 2018, ICWSM.

[4]  James R. Foulds,et al.  Joint Models of Disagreement and Stance in Online Debate , 2015, ACL.

[5]  Jan Kautz,et al.  Unsupervised Image-to-Image Translation Networks , 2017, NIPS.

[6]  Stefano Ermon,et al.  InfoVAE: Balancing Learning and Inference in Variational Autoencoders , 2019, AAAI.

[7]  Martin J. Wainwright,et al.  Estimating Divergence Functionals and the Likelihood Ratio by Convex Risk Minimization , 2008, IEEE Transactions on Information Theory.

[8]  E. Giné,et al.  On the Bootstrap of $U$ and $V$ Statistics , 1992 .

[9]  Gyu Sang Choi,et al.  Fake News Stance Detection Using Deep Learning Architecture (CNN-LSTM) , 2020, IEEE Access.

[10]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[11]  Diana Inkpen,et al.  A Dataset for Multi-Target Stance Detection , 2017, EACL.

[12]  Vincent Ng,et al.  Why are You Taking this Stance? Identifying and Classifying Reasons in Ideological Debates , 2014, EMNLP.

[13]  Yizhou Sun,et al.  TIMME: Twitter Ideology-detection via Multi-task Multi-relational Embedding , 2020, KDD.

[14]  Michael Satosi Watanabe,et al.  Information Theoretical Analysis of Multivariate Correlation , 1960, IBM J. Res. Dev..

[15]  Jun Wang,et al.  Controllable and Diverse Text Generation in E-commerce , 2021, WWW.

[16]  James Allan,et al.  Explaining Controversy on Social Media via Stance Summarization , 2018, SIGIR.

[17]  Xueyan Liu,et al.  Interpretable Variational Graph Autoencoder with Noninformative Prior , 2021, Future Internet.

[18]  Kristina Lerman,et al.  Who Falls for Online Political Manipulation? , 2018, WWW.

[19]  Max Welling,et al.  Variational Graph Auto-Encoders , 2016, ArXiv.

[20]  Mucahid Kutlu,et al.  Embeddings-Based Clustering for Target Specific Stances: The Case of a Polarized Turkey , 2020, ArXiv.

[21]  Masaki Aono,et al.  Tweet Stance Detection Using an Attention based Neural Ensemble Model , 2019, NAACL.

[22]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[23]  Masashi Sugiyama,et al.  Density-ratio matching under the Bregman divergence: a unified framework of density-ratio estimation , 2012 .

[24]  Jun Wang,et al.  ControlVAE: Controllable Variational Autoencoder , 2020, ICML.

[25]  Preslav Nakov,et al.  Predicting the Topical Stance and Political Leaning of Media using Tweets , 2020, ACL.

[26]  A. Smeaton,et al.  On Using Twitter to Monitor Political Sentiment and Predict Election Results , 2011 .

[27]  Tarek F. Abdelzaher,et al.  Disentangling Overlapping Beliefs by Structured Matrix Factorization , 2020, ArXiv.

[28]  Aristides Gionis,et al.  Political Discourse on Social Media: Echo Chambers, Gatekeepers, and the Price of Bipartisanship , 2018, WWW.

[29]  Charu C. Aggarwal,et al.  Unveiling polarization in social networks: A matrix factorization approach , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[30]  Rob Brekelmans,et al.  Auto-Encoding Total Correlation Explanation , 2018, AISTATS.

[31]  Timothy Baldwin,et al.  #ISISisNotIslam or #DeportAllMuslims?: predicting unspoken views , 2016, WebSci.

[32]  Pablo Barberá,et al.  Understanding the Political Representativeness of Twitter Users , 2015 .

[33]  Walid Magdy,et al.  Improved Stance Prediction in a User Similarity Feature Space , 2017, ASONAM.

[34]  Nkambou Roger,et al.  Semi-Supervised Multimodal Deep Learning Model for Polarity Detection in Arguments , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[35]  L. Akoglu Quantifying Political Polarity Based on Bipartite Opinion Networks , 2014, ICWSM.

[36]  David Duvenaud,et al.  Isolating Sources of Disentanglement in VAEs , 2018, 1802.04942.

[37]  Wenyu Liu,et al.  Multiple Instance Detection Network with Online Instance Classifier Refinement , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Yizhou Sun,et al.  Mining Heterogeneous Information Networks: Principles and Methodologies , 2012, Mining Heterogeneous Information Networks: Principles and Methodologies.

[39]  Estevam R. Hruschka,et al.  Tweet sentiment analysis with classifier ensembles , 2014, Decis. Support Syst..

[40]  Walid Magdy,et al.  Your Stance is Exposed! Analysing Possible Factors for Stance Detection on Social Media , 2019, Proc. ACM Hum. Comput. Interact..

[41]  Svitlana Volkova,et al.  Inferring User Political Preferences from Streaming Communications , 2014, ACL.

[42]  The new Voteview.com: preserving and continuing Keith Poole’s infrastructure for scholars, students and observers of Congress , 2018 .

[43]  Andriy Mnih,et al.  Disentangling by Factorising , 2018, ICML.

[44]  Emilio Ferrara,et al.  Social media polarization and echo chambers: A case study of COVID-19 , 2021, ArXiv.

[45]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[46]  Nolan McCarty,et al.  Income redistribution and the realignment of American politics , 1997 .

[47]  Max Welling,et al.  Modeling Relational Data with Graph Convolutional Networks , 2017, ESWC.

[48]  Preslav Nakov,et al.  Contrastive Language Adaptation for Cross-Lingual Stance Detection , 2019, EMNLP.