Unveiling polarization in social networks: A matrix factorization approach

This paper presents unsupervised algorithms to uncover polarization in social networks (namely, Twitter) and identify polarized groups. The approach is language-agnostic and thus broadly applicable to global and multilingual media. In cases of conflict, dispute, or situations involving multiple parties with contrasting interests, opinions get divided into different camps. Previous manual inspection of tweets has shown that such situations produce distinguishable signatures on Twitter, as people take sides leading to clusters that preferentially propagate information confirming their individual cluster-specific bias. We propose a model for polarized social networks, and show that approaches based on factorizing the matrix of sources and their claims can automate the discovery of polarized clusters with no need for prior training or natural language processing. In turn, identifying such clusters offers insights into prevalent social conflicts and helps automate the generation of less biased descriptions of ongoing events. We evaluate our factorization algorithms and their results on multiple Twitter datasets involving polarization of opinions, demonstrating the efficacy of our approach. Experiments show that our method is almost always correct in identifying the polarized information from real-world twitter traces, and outperforms the baseline mechanisms by a large margin.

[1]  Md. Yusuf Sarwar Uddin,et al.  On diversifying source selection in social sensing , 2012, 2012 Ninth International Conference on Networked Sensing (INSS).

[2]  Noah E. Friedkin,et al.  A Structural Theory of Social Influence: List of Tables and Figures , 1998 .

[3]  Tarek Abdelzaher,et al.  Exploiting Social Media for Army Operations: Syrian Civil War Use Case , 2014 .

[4]  Lada A. Adamic,et al.  Exposure to ideologically diverse news and opinion on Facebook , 2015, Science.

[5]  Alex M. Andrew,et al.  Boosting: Foundations and Algorithms , 2012 .

[6]  George Karypis,et al.  A Comparison of Document Clustering Techniques , 2000 .

[7]  Charu C. Aggarwal,et al.  Recursive Ground Truth Estimator for Social Data Streams , 2016, 2016 15th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN).

[8]  Jure Leskovec,et al.  Inferring networks of diffusion and influence , 2010, KDD.

[9]  Sujay Sanghavi,et al.  Learning the graph of epidemic cascades , 2012, SIGMETRICS '12.

[10]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[11]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[12]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[13]  Claire Cardie,et al.  A Measure of Polarization on Social Media Networks Based on Community Boundaries , 2013, ICWSM.

[14]  Tarek F. Abdelzaher,et al.  Finding true and credible information on Twitter , 2014, 17th International Conference on Information Fusion (FUSION).

[15]  Charu C. Aggarwal,et al.  Community Detection with Edge Content in Social Media Networks , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[16]  Saif Mohammad,et al.  Sentiment Analysis of Short Informal Texts , 2014, J. Artif. Intell. Res..

[17]  Dan Roth,et al.  Provenance-Assisted Classification in Social Networks , 2014, IEEE Journal of Selected Topics in Signal Processing.

[18]  Indranil Gupta,et al.  Social Trove: A Self-Summarizing Storage Service for Social Sensing , 2015, 2015 IEEE International Conference on Autonomic Computing.

[19]  Boleslaw K. Szymanski,et al.  Crowd-Sensing with Polarized Sources , 2014, 2014 IEEE International Conference on Distributed Computing in Sensor Systems.

[20]  Clement Levallois,et al.  Umigon: sentiment analysis for tweets based on terms lists and heuristics , 2013, *SEMEVAL.

[21]  Jacob Ratkiewicz,et al.  Political Polarization on Twitter , 2011, ICWSM.

[22]  Charu C. Aggarwal,et al.  Online community detection in social sensing , 2013, WSDM.

[23]  Dirk Van,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[24]  Yizhou Sun,et al.  Trust analysis with clustering , 2011, WWW.

[25]  Charu C. Aggarwal,et al.  Using humans as sensors: An estimation-theoretic perspective , 2014, IPSN-14 Proceedings of the 13th International Symposium on Information Processing in Sensor Networks.

[26]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[27]  Sung-Hyuk Cha Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions , 2007 .

[28]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[29]  Divesh Srivastava,et al.  Truth Discovery and Copying Detection in a Dynamic World , 2009, Proc. VLDB Endow..

[30]  Jure Leskovec,et al.  On the Convexity of Latent Social Network Inference , 2010, NIPS.