Search bias quantification: investigating political bias in social media and web search

Users frequently use search systems on the Web as well as online social media to learn about ongoing events and public opinion on personalities. Prior studies have shown that the top-ranked results returned by these search engines can shape user opinion about the topic (e.g., event or person) being searched. In case of polarizing topics like politics, where multiple competing perspectives exist, the political bias in the top search results can play a significant role in shaping public opinion towards (or away from) certain perspectives. Given the considerable impact that search bias can have on the user, we propose a generalizable search bias quantification framework that not only measures the political bias in ranked list output by the search system but also decouples the bias introduced by the different sources—input data and ranking system. We apply our framework to study the political bias in searches related to 2016 US Presidential primaries in Twitter social media search and find that both input data and ranking system matter in determining the final search output bias seen by the users. And finally, we use the framework to compare the relative bias for two popular search systems—Twitter social media search and Google web search—for queries related to politicians and political events. We end by discussing some potential solutions to signal the bias in the search results to make the users more aware of them.

[1]  Danai Koutra,et al.  Events and Controversies: Influences of a Shocking News Event on Information Seeking , 2014, WWW.

[2]  Krishna P. Gummadi,et al.  Quantifying Search Bias: Investigating Sources of Bias for Political Searches in Social Media , 2017, CSCW.

[3]  Ana-Maria Popescu,et al.  Democrats, republicans and starbucks afficionados: user classification in twitter , 2011, KDD.

[4]  Sean A. Munson,et al.  Presenting diverse political opinions: how and how much , 2010, CHI.

[5]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[6]  Craig MacDonald,et al.  Topic-centric Classification of Twitter User's Political Orientation , 2015, FDIA.

[7]  Jati K. Sengupta,et al.  Introduction to Information , 1993 .

[8]  Steve Whittaker,et al.  Dice in the Black Box: User Experiences with an Inscrutable Algorithm , 2018, AAAI Spring Symposia.

[9]  Jacob Ratkiewicz,et al.  Predicting the Political Alignment of Twitter Users , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[10]  Qiaozhu Mei,et al.  Classifying the Political Leaning of News Articles and Users from User Votes , 2011, ICWSM.

[11]  Bryan C. Semaan,et al.  Social media supporting political deliberation across multiple public spheres: towards depolarization , 2014, CSCW.

[12]  S. Gosling,et al.  The Secret Lives of Liberals and Conservatives: Personality Profiles, Interaction Styles, and the Things They Leave Behind , 2008 .

[13]  D. P. Baron,et al.  Persistent Media Bias , 2004 .

[14]  Krishna P. Gummadi,et al.  Purple Feed: Identifying High Consensus News Posts on Social Media , 2018, AIES.

[15]  Tim Groseclose,et al.  A Measure of Media Bias , 2005 .

[16]  Derek L. Hansen,et al.  Computing political preference among twitter followers , 2011, CHI.

[17]  Latanya Sweeney,et al.  Discrimination in online ad delivery , 2013, CACM.

[18]  Kristina Lerman,et al.  The Role of Social Media in the Discussion of Controversial Topics , 2013, 2013 International Conference on Social Computing.

[19]  Karrie Karahalios,et al.  Auditing Algorithms : Research Methods for Detecting Discrimination on Internet Platforms , 2014 .

[20]  Davood Rafiei,et al.  Predicting political preference of Twitter users , 2013, ASONAM.

[21]  C. Allen,et al.  Stanford Encyclopedia of Philosophy , 2011 .

[22]  Krishna P. Gummadi,et al.  Message Impartiality in Social Media Discussions , 2016, ICWSM.

[23]  Aristides Gionis,et al.  Quantifying Controversy on Social Media , 2018, ACM Trans. Soc. Comput..

[24]  Abbe Mowshowitz,et al.  Measuring search engine bias , 2005, Inf. Process. Manag..

[25]  Venkata Rama Kiran Garimella,et al.  Political Hashtag Trends , 2013, ECIR.

[26]  Thorsten Joachims,et al.  In Google We Trust: Users' Decisions on Rank, Position, and Relevance , 2007, J. Comput. Mediat. Commun..

[27]  David Lazer,et al.  Location, Location, Location: The Impact of Geolocation on Web Search Personalization , 2015, Internet Measurement Conference.

[28]  Krishna P. Gummadi,et al.  Media Bias Monitor: Quantifying Biases of Social Media News Outlets at Large-Scale , 2018, ICWSM.

[29]  D. Boyd,et al.  Dynamic Debates: An Analysis of Group Polarization Over Time on Twitter , 2010 .

[30]  Robert M. Bond,et al.  Quantifying Social Media’s Political Space: Estimating Ideology from Publicly Revealed Preferences on Facebook , 2015, American Political Science Review.

[31]  Karrie Karahalios,et al.  First I "like" it, then I hide it: Folk Theories of Social Feeds , 2016, CHI.

[32]  Derek Ruths,et al.  Classifying Political Orientation on Twitter: It's Not Easy! , 2013, ICWSM.

[33]  Huan Liu,et al.  Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose , 2013, ICWSM.

[34]  Krishna P. Gummadi,et al.  Cognos: crowdsourcing search for topic experts in microblogs , 2012, SIGIR '12.

[35]  Aristides Gionis,et al.  A Motif-Based Approach for Identifying Controversy , 2017, ICWSM.

[36]  Wai-Tat Fu,et al.  #Snowden: Understanding Biases Introduced by Behavioral Differences of Opinion Groups on Social Media , 2016, CHI.

[37]  Itai Himelboim,et al.  Birds of a Feather Tweet Together: Integrating Network and Content Analyses to Examine Cross-Ideology Exposure on Twitter , 2013, J. Comput. Mediat. Commun..

[38]  GangulyNiloy,et al.  Inferring who-is-who in the Twitter social network , 2012 .

[39]  Noah A. Smith,et al.  Shedding (a Thousand Points of) Light on Biased Language , 2010, Mturk@HLT-NAACL.

[40]  Christopher Olston,et al.  Search result diversity for informational queries , 2011, WWW.

[41]  Mike Thelwall,et al.  Search engine coverage bias: evidence and possible causes , 2004, Inf. Process. Manag..

[42]  Elizabeth Van Couvering,et al.  Search engine bias : the structuration of traffic on the World-Wide Web , 2010 .

[43]  Mung Chiang,et al.  Quantifying Political Leaning from Tweets, Retweets, and Retweeters , 2016, IEEE Transactions on Knowledge and Data Engineering.

[44]  Ingmar Weber,et al.  Is Twitter a Public Sphere for Online Conflicts? A Cross-Ideological and Cross-Hierarchical Look , 2014, SocInfo.

[45]  Lada A. Adamic,et al.  The political blogosphere and the 2004 U.S. election: divided they blog , 2005, LinkKDD '05.

[46]  Meredith Ringel Morris,et al.  #TwitterSearch: a comparison of microblog search and web search , 2011, WSDM '11.

[47]  Andrew D. Selbst,et al.  Big Data's Disparate Impact , 2016 .

[48]  Ingmar Weber,et al.  Cultural Fault Lines and Political Polarization , 2017, WebSci.

[49]  Wei Niu,et al.  BiasWatch: A Lightweight System for Discovering and Tracking Topic-Sensitive Opinion Bias in Social Media , 2015, CIKM.

[50]  S. Dumais,et al.  Promoting Civil Discourse Through Search Engine Diversity , 2014 .

[51]  Krishna P. Gummadi,et al.  Inferring user interests in the Twitter social network , 2014, RecSys '14.

[52]  Jure Leskovec,et al.  Disinformation on the Web: Impact, Characteristics, and Detection of Wikipedia Hoaxes , 2016, WWW.

[53]  Balachander Krishnamurthy,et al.  Measuring personalization of web search , 2013, WWW.

[54]  Michael Carl Tschantz,et al.  Automated Experiments on Ad Privacy Settings: A Tale of Opacity, Choice, and Discrimination , 2014, ArXiv.

[55]  Sean A. Munson,et al.  Encouraging Reading of Diverse Political Viewpoints with a Browser Widget , 2013, ICWSM.

[56]  Matthew Purver,et al.  Twitter Language Use Reflects Psychological Differences between Democrats and Republicans , 2015, PloS one.

[57]  David Lazer,et al.  Suppressing the Search Engine Manipulation Effect (SEME) , 2017, Proc. ACM Hum. Comput. Interact..

[58]  A Vespignani,et al.  Topical interests and the mitigation of search engine bias , 2006, Proceedings of the National Academy of Sciences.

[59]  Venkata Rama Kiran Garimella,et al.  Mining web query logs to analyze political issues , 2012, WebSci '12.

[60]  Christo Wilson,et al.  Peeking Beneath the Hood of Uber , 2015, Internet Measurement Conference.

[61]  Krishna P. Gummadi,et al.  Deep Twitter diving: exploring topical groups in microblogs at scale , 2014, CSCW.

[62]  Seungwoo Kang,et al.  NewsCube: delivering multiple aspects of news to mitigate media bias , 2009, CHI.

[63]  Justin M. Rao,et al.  Fair and Balanced? Quantifying Media Bias through Crowdsourced Content Analysis , 2016 .

[64]  Krishna P. Gummadi,et al.  Inferring who-is-who in the Twitter social network , 2012, WOSN '12.

[65]  Ronald E. Robertson,et al.  The search engine manipulation effect (SEME) and its possible impact on the outcomes of elections , 2015, Proceedings of the National Academy of Sciences.

[66]  Emine Yilmaz,et al.  Estimating average precision with incomplete and imperfect judgments , 2006, CIKM '06.

[67]  Jacob Ratkiewicz,et al.  Political Polarization on Twitter , 2011, ICWSM.

[68]  Karrie Karahalios,et al.  "Be Careful; Things Can Be Worse than They Appear": Understanding Biased Algorithms and Users' Behavior Around Them in Rating Platforms , 2017, ICWSM.

[69]  David Lazer,et al.  Measuring Price Discrimination and Steering on E-commerce Web Sites , 2014, Internet Measurement Conference.