Quantifying Search Bias: Investigating Sources of Bias for Political Searches in Social Media

Search systems on online social media sites are frequently used to find information about ongoing events and people. For topics with multiple competing perspectives, such as political events or political candidates, bias in the top-ranked results can significantly shape public opinion. However, bias does not emerge from an algorithm alone. It is important to distinguish between the bias that arises from the data that serves as the input to the ranking system and the bias that arises from the ranking system itself. In this paper, we propose a framework to quantify these distinct biases and apply this framework to politics-related queries on Twitter. We found that both the input data and the ranking system contribute significantly to the bias in the search results, in varying amounts and in different ways. We discuss the consequences of these biases and possible mechanisms to signal this bias in social media search systems' interfaces.
