Balancing diversity to counter-measure geographical centralization in microblogging platforms

We study whether geographical centralization is reflected in the virtual population of microblogging platforms. A consequence of centralization is the decreased visibility and findability of content from less central locations.We propose to counteract geographical centralization in microblogging timelines by promoting geographical diversity through: 1) a characterization of imbalance in location interaction centralization over a graph of geographical interactions from user generated content; 2) geolocation of microposts using imbalance-aware content features in text classifiers, and evaluation of those classifiers according to their diversity and accuracy; 3) definition of a two-step information filtering algorithm to ensure diversity in summary timelines of events. We study our proposal through an analysis of a dataset of Twitter in Chile, in the context of the 2012 municipal political elections.

[1]  Sean M. McNee,et al.  Being accurate is not enough: how accuracy metrics have hurt recommender systems , 2006, CHI Extended Abstracts.

[2]  Daniele Quercia,et al.  Cultural Dimensions in Twitter: Time, Individualism and Power , 2013, ICWSM.

[3]  Michael S. Bernstein,et al.  Short and tweet: experiments on recommending content from information streams , 2010, CHI.

[4]  Mary Czerwinski,et al.  Identifying relevant social media content: leveraging information diversity and user cognition , 2011, HT '11.

[5]  L. Jost Entropy and diversity , 2006 .

[6]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[7]  Ricardo Baeza-Yates,et al.  Modern Information Retrieval - the concepts and technology behind search, Second edition , 2011 .

[8]  Kalina Bontcheva,et al.  Where's @wally?: a classification approach to geolocating users based on their social ties , 2013, HT.

[9]  Ryan M. Rifkin,et al.  In Defense of One-Vs-All Classification , 2004, J. Mach. Learn. Res..

[10]  Sean A. Munson,et al.  Sidelines: An Algorithm for Increasing Diversity in News and Opinion Aggregators , 2009, ICWSM.

[11]  Daniele Quercia,et al.  The Social World of Twitter: Topics, Geography, and Emotions , 2012, ICWSM.

[12]  Krishna P. Gummadi,et al.  Geographic Dissection of the Twitter Network , 2012, ICWSM.

[13]  Barbara Poblete,et al.  #Santiago is not #Chile, or is it?: a model to normalize social media impact , 2013, ChileCHI '13.

[14]  Susan T. Dumais,et al.  Characterizing Microblogs with Topic Models , 2010, ICWSM.

[15]  Ricardo Baeza-Yates,et al.  The Geographical Life of Search , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[16]  Mark E. J. Newman A measure of betweenness centrality based on random walks , 2005, Soc. Networks.

[17]  Danah Boyd,et al.  Tweeting from the Town Square: Measuring Geographic Local Networks , 2010, ICWSM.

[18]  Ed H. Chi,et al.  Tweets from Justin Bieber's heart: the dynamics of the location field in user profiles , 2011, CHI.

[19]  Yong Yu,et al.  Collaborative personalized tweet recommendation , 2012, SIGIR '12.

[20]  Kyumin Lee,et al.  You are where you tweet: a content-based approach to geo-locating twitter users , 2010, CIKM.

[21]  Marti A. Hearst Chapter 2 of the second edition of Modern Information Retrieval Renamed Modern Information Retrieval : The Concepts and Technology behind Search , 2011 .

[22]  Sune Lehmann,et al.  Understanding the Demographics of Twitter Users , 2011, ICWSM.

[23]  Barbara Poblete,et al.  Do all birds tweet the same?: characterizing twitter around the world , 2011, CIKM '11.