Not at Home on the Range: Peer Production and the Urban/Rural Divide

Wikipedia articles about places, OpenStreetMap features, and other forms of peer-produced content have become critical sources of geographic knowledge for humans and intelligent technologies. In this paper, we explore the effectiveness of the peer production model across the rural/urban divide, a divide that has been shown to be an important factor in many online social systems. We find that in both Wikipedia and OpenStreetMap, peer-produced content about rural areas is of systematically lower quality, is less likely to have been produced by contributors who focus on the local area, and is more likely to have been generated by automated software agents (i.e., "bots"). We then codify the systemic challenges inherent to characterizing rural phenomena through peer production and discuss potential solutions.


[33]  Aaron Halfaker,et al.  A jury of your peers: quality, experience and ownership in Wikipedia , 2009, Int. Sym. Wikis.

[34]  Michael F. Goodchild,et al.  Volunteered geographic information production as a spatial process , 2012, Int. J. Geogr. Inf. Sci..

[35]  R. Stuart Geiger,et al.  Bots, bespoke, code and the materiality of software platforms , 2014 .

[36]  Panayiotis Zaphiris,et al.  Cultural Differences in Collaborative Authoring of Wikipedia , 2006, J. Comput. Mediat. Commun..

[37]  Takahiro Hara,et al.  An Approach for Extracting Bilingual Terminology from Wikipedia , 2008, DASFAA.

[38]  Monica Stephens Gender and the GeoWeb: divisions in the production of user-generated cartographic information , 2013, GeoJournal.

[39]  M. Goodchild,et al.  Researching Volunteered Geographic Information: Spatial Data, Geographic Research, and New Social Practice , 2012 .

[40]  R. Bivand Spatial Dependence: Weighting Schemes, Statistics and Models , 2015 .

[41]  David García,et al.  It's a Man's Wikipedia? Assessing Gender Inequality in an Online Encyclopedia , 2015, ICWSM.

[42]  Darren Gergle,et al.  Measuring self-focus bias in community-maintained knowledge repositories , 2009, C&T.

[43]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[44]  Darren Gergle,et al.  On the "localness" of user-generated content , 2010, CSCW '10.

[45]  Evgeniy Gabrilovich,et al.  Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge , 2006, AAAI.

[46]  Aniket Kittur,et al.  Harnessing the wisdom of crowds in wikipedia: quality through coordination , 2008, CSCW.

[47]  Mark S. Ackerman,et al.  Culture Matters: A Survey Study of Social Q&A Behavior , 2011, ICWSM.

[48]  Barbara A. Kimmelman :Consumers in the Country: Technology and Social Change in Rural America , 2005 .

[49]  Eric Gilbert,et al.  The network in the garden: an empirical analysis of social media in rural life , 2008, CHI.

[50]  Ian H. Witten,et al.  An effective, low-cost measure of semantic relatedness obtained from Wikipedia links , 2008 .

[51]  Aaron D. Shaw,et al.  Apples to oranges?: comparing across studies of open collaboration/peer production , 2011, Int. Sym. Wikis.

[52]  A. Zipf,et al.  A Comparative Study of Proprietary Geodata and Volunteered Geographic Information for Germany , 2010 .

[53]  Barry Wellman,et al.  Small Town in the Internet Society: Chapleau Is No Longer an Island , 2010 .

[54]  John Riedl,et al.  WP:clubhouse?: an exploration of Wikipedia's gender imbalance , 2011, Int. Sym. Wikis.

[55]  Les Gasser,et al.  Information quality work organization in wikipedia , 2008, J. Assoc. Inf. Sci. Technol..

[56]  Eric S. Raymond,et al.  Cathedral & the Bazaar: Musings on Linux and Open Source by an Accidental Revolutionary , 2001 .

[57]  Ben Kei Daniel,et al.  Handbook of research on methods and techniques for studying virtual communities : paradigms and phenomena , 2011 .

[58]  Vyron Antoniou,et al.  How Many Volunteers Does it Take to Map an Area Well? The Validity of Linus’ Law to Volunteered Geographic Information , 2010 .

[59]  Eric S. Raymond,et al.  The cathedral and the bazaar - musings on Linux and Open Source by an accidental revolutionary , 2001 .

[60]  Andrew Lih,et al.  The Wikipedia revolution : how a bunch of nobodies created the world's greatest encyclopedia , 2009 .

[61]  Giovanni Quattrone,et al.  Putting ubiquitous crowd-sourcing into context , 2013, CSCW '13.

[62]  Giovanni Quattrone,et al.  Modelling growth of urban crowd-sourced information , 2014, WSDM.

[63]  D. D. Ingram,et al.  NCHS urban-rural classification scheme for counties. , 2012, Vital and health statistics. Series 2, Data evaluation and methods research.

[64]  Jed R. Brubaker,et al.  'Is' to 'Was': Coordination and Commemoration in Posthumous Activity on Wikipedia Biographies , 2015, CSCW.

[65]  Ian H. Witten,et al.  Mining Meaning from Wikipedia , 2008, Int. J. Hum. Comput. Stud..

[66]  Terry Sicular,et al.  The Urban-Rural Income Gap and Inequality in China , 2007 .

[67]  Mark Graham,et al.  Digital Divisions of Labor and Informational Magnetism: Mapping Participation in Wikipedia , 2015 .