Value Production in a Collaborative Environment

We review some recent endeavors and add some new results to characterize and understand the underlying mechanisms of Wikipedia (WP), the paradigmatic example of collaborative value production. We analyzed the statistics of editorial activity in different languages and observed typical circadian and weekly patterns, which enabled us to estimate the geographical origins of contributions to WPs in languages spoken in several time zones. Using a recently introduced measure, we showed that editorial activity exhibits intrinsic dependencies in the burstiness of events. A comparison of the English and Simple English WPs revealed important aspects of language complexity and showed how peer cooperation solved the task of enhancing readability. One of our focus issues was characterizing conflicts, or edit wars, in WPs, which helped us to automatically filter out controversial pages. When studying the temporal evolution of the controversiality of such pages, we identified typical patterns and classified conflicts accordingly. Our quantitative analysis provides a basis for modeling conflicts and their resolution in collaborative environments and contributes to the understanding of this issue, which becomes increasingly important with the development of information and communication technology.
