Many activities of editors in Wikipedia can be traced using its database dumps, which register detailed information about every single change to every article. Several researchers have used this information to gain knowledge about the production process of articles, and about activity patterns of authors. In this analysis, we have focused on one of those previous works, by Kittur et al. First, we have followed the same methodology with more recent and comprehensive data. Then, we have extended this methodology to precisely identify which fraction of authors are producing most of the changes in Wikipedia’s articles, and how the behaviour of these authors evolves over time. This enabled us not only to validate some of the previous results, but also to find new interesting evidences. We have found that the analysis of sysops is not a good method for estimating different levels of contributions, since it is dependent on the policy for electing them (which changes over time and for each language). Moreover,we have found new activity patterns classifying authors by their contributions during specific periods of time, instead of using their total number of contributions over the whole life of Wikipedia. Finally, we present a tool that automates this extended methodology, implementing a quick and complete quantitative analysis of every language edition in Wikipedia.
[1]
Martin Wattenberg,et al.
Proceedings of the 40th Hawaii International Conference on System Sciences- 2007 Talk Before You Type: Coordination in Wikipedia
,
2022
.
[2]
J. Voß.
Measuring Wikipedia
,
2005
.
[3]
Martin Wattenberg,et al.
Studying cooperation and conflict between authors with history flow visualizations
,
2004,
CHI.
[4]
Bryan A. Pendleton,et al.
Power of the Few vs. Wisdom of the Crowd: Wikipedia and the Rise of the Bourgeoisie
,
2006
.
[5]
Luciana S. Buriol,et al.
Temporal Analysis of the Wikigraph
,
2006,
2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).