Analysing the Usage of Wikipedia on Twitter: Understanding Inter-Language Links

Wikipedia is a central source of information: 450 million people consult the online encyclopaedia every month to satisfy their information needs. Some of these users also refer to Wikipedia in their tweets. In this paper, we analyse links in tweets that refer to a Wikipedia edition in a language different from the tweet's language. We investigate the causes for the usage of such inter-language links by comparing the tweeted article with its counterpart in the tweet's language (if one exists) in terms of article quality. We find that the main cause for inter-language links is the non-existence of the article in the tweet's language. Furthermore, we observe that the quality of the tweeted articles is consistently higher than that of their counterparts, suggesting that users choose the article of higher quality even when tweeting in another language. Moreover, we find that English is the dominant target for inter-language links.
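
The notion of an inter-language link can be illustrated with a minimal Python sketch. It is not the procedure used in the paper. It assumes that the tweet's language is already available as a short language code and that the Wikipedia edition can be read from the subdomain of the link (for example `en.wikipedia.org`). The function names `wikipedia_language` and `is_inter_language_link` are hypothetical and chosen only for this illustration.

```python
from typing import Optional
from urllib.parse import urlparse


def wikipedia_language(url: str) -> Optional[str]:
    """Return the language code of a Wikipedia URL, or None if it is not one."""
    host = (urlparse(url).hostname or "").lower()
    parts = host.split(".")
    # Assumed host shapes: "en.wikipedia.org" or "de.m.wikipedia.org" (mobile).
    if len(parts) < 3 or parts[-2:] != ["wikipedia", "org"]:
        return None
    lang = parts[0]
    # "www.wikipedia.org" and "m.wikipedia.org" carry no language edition.
    if lang in ("www", "m"):
        return None
    return lang


def is_inter_language_link(tweet_lang: str, url: str) -> bool:
    """True if the URL points to a Wikipedia edition other than the tweet's language."""
    link_lang = wikipedia_language(url)
    return link_lang is not None and link_lang != tweet_lang.lower()


if __name__ == "__main__":
    # A German tweet linking to the English Wikipedia is an inter-language link.
    print(is_inter_language_link("de", "https://en.wikipedia.org/wiki/Twitter"))
    # A German tweet linking to the German Wikipedia is not.
    print(is_inter_language_link("de", "https://de.wikipedia.org/wiki/Twitter"))
```

Under these assumptions, whether a counterpart article exists in the tweet's language and how the two articles compare in quality would still have to be determined separately.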
