Wikipedia and Medicine: Quantifying Readership, Editors, and the Significance of Natural Language

Background: Wikipedia is a collaboratively edited encyclopedia. One of the most popular websites on the Internet, it is known to be a frequently used source of health care information for both professionals and the lay public.

Objective: This paper quantifies the production and consumption of Wikipedia’s medical content along 4 dimensions. First, we measured the amount of medical content in both articles and bytes and, second, the citations that supported that content. Third, we analyzed medical readership in comparison with that of other health care websites, across Wikipedia’s natural language editions, and in relation to disease prevalence. Fourth, we surveyed the quantity and characteristics of Wikipedia’s medical contributors, including year-over-year participation trends and editor demographics.

Methods: Using Wikipedia’s well-defined categorization infrastructure, we identified medically pertinent English-language Wikipedia articles and the links to their foreign language equivalents. With these, Wikipedia can be queried to produce metadata and full texts for entire article histories. Wikipedia also makes available hourly reports that aggregate reader traffic at per-article granularity. An online survey was used to determine the background of contributors. Standard mining and visualization techniques (eg, aggregation queries, cumulative distribution functions, and correlation metrics) were applied to each of these datasets. Analysis focused on year-end 2013, but historical data permitted some longitudinal analysis.

Results: At the end of 2013, Wikipedia’s medical content comprised more than 155,000 articles and 1 billion bytes of text across more than 255 languages. This content was supported by more than 950,000 references and was viewed more than 4.88 billion times in 2013, making it one of the most viewed medical resources globally, if not the most viewed. The core editor community numbered fewer than 300 and had declined over the preceding 5 years. Half of the members of this community were health care providers, and 85.5% (100/117) had a university education.

Conclusions: Although Wikipedia has a considerable volume of multilingual medical content that is extensively read and well referenced, the core group of editors who contribute and maintain that content is small and shrinking.
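The three techniques named in the Methods (aggregation queries, cumulative distribution functions, and correlation metrics) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the record layout, the `HOURLY_VIEWS` sample, and all function names are assumptions made for illustration, and the numbers are invented rather than taken from the study.

```python
from bisect import bisect_right
from collections import defaultdict
from math import sqrt

# Hypothetical hourly traffic records: (article, language edition, views).
# These values are invented for illustration only.
HOURLY_VIEWS = [
    ("Influenza", "en", 120),
    ("Influenza", "en", 80),
    ("Influenza", "de", 30),
    ("Asthma", "en", 60),
    ("Asthma", "es", 40),
    ("Malaria", "en", 50),
    ("Malaria", "fr", 20),
]


def total_views_by_article(records):
    """Aggregation query: sum views per article across hours and languages."""
    totals = defaultdict(int)
    for article, _language, views in records:
        totals[article] += views
    return dict(totals)


def empirical_cdf(values):
    """Return a function giving the fraction of values less than or equal to x."""
    ordered = sorted(values)
    count = len(ordered)
    if count == 0:
        raise ValueError("empirical_cdf needs at least one value")

    def cdf(x):
        return bisect_right(ordered, x) / count

    return cdf


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


if __name__ == "__main__":
    totals = total_views_by_article(HOURLY_VIEWS)
    cdf = empirical_cdf(totals.values())
    print(totals)
    print("share of articles with <= 100 views:", cdf(100))
```

In the same spirit, `pearson` could be applied to per-article view totals and a matching list of disease prevalence figures, which would mirror the readership versus prevalence comparison described in the Objective; any such prevalence data would have to come from an external source.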
