The state of OA: a large-scale analysis of the prevalence and impact of Open Access articles

Despite growing interest in Open Access (OA) to scholarly literature, there is an unmet need for large-scale, up-to-date, and reproducible studies assessing the prevalence and characteristics of OA. We address this need using oaDOI, an open online service that determines OA status for 67 million articles. We use three samples, each of 100,000 articles, to investigate OA in three populations: (1) all journal articles assigned a Crossref DOI, (2) recent journal articles indexed in Web of Science, and (3) articles viewed by users of Unpaywall, an open-source browser extension that lets users find OA articles using oaDOI. We estimate that at least 28% of the scholarly literature is OA (19M in total) and that this proportion is growing, driven particularly by growth in Gold and Hybrid. The most recent year analyzed (2015) also has the highest percentage of OA (45%). Because of this growth, and the fact that readers disproportionately access newer articles, we find that Unpaywall users encounter OA quite frequently: 47% of articles they view are OA. Notably, the most common mechanism for OA is not Gold, Green, or Hybrid OA, but rather an under-discussed category we dub Bronze: articles made free-to-read on the publisher website, without an explicit Open license. We also examine the citation impact of OA articles, corroborating the so-called open-access citation advantage: accounting for age and discipline, OA articles receive 18% more citations than average, an effect driven primarily by Green and Hybrid OA. We encourage further research using the free oaDOI service, as a way to inform OA policy and practice.

Subjects: Legal Issues, Science Policy, Data Science
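As a rough illustration of the arithmetic behind the headline estimate, the sketch below scales the 28% share to the 67 million articles covered by oaDOI and computes a Wilson score interval for a proportion observed in a sample of 100,000. It assumes the 28% applies to the full 67 million and treats the sample as a simple random sample; the function and variable names are illustrative and not taken from the study, and the interval reflects sampling error only, not errors in OA detection.

```python
import math


def wilson_interval(successes: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z=1.96 gives ~95%)."""
    p = successes / n
    denom = 1 + z**2 / n
    center = (p + z**2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return center - half_width, center + half_width


# Figures quoted in the abstract.
SAMPLE_SIZE = 100_000          # articles per sample
OA_SHARE = 0.28                # "at least 28%" of the literature is OA
TOTAL_ARTICLES = 67_000_000    # articles covered by oaDOI

# Assumption: the sampled share carries over to all 67 million articles.
estimated_oa_articles = OA_SHARE * TOTAL_ARTICLES

# Sampling uncertainty for a 28% share observed in 100,000 articles.
low, high = wilson_interval(round(OA_SHARE * SAMPLE_SIZE), SAMPLE_SIZE)

if __name__ == "__main__":
    print(f"Estimated OA articles: {estimated_oa_articles / 1e6:.1f}M")
    print(f"Approx. 95% interval for the share: {low:.4f} to {high:.4f}")
```

Under these assumptions the scaled count comes to roughly 18.8 million, consistent with the rounded 19M figure, and the sampling interval spans well under one percentage point on either side of 28%.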
