Strange attractors in the Web of Science database

Accurate computation of h indices or other indicators of research impact requires access to databases supplying complete and accurate citation information. The Web of Science (WoS) database is widely used for this purpose and it is generally deemed error-free. This note describes an inaccuracy that seems to affect differentially non-English sources and targets in WoS, namely, “phantom citations” (i.e., papers reported by WoS to cite some article when they actually did not) and their concentration around particular articles that are thus dubbed “strange attractors”. The analysis of references in (and citations to) papers in two English sources and two non-English sources reveals that phantom citations and other errors of indexing occur about twice as often with non-English items. These and other errors of commission affect about 1% of the cited references in the WoS database, and they may reveal large-scale problems in the reference matching algorithm in WoS.

[1]  Lars Iselid,et al.  Web of Science and Scopus: a journal title overlap study , 2008, Online Inf. Rev..

[2]  J. E. Hirsch,et al.  An index to quantify an individual's scientific research output , 2005, Proc. Natl. Acad. Sci. USA.

[3]  Debora Shaw,et al.  A new look at evidence of scholarly citation in citation indexes and from web sources , 2008, Scientometrics.

[4]  Iolanda Ventura Quaestiones and Encyclopedias: Some Aspects of the Late Medieval Reception of Pseudo-Aristotelian Problemata in Encyclopaedic and Scientific Culture , 2004 .

[5]  Yvonne Rogers,et al.  Citation counting, citation ranking, and h-index of human-computer interaction researchers: A comparison of Scopus and Web of Science , 2008, J. Assoc. Inf. Sci. Technol..

[6]  Péter Jacsó,et al.  The pros and cons of computing the h-index using Google Scholar , 2008, Online Inf. Rev..

[7]  Michael Levine-Clark,et al.  A Comparative Citation Analysis of Web of Science, Scopus, and Google Scholar , 2008 .

[8]  J. Kearney,et al.  Positive selection from newly formed to marginal zone B cells depends on the rate of clonal production, CD19, and btk. , 2000, Immunity.

[9]  Alasdair A. MacDonald,et al.  Schooling and society : the ordering and reordering of knowledge in the Western Middle Ages , 2004 .

[10]  Lokman I. Meho,et al.  Impact of data sources on citation counts and rankings of LIS faculty: Web of science versus scopus and google scholar , 2007, J. Assoc. Inf. Sci. Technol..

[11]  William H. Walters,et al.  Google Scholar Search Performance: Comparative Recall and Precision , 2009 .

[12]  Miguel A. García-Pérez,et al.  Accuracy and completeness of publication and citation records in the Web of Science, PsycINFO, and Google Scholar: A case study for the computation of h indices in Psychology , 2010, J. Assoc. Inf. Sci. Technol..

[13]  Judit Bar-Ilan,et al.  Which h-index? — A comparison of WoS, Scopus and Google Scholar , 2008, Scientometrics.

[14]  F. W. Lancaster,et al.  Testing the Calculation of a Realistic h-index in Google Scholar, Scopus, and Web of Science for , 2008 .

[15]  Charles Oppenheim,et al.  Comparing alternatives to the Web of Science for coverage of the social sciences' literature , 2007, J. Informetrics.

[16]  Pedro García-Zamora,et al.  Vegetación briofítca de las sierras de Filabres, Cabrera, Alhamilla y Cabo de Gata (Almería, SE de España) , 2000 .

[17]  Lee S. Shulman,et al.  Taking Learning Seriously , 1999 .

[18]  P. Peris,et al.  Evolución en el conocimiento de la fibra , 2007 .

[19]  Péter Jacsó,et al.  The pros and cons of computing the h-index using Web of Science , 2008, Online Inf. Rev..

[20]  Tove Faber Frandsen,et al.  Intradisciplinary differences in database coverage and the consequences for bibliometric research , 2008, J. Assoc. Inf. Sci. Technol..

[21]  Elizabeth S. Vieira,et al.  A comparison of Scopus and Web of Science for a typical university , 2009, Scientometrics.

[22]  Péter Jacsó,et al.  The pros and cons of computing the h-index using Scopus , 2008, Online Inf. Rev..

[23]  Peter Jasco,et al.  Testing the Calculation of a Realistic h-index in Google Scholar, Scopus, and Web of Science for F. W. Lancaster , 2008 .

[24]  María Peñaranda Ortega,et al.  Consecuencias de los errores en las referencias bibliográficas. El caso de la revista Psicothema , 2009 .

[25]  Ronald Rousseau,et al.  The influence of missing publications on the Hirsch index , 2007, J. Informetrics.