The NASA Astrophysics Data System : Sociology , Bibliometrics , and Impact

The NASA Astrophysics Data System (ADS), along with astronomy's journals and data centers, has developed a distributed on-line digital library which has become the dominant means by which astronomers search, access and read their technical literature. By combining data from the text, citation, and reference databases with data from the ADS readership logs we have been able to create Second Order Bibliometric Operators, a customizable class of collaborative lters which permits substantially improved accuracy in literature queries. Using the ADS usage logs along with membership statistics from the International Astronomical Union and data on the population and gross domestic product (GDP) we develop an accurate model for world-wide basic research where the number of scientists in a country is proportional to the GDP of that country, and the amount of basic research done by a country is proportional to the number of scientists in that country times that country's per capita GDP. We examine the obsolescence function as measured by actual reads, and show that it can be well t by the sum of four exponentials with very di erent time constants. We compare the obsolescence function as measured by readership with the obsolescence function as measured by citations. We nd that the citation function is proportional to the sum of two of the components of the readership function. This proves that the normative theory of citation is true in the mean. We further examine in detail the similarities and di erences between the citation rate, the readership rate and the total citations for individual articles, and discuss some of the causes. Using the number of reads as a bibliometric measure for individuals, we introduce the readcite diagram to provide a two-dimensional view of an individual's scienti c productivity. We develop a simple model to account for an individual's reads and cites and use it to show that the position of a person in the read-cite diagram is a function of age, innate productivity, and work history. We show the age biases of both reads and cites, and develop two new bibliometric measures which have substantially less age bias than citations: SumProd, a weighted sum of total citations and the readership rate, intended to show the total productivity of an individual; and Read10, the readership rate for papers published in the last ten years, intended to show an individual's current productivity. We also discuss the e ect of normalization (dividing by the number of authors on a paper) on these statistics. We apply SumProd and Read10 using new, non-parametric techniques to rank and compare di erent astronomical research organizations. We then introduce the concept of utility time to measure the impact of the ADS and the electronic astronomical library on astronomical research. We nd that in 2002 it amounted to the equivalent of 736 FTE researchers, or $250 Million, or the astronomical research done in France. Subject headings: digital libraries; bibliometrics; sociology of science; information retrieval Submitted to The Journal of the American Society for Information Science and Technology 2

[1]  Guenther Eichhorn,et al.  Evolution of Urania into the AVO , 2001, SPIE Optics + Photonics.

[2]  T. J. Phelan,et al.  A compendium of issues for citation analysis , 1999, Scientometrics.

[3]  Michael L. Nelson,et al.  Determining the publication impact of a digital library , 2000 .

[4]  Guenther Eichhorn,et al.  The ADS Abstract Service: One Year Old , 1994 .

[5]  Peter B. Boyce,et al.  The Electronic Astrophysical Journal Letters project , 1995 .

[6]  Andrea Bergmann,et al.  Citation Indexing Its Theory And Application In Science Technology And Humanities , 2016 .

[7]  Virginia Trimble,et al.  Astrophysics in 1998 , 1999 .

[8]  Michael J. Kurtz Second Order Knowledge: Information Retrieval in the Terabyte Era , 1992 .

[9]  Michael D. Cooper,et al.  Using Article Photocopy Data in Bibliographic Models for Journal Collection Management , 1994, The Library Quarterly.

[10]  Stephen S. Murray,et al.  Intelligent text retrieval in the NASA astrophysics data system , 1992 .

[11]  Michael J. Kurtz,et al.  The NASA Astrophysics Data System: Architecture , 2000, astro-ph/0002105.

[12]  Raymond Plante,et al.  The NCSA Astronomy Digital Image Library: The Challenges of the Scientific Data Library , 1997, D Lib Mag..

[13]  Virginia Trimble,et al.  Astrophysics in 2000 , 2001 .

[14]  Maurice B. Line,et al.  Changes in the Use of Literature with Time--Obsolescence Revisited , 1993, Libr. Trends.

[15]  B. G. Corbin,et al.  Digitization of historical astronomical literature , 1995 .

[16]  Alexander S. Szalay,et al.  Virtual Observatories of the Future , 2001 .

[17]  Michael J. Kurtz,et al.  The NASA Astrophysics Data System: Data holdings , 2000, astro-ph/0002103.

[18]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[19]  F. H. Herbstein,et al.  Measuring “publications output” and “publications impact” of faculty members of a university chemistry department , 1993, Scientometrics.

[20]  Michael J. Kurtz,et al.  The NASA Astrophysics Data System: The search engine and its user interface , 2000, astro-ph/0002102.

[21]  Stephen S. Murray,et al.  Second order bibliometric operators in the Astrophysics Data System , 2002, SPIE Astronomical Telescopes + Instrumentation.

[22]  Howard D. White,et al.  Authors as citers over time , 2001, J. Assoc. Inf. Sci. Technol..

[23]  William Wilson Morgan,et al.  Fundamental stellar photometry for standards of spectral type on the revised system of the Yerkes spectral atlas , 1953 .

[24]  Virginia Trimble,et al.  Astrophysics in 2001 , 2002 .

[25]  F. Ochsenbein,et al.  The A&A tables and abstracts: An example of collaboration between data centres and editors , 1995 .

[26]  Lodewijk Woltjer Economic Consequences of the Deterioration of the Astronomical Environment , 1998 .

[27]  Markus Demleitner,et al.  Looking at 3,000,000 References Without Growing Grey Hair , 1999 .

[28]  Michael H. MacRoberts,et al.  Problems of citation analysis: A critical review , 1989, JASIS.

[29]  Barry F. Madore,et al.  The NASA/IPAC Extragalactic Database , 1990 .

[30]  Carol Tenopir,et al.  Relying on electronic journals: Reading patterns of astronomers , 2005, J. Assoc. Inf. Sci. Technol..

[31]  M. Phillips,et al.  Observational Evidence from Supernovae for an Accelerating Universe and a Cosmological Constant , 1998, astro-ph/9805201.

[32]  Mengxiong Liu,et al.  Progress in Documentation the Complexities of citation Practice: a Review of citation studies , 1993, J. Documentation.

[33]  Peter B. Boyce,et al.  Urania, a Linked, Distributed Resource for Astronomy , 1998 .

[34]  Peter Bradford Boyce,et al.  Journals, Data and Abstracts Make an Integrated Electronic Resource , 1996 .

[35]  E. A. Fox,et al.  Combining the Evidence of Multiple Query Representations for Information Retrieval , 1995, Inf. Process. Manag..

[36]  W. Satariano,et al.  Journal Use in Sociology: Citation Analysis versus Readership Patterns , 1978, The Library Quarterly.

[37]  Arlo U. Landolt,et al.  UBVRI Photometric Standard Stars in the Magnitude Range 11 , 1992 .

[38]  Virginia Trimble,et al.  Astrophysics in 1999 , 2000 .

[39]  Michael J. Kurtz,et al.  The NASA Astrophysics Data System: Overview , 2000, astro-ph/0002104.

[40]  Ming-Yueh Tsay,et al.  Library Journal Use and Citation Half-Life in Medical Science , 1998, J. Am. Soc. Inf. Sci..

[41]  P. Seglen,et al.  Citations and journal impact factors: questionable indicators of research quality , 1997, Allergy.

[42]  D. Egret,et al.  The simbad astronomical database , 1991 .

[43]  Michael J. Kurtz,et al.  Journal Citations: An ADS-AAS Collaboration , 1996 .

[44]  Francoise Genova,et al.  It's All Done With Mirrors: Improving Information Access , 1996 .

[45]  Stephen S. Murray,et al.  The NASA Astrophysics Data System: A Heterogeneous Distributed Processing System Application , 1992 .

[46]  James Lequeux,et al.  Physical Properties of Molecular Clouds in the Inner Disk of M31 , 1995 .

[47]  R. E. Burton,et al.  The “half‐life” of some scientific and technical literatures , 1960 .

[48]  G. Abell,et al.  The distribution of rich clusters of galaxies. , 1957 .

[49]  V. A. Wente Scientific and technical information management , 1990 .

[50]  Maurice B. Line,et al.  PROGRESS IN DOCUMENTATION: ‘obsolescence’ and changes in the use of literature with time , 1974 .

[51]  Francoise Genova,et al.  The CDS information hub , 2000, astro-ph/0002095.

[52]  A. Sandison,et al.  THE USE OF OLDER LITERATURE AND ITS OBSOLESCENCE , 1971 .