Temporal patterns of genes in scientific publications

Publications in scientific journals contain a considerable fraction of our scientific knowledge. Analyzing data from publication databases helps us understand how this knowledge is obtained and how it changes over time. In this study, we present a mathematical model for the temporal dynamics of data on the scientific content of publications. Our data set consists of references to thousands of genes in the >15 million publications listed in PubMed. We show that the observed dynamics may result from a simple process: Researchers predominantly publish on genes that already appear in many publications. This might be a rewarding strategy for researchers, because there is a positive correlation between the frequency of a gene in scientific publications and the journal impact of the publications. By comparing the empirical data with model predictions, we are able to detect unusual publication patterns that often correspond to major achievements in the field. We identify interactions between yeast genes from PubMed and show that the frequency differences of genes in publications lead to a biased picture of the resulting interaction network.

[1]  S. Blackmore The Meme Machine , 1999 .

[2]  P. Richerson,et al.  Culture and the Evolutionary Process , 1988 .

[3]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[4]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Roger Guimerà,et al.  Team Assembly Mechanisms Determine Collaboration Network Structure and Team Performance , 2005, Science.

[6]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[8]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[9]  H. Plotkin,et al.  People do more than imitate. , 2000, Scientific American.

[10]  Alfonso Valencia,et al.  Life cycles of successful genes. , 2003, Trends in genetics : TIG.

[11]  L A Dugatkin,et al.  Animals imitate, too. , 2000, Scientific American.

[12]  T. Kuhn,et al.  The Structure of Scientific Revolutions. , 1964 .

[13]  R. Lewontin ‘The Selfish Gene’ , 1977, Nature.

[14]  R. Motani Rulers of the Jurassic seas. , 2000, Scientific American.

[15]  R. Swinburne OBJECTIVE KNOWLEDGE: AN EVOLUTIONARY APPROACH , 1973 .

[16]  C. Lumsden Culture and the Evolutionary Process, Robert Boyd, Peter J. Richerson. University of Chicago Press, Chicago & London (1985), viii, +301. Price $29.95 , 1986 .

[17]  S. Redner Citation statistics from 110 years of physical review , 2005, physics/0506056.

[18]  Alfonso Valencia,et al.  Implementing the iHOP concept for navigation of biomedical literature , 2005, ECCB/JBI.

[19]  D J PRICE,et al.  NETWORKS OF SCIENTIFIC PAPERS. , 1965, Science.

[20]  Gary D Bader,et al.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry , 2002, Nature.

[21]  A. Valencia,et al.  A gene network for navigating the literature , 2004, Nature Genetics.

[22]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[23]  Greenberg Jm,et al.  The secrets of stardust. , 2000 .

[24]  E. Garfield Citation analysis as a tool in journal evaluation. , 1972, Science.

[25]  Chaomei Chen,et al.  Searching for intellectual turning points: Progressive knowledge domain visualization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[26]  R Boyd,et al.  Meme theory oversimplifies how culture changes. , 2000, Scientific American.

[27]  W Sibbett,et al.  Ultrashort-pulse lasers: big payoffs in a flash. , 2000, Scientific American.

[28]  Robert L. Goldstone,et al.  The simultaneous evolution of author and paper networks , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[29]  K. Popper Objective Knowledge: An Evolutionary Approach , 1972 .

[30]  Vwani P. Roychowdhury,et al.  Read Before You Cite! , 2003, Complex Syst..

[31]  J. Ioannidis Why Most Published Research Findings Are False , 2005, PLoS medicine.

[32]  S. Redner Aggregation kinetics of popularity , 2002 .

[33]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[34]  A. Valencia,et al.  Protein interaction: same network, different hubs. , 2003, Trends in genetics : TIG.