Quantifying the cognitive extent of science

While the modern science is characterized by an exponential growth in scientific literature, the increase in publication volume clearly does not reflect the expansion of the cognitive boundaries of science. Nevertheless, most of the metrics for assessing the vitality of science or for making funding and policy decisions are based on productivity. Similarly, the increasing level of knowledge production by large science teams, whose results often enjoy greater visibility, does not necessarily mean that "big science" leads to cognitive expansion. Here we present a novel, big-data method to quantify the extents of cognitive domains of different bodies of scientific literature independently from publication volume, and apply it to 20 million articles published over 60-130 years in physics, astronomy, and biomedicine. The method is based on the lexical diversity of titles of fixed quotas of research articles. Owing to large size of quotas, the method overcomes the inherent stochasticity of article titles to achieve <1% precision. We show that the periods of cognitive growth do not necessarily coincide with the trends in publication volume. Furthermore, we show that the articles produced by larger teams cover significantly smaller cognitive territory than (the same quota of) articles from smaller teams. Our findings provide a new perspective on the role of small teams and individual researchers in expanding the cognitive boundaries of science. The proposed method of quantifying the extent of the cognitive territory can also be applied to study many other aspects of "science of science."

[1]  R. Perrucci,et al.  From Little Science to Big Science , 2017 .

[2]  Alexander Zahar,et al.  Shaping written knowledge: the genre and activity of the experimental article in science , 1991, Medical History.

[3]  Benjamin F. Jones,et al.  Multi-University Research Teams: Shifting Impact, Geography, and Stratification in Science , 2008, Science.

[4]  Albert N. Tabah,et al.  Literature Dynamics: Studies on Growth, Diffusion, and Epidemics , 1999 .

[5]  Li Tang,et al.  Tracing the footprint of knowledge spillover: Evidence from U.S.-China collaboration in nanotechnology , 2013, J. Assoc. Inf. Sci. Technol..

[6]  M. Callon,et al.  From translations to problematic networks: An introduction to co-word analysis , 1983 .

[7]  Stasa Milojevic,et al.  Multidisciplinary cognitive content of nanoscience and nanotechnology , 2012, Journal of Nanoparticle Research.

[8]  J. R. Cole,et al.  The Ortega Hypothesis , 1972, Science.

[9]  David Kaiser,et al.  Booms, Busts, and the World of Ideas: Enrollment Pressures and the Challenge of Specialization , 2012, Osiris.

[10]  Scott Jarvis,et al.  vocd: A theoretical and empirical evaluation , 2007 .

[11]  Anthony F. J. van Raan,et al.  The neural net of neural network research , 2005, Scientometrics.

[12]  Michel Zitt,et al.  A simple method for dynamic scientometrics using lexical analysis , 1991, Scientometrics.

[13]  J. Law,et al.  A Co-Word Study of Artificial Intelligence , 1989 .

[14]  Benjamin F. Jones,et al.  Supporting Online Material Materials and Methods Figs. S1 to S3 References the Increasing Dominance of Teams in Production of Knowledge , 2022 .

[15]  Loet Leydesdroff Words and co-words as indicators of intellectual organization , 1989 .

[16]  David Malvern,et al.  Measuring vocabulary diversity using dedicated software , 2000 .

[17]  Rush D. Holt,et al.  Falling Behind? Boom, Bust, and the Global Race for Scientific Talent , 2015 .

[18]  Marco Pautasso,et al.  Publication Growth in Biological Sub-Fields: Patterns, Predictability and Sustainability , 2012 .

[19]  Matjaz Perc,et al.  Self-organization of progress across the century of physics , 2013, Scientific Reports.

[20]  Stephen S. Murray,et al.  E‐prints and journal articles in astronomy: a productive co‐existence , 2006, Learn. Publ..

[21]  Manfred Kochen Integrative mechanisms in literature growth , 1974 .

[22]  John Whittaker,et al.  Creativity and Conformity in Science: Titles, Keywords and Co-word Analysis , 1989 .

[23]  Roger Guimerà,et al.  Team Assembly Mechanisms Determine Collaboration Network Structure and Team Performance , 2005, Science.

[24]  Peder Olesen Larsen,et al.  The rate of growth in scientific publication and the decline in coverage provided by Science Citation Index , 2010, Scientometrics.

[25]  Tiago Moreira,et al.  Structures of Scientific Collaboration , 2009 .

[26]  Derek J. de Solla Price,et al.  Science Since Babylon , 1961 .

[27]  Henk F. Moed,et al.  Mapping of science by combined co-citation and word analysis, I. Structural aspects , 1991, J. Am. Soc. Inf. Sci..

[28]  Stasa Milojevic,et al.  Principles of scientific research team formation and evolution , 2014, Proceedings of the National Academy of Sciences.

[29]  Katy Börner,et al.  A Multi-Level Systems Perspective for the Science of Team Science , 2010, Science Translational Medicine.

[30]  S. Freytag Image And Logic A Material Culture Of Microphysics , 2016 .

[31]  Michel Zitt,et al.  Development of a method for detection and trend analysis of research fronts built by lexical or cocitation analysis , 1994, Scientometrics.

[32]  Cassidy R. Sugimoto,et al.  The cognitive structure of Library and Information Science: Analysis of article title words , 2011, J. Assoc. Inf. Sci. Technol..

[33]  Lutz Bornmann,et al.  Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references , 2014, J. Assoc. Inf. Sci. Technol..

[34]  Arie Rip,et al.  Co-word maps of biotechnology: An example of cognitive scientometrics , 1984, Scientometrics.

[35]  V. Markovich,et al.  Magnetic properties of electron-doped La0.23Ca0.77MnO3 nanoparticles , 2012, Journal of Nanoparticle Research.

[36]  A. Weinberg Impact of Large-Scale Science on the United States: Big science is here to stay, but we have yet to make the hard financial and educational choices it imposes. , 1961, Science.

[37]  Katy Börner,et al.  Open data and open code for big science of science studies , 2014, Scientometrics.

[38]  Yu Xie,et al.  Is American Science in Decline , 2012 .

[39]  Anthony F. J. van Raan,et al.  Monitoring Scientific Developments from a Dynamic Perspective: Self-Organized Structuring to Map Neural Network Research , 1998, Journal of the American Society for Information Science.

[40]  Weimao Ke,et al.  The Scholarly Database and its utility for scientometrics research , 2009, Scientometrics.

[41]  Rie Koizumi,et al.  Relationships between text length and lexical diversity measures: Can we use short texts of less than 100 tokens? , 2012 .

[42]  Jean Pierre Courtial,et al.  Co-word analysis as a tool for describing the network of interactions between basic and technological research: The case of polymer chemsitry , 1991, Scientometrics.

[43]  Roger L. Geiger What Happened after Sputnik? Shaping University Research in the United States , 1997 .

[44]  Matjaz Perc,et al.  Inheritance patterns in citation networks reveal scientific memes , 2014, ArXiv.

[45]  Marcia Bartusiak,et al.  The Day We Found the Universe , 2009 .

[46]  Anthony F. J. van Raan,et al.  On Growth, Ageing, and Fractal Differentiation of Science , 2000, Scientometrics.

[47]  Loet Leydesdorff Words and co-words as indicators of intellectual organization , 1989 .

[48]  W. Bynum,et al.  A History of Molecular Biology , 1999, Nature Medicine.

[49]  David Malvern,et al.  Lexical Diversity and Language Development: Quantification and Assessment , 2004 .

[50]  Michael J. Moravcsik,et al.  Science: Growth and Change , 1972 .

[51]  A. Raan THE NEURAL NET OF NEURAL NETWORK RESEARCH AN EXERCISE IN BIBLIOMETRIC MAPPING , 2022 .

[52]  Andrey Rzhetsky,et al.  Tradition and Innovation in Scientists’ Research Strategies , 2013, ArXiv.

[53]  Henk F. Moed,et al.  Mapping of science by combined co-citation and word analysis: II: Dynamical aspects , 1991, J. Am. Soc. Inf. Sci..

[54]  Erin A. Cech,et al.  How Economics Shapes Science , 2013 .

[55]  W. Myers,et al.  Atypical Combinations and Scientific Impact , 2013 .

[56]  Chaomei Chen,et al.  The New Invisible College: Science for Development , 2009, J. Assoc. Inf. Sci. Technol..