This paper reports a study of the incremental impact of evolving cyberinfrastructure (CI)enabled collaboration networks on scientific capacity and knowledge diffusion. While ample research shows how collaboration contributes to greater productivity, higher-quality scientific outputs, and increased probability of breakthroughs, it is unclear how the early stages of collaboration on data creation supports knowledge generation and diffusion. Further, it is not known whether the ability to garner larger inputs2 increases collaboration capacity and subsequently accelerates the rate of knowledge diffusion. Given that the collaboration capacity of a science team is largely dependent upon the Scientific and Technical (S&T) Human Capital3, the greater a researcher’s S&T human capital, the greater the opportunity to collaborate and access resources. We use “Collaboration Capacity” to refer to this measure of S&T human capital. In this study, we collected metadata for molecular sequences in GenBank4 5 from 1990-2013. The data contain details about sequences, submission date, submitter(s), and associated publications and authors. Based on the collaboration capacity framework (Figure 1), we focused on the relationship between collaboration network size and research productivity and the role of CI-enabled data repositories in accelerating collaboration capacity. Our preliminary results show that the size of CI-enabled collaboration networks at data creation stage was positively related to research productivity as measured by sequence data production, and the extent and rate of knowledge diffusion, represented by patent applications. Shrinking time gaps between data submissions and patent applications support the hypothesis that CIenabled data repositories are an accelerating factor in incremental collaboration capacity. 1 Corresponding Author Contact Information: jqin@syr.edu; (315)443-5642 2 Stephan, P. (2012). How Economics Shapes Science. Cambridge, MA: Harvard University Press. 3 Bozeman, B., Dietz, J., & Gaughan, M.: Scientific and technical human capital: an alternative model for research evaluation. International Journal of Technology Management, 22: 636–655 (2001). 4 NCBI-a. GenBank overview, http://www.ncbi.nlm.nih.gov/genbank/. 5 NCBI-b. Growth of GenBank and WGS, http://www.ncbi.nlm.nih.gov/genbank/statistics.
[1]
Noriko Hara,et al.
An emerging view of scientific collaboration: Scientists' perspectives on collaboration and factors that impact collaboration
,
2003,
J. Assoc. Inf. Sci. Technol..
[2]
Barry Bozeman,et al.
Research Collaboration and Team Science: A State-of-the-Art Review and Agenda
,
2014
.
[3]
Anthony J. G. Hey,et al.
Jim Gray on eScience: a transformed scientific method
,
2009,
The Fourth Paradigm.
[4]
A. Barabasi,et al.
Evolution of the social network of scientific collaborations
,
2001,
cond-mat/0104162.
[5]
Ismael Rafols,et al.
Is science becoming more interdisciplinary? Measuring and mapping six research fields over time
,
2009,
Scientometrics.
[6]
Stanley Wasserman,et al.
Social Network Analysis: Methods and Applications
,
1994,
Structural analysis in the social sciences.
[7]
Jian Qin.
Levels and types of collaboration in interdisciplinary research in the sciences
,
1996
.
[8]
John W. Tukey,et al.
Exploratory Data Analysis.
,
1979
.
[9]
David J. DeWitt,et al.
Scientific data management in the coming decade
,
2005,
SGMD.
[10]
Roger Guimerà,et al.
Team Assembly Mechanisms Determine Collaboration Network Structure and Team Performance
,
2005,
Science.
[11]
Alexander S. Szalay,et al.
Gray's laws: database-centric computing in science
,
2009,
The Fourth Paradigm.
[12]
Jian Qin,et al.
Emergence of collaboration networks around large scale data repositories: a study of the genomics community using GenBank
,
2016,
Scientometrics.
[13]
Mark E. J. Newman,et al.
The Structure and Function of Complex Networks
,
2003,
SIAM Rev..
[14]
M. Newman,et al.
The structure of scientific collaboration networks.
,
2000,
Proceedings of the National Academy of Sciences of the United States of America.
[15]
Monica Gaughan,et al.
Scientific and technical human capital: an alternative model for research evaluation
,
2001,
Int. J. Technol. Manag..