Erratum to: Toward a better scientific collaboration success prediction model through the feature space expansion

This study addresses the problem of predicting the success of a scientific collaboration from the scholars' previous collaborations, using machine learning techniques. Because exploiting the collaboration network is essential in collaborator discovery systems, this article attempts to understand how to exploit the information embedded in such networks. We use the link structure among scholars, and between scholars and concepts, to extract a set of features that are correlated with collaboration success and that increase prediction performance. We also examine the effect of using other aggregation methods, in addition to average and maximum, to compute collaboration features from the features of the individual members. A dataset extracted from Northwestern University’s SciVal Expert is used to evaluate the proposed approach. The results demonstrate that the proposed collaboration features increase prediction performance when combined with widely used features such as the h-index and average citation counts. The introduced features are therefore appropriate for incorporation into collaborator discovery systems.
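The idea of computing collaboration-level features by aggregating member-level features can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `aggregate_member_features`, the feature names `h_index` and `avg_citations`, and the choice of minimum, median, and sum as the additional aggregation methods are all assumptions made for illustration, since the abstract names only average and maximum.

```python
from statistics import mean, median

# Hypothetical aggregators. Only "avg" and "max" are named in the abstract;
# "min", "median", and "sum" are assumed examples of "other" methods.
AGGREGATORS = {
    "avg": mean,
    "max": max,
    "min": min,
    "median": median,
    "sum": sum,
}


def aggregate_member_features(members, aggregators=AGGREGATORS):
    """Build team-level features from per-member feature dictionaries.

    `members` is a non-empty list of dicts that share the same keys,
    e.g. {"h_index": 10, "avg_citations": 4.0}.
    Returns a flat dict such as {"h_index_avg": ..., "h_index_max": ...}.
    """
    if not members:
        raise ValueError("a collaboration needs at least one member")
    team_features = {}
    for feature in members[0]:
        values = [member[feature] for member in members]
        for name, func in aggregators.items():
            team_features[f"{feature}_{name}"] = func(values)
    return team_features


# Illustrative values only, not data from the study.
team = [
    {"h_index": 10, "avg_citations": 4.0},
    {"h_index": 20, "avg_citations": 8.0},
    {"h_index": 30, "avg_citations": 6.0},
]
features = aggregate_member_features(team)
```

The resulting flat dictionary could serve as one row of input to a classifier, alongside any network-derived features.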
