Research collaboration prediction and recommendation based on network embedding in co‐authorship networks

In large‐scale datasets, the researchers' multiple features need to be learned automatically instead of manually defined and enumerated, to improve the efficiency and effect of research collaboration prediction and recommendation. This paper applies the network embedding method to learn the context of each researcher by which the semantic similarities among researchers are calculated. Firstly, the co‐authorship network is built in a large‐scale dataset where research collaborations are denoted by co‐authorships. Then the researchers' semantic contexts in the network are learned by the network embedding method based on deep learning, and each researcher's dense, low‐dimensional vector is formed. Finally, the semantic similarities among researchers are calculated through vector similarity indices and quantitatively compared by link prediction. Experiments in the field of library and information science (LIS) verify that the method can improve the accuracy and effectiveness of research collaboration prediction and recommendation.

[1]  Nitesh V. Chawla,et al.  Multi-relational Link Prediction in Heterogeneous Information Networks , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[2]  Jing Zhao,et al.  Prediction of Links and Weights in Networks by Reliable Routes , 2015, Scientific Reports.

[3]  Erjia Yan,et al.  Predicting and recommending collaborations: An author-, institution-, and country-level analysis , 2014, J. Informetrics.

[4]  ZhangJinzhu Uncovering mechanisms of co-authorship evolution by multirelations-based link prediction , 2017 .

[5]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[6]  Stefan Kuhlmann,et al.  Across institutional boundaries?: Research collaboration in German public sector nanoscience , 2008 .

[7]  Ronald Rousseau,et al.  This item is the archived peer-reviewed author-version of: Recommending research collaborations using link prediction and random forest classifiers , 2022 .

[8]  Stephanie M Fullerton,et al.  Research guidelines in the era of large-scale collaborations: an analysis of Genome-wide Association Study Consortia. , 2012, American journal of epidemiology.

[9]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[10]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[11]  Omer Levy,et al.  word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method , 2014, ArXiv.

[12]  Tao Zhou,et al.  Measuring multiple evolution mechanisms of complex networks , 2014, Scientific Reports.

[13]  Jinzhu Zhang,et al.  Uncovering mechanisms of co-authorship evolution by multirelations-based link prediction , 2017, Inf. Process. Manag..