TNERec: Topic-Aware Network Embedding for Scientific Collaborator Recommendation

Collaboration is increasingly becoming a vital factor in an academic network, which can bring lots of benefits for scholars. Ubiquitous intelligence also provides an effective way for scholars to find collaborators. However, due to the large-scale of scholarly big data, there is a lot of information hard to capture in networks and we need to dig out valid information from collaboration networks. It is a valuable and urgent task to find appropriate collaborators for scholars. To address these problems, we hypothesize that fusing topic model and structure information could improve the performance of recommendations. In this paper, we propose a collaborator recommendation system, named TNERec (Topic-aware Network Embedding for scientific collaborator Recommendation), learning representations from scholars' research interests and network structure. TNERec first extracts scholars' research interests based on topic model and then learns vectors of scholars with network embedding. Finally, top-k recommendation list is generated based on the scholar vectors. Experimental results on a real-world dataset show the effectiveness of the proposed framework compared with state-of-the-art collaboration recommendation baselines.

[1]  Barry Bozeman,et al.  Research Collaboration and Team Science: A State-of-the-Art Review and Agenda , 2014 .

[2]  Meng Wang,et al.  Exploring dynamic research interest and academic influence for scientific collaborator recommendation , 2017, Scientometrics.

[3]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[4]  Kun Zhang,et al.  Modeling Network with Topic Model and Triangle Motif , 2015, 2015 IEEE 12th Intl Conf on Ubiquitous Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom).

[5]  Feng Xia,et al.  MVCWalker: Random Walk-Based Most Valuable Collaborators Recommendation Exploiting Academic Factors , 2014, IEEE Transactions on Emerging Topics in Computing.

[6]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.

[7]  Jiaying Liu,et al.  VOPRec: Vector Representation Learning of Papers with Text Information and Structural Identity for Recommendation , 2021, IEEE Transactions on Emerging Topics in Computing.

[8]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[9]  Kevin Chen-Chuan Chang,et al.  A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications , 2017, IEEE Transactions on Knowledge and Data Engineering.

[10]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[11]  Jiang Guo,et al.  A General Framework for Content-enhanced Network Representation Learning , 2016, ArXiv.

[12]  Mahidhar Tatineni,et al.  Topic Modeling and Visualization for Big Data in Social Sciences , 2016, 2016 Intl IEEE Conferences on Ubiquitous Intelligence & Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People, and Smart World Congress (UIC/ATC/ScalCom/CBDCom/IoP/SmartWorld).

[13]  Jian Ma,et al.  A Weighted Topic Model Enhanced Approach for complementary Collaborator Recommendation , 2014, PACIS.

[14]  Mathias Niepert,et al.  Learning Graph Representations with Embedding Propagation , 2017, NIPS.

[15]  Chidchanok Lursinsap,et al.  Collaborator recommendation in interdisciplinary computer science using degrees of collaborative forces, temporal evolution of research interest, and comparative seniority status , 2015, Knowl. Based Syst..

[16]  Xing Zhou,et al.  Collaborator recommendation in heterogeneous bibliographic networks using random walks , 2017, Information Retrieval Journal.

[17]  Vikram Pudi,et al.  Paper2vec: Combining Graph and Text Information for Scientific Paper Representation , 2017, ECIR.

[18]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[19]  Jian Pei,et al.  A Survey on Network Embedding , 2017, IEEE Transactions on Knowledge and Data Engineering.

[20]  A. Barabasi,et al.  Quantifying the evolution of individual scientific impact , 2016, Science.

[21]  David M. Blei,et al.  Content-based recommendations with Poisson factorization , 2014, NIPS.

[22]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[23]  Xiao Huang,et al.  Label Informed Attributed Network Embedding , 2017, WSDM.

[24]  Feng Xia,et al.  Scientific collaboration patterns vary with scholars’ academic ages , 2017, Scientometrics.

[25]  Mansaf Alam,et al.  A survey on scholarly data: From big data perspective , 2017, Inf. Process. Manag..

[26]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[27]  Khalid Alfalqi,et al.  A Survey of Topic Modeling in Text Mining , 2015 .

[28]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[29]  Feng Xia,et al.  Big Scholarly Data: A Survey , 2017, IEEE Transactions on Big Data.

[30]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[31]  Amr M. Tolba,et al.  Exploiting Publication Contents and Collaboration Networks for Collaborator Recommendation , 2016, PloS one.

[32]  Palash Goyal,et al.  Graph Embedding Techniques, Applications, and Performance: A Survey , 2017, Knowl. Based Syst..

[33]  Chen Yang,et al.  Scientific Collaborator Recommendation in Heterogeneous Bibliographic Networks , 2015, 2015 48th Hawaii International Conference on System Sciences.

[34]  Joon-Oh Park,et al.  The Increasing Dominance of Teams in Production of Knowledge , 2011 .

[35]  Deli Zhao,et al.  Network Representation Learning with Rich Text Information , 2015, IJCAI.

[36]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[37]  Barry Bozeman,et al.  The Impact of Research Collaboration on Scientific Productivity , 2005 .

[38]  Zhiyuan Liu,et al.  CANE: Context-Aware Network Embedding for Relation Modeling , 2017, ACL.

[39]  Charu C. Aggarwal,et al.  Heterogeneous Network Embedding via Deep Architectures , 2015, KDD.

[40]  Jimeng Sun,et al.  Cross-domain collaboration recommendation , 2012, KDD.