Dual network embedding for representing research interests in the link prediction problem on co-authorship networks

We present a study of co-authorship network representation that combines network embedding with topic modeling of research papers and a new edge embedding operator. We use a link prediction (LP) model to build a recommender system for finding collaborators with similar research interests. After extracting topics for each paper, we construct a keyword co-occurrence network and use its embedding to further generalize author attributes. Standard graph feature engineering and network embedding methods are combined to construct a co-author recommender system, formulated both as an LP problem and as prediction of future graph structure. We evaluate our approach on a dataset containing temporal information on research articles from the National Research University Higher School of Economics over 25 years, indexed in the Russian Science Citation Index and Scopus. Our network representation model shows better performance on the stated binary classification tasks on several co-authorship networks.

Subjects: Artificial Intelligence, Data Mining and Machine Learning, Digital Libraries, Network Science and Online Social Networks, World Wide Web and Web Science
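The two building blocks named in the abstract, a keyword co-occurrence network and edge embeddings derived from node embeddings, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`cooccurrence_edges`, `edge_features`) are hypothetical, and the operators shown are the commonly used element-wise ones (Hadamard, average, weighted L1, weighted L2), not the new edge embedding operator proposed in the paper, which the abstract does not specify.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_edges(paper_keywords):
    """Count how often each unordered keyword pair appears in the same paper.

    paper_keywords: iterable of keyword lists, one list per paper (assumed input format).
    Returns a Counter mapping (keyword_a, keyword_b) with keyword_a < keyword_b to a weight.
    """
    counts = Counter()
    for keywords in paper_keywords:
        # Deduplicate and sort so each pair has one canonical orientation.
        for a, b in combinations(sorted(set(keywords)), 2):
            counts[(a, b)] += 1
    return counts


# Standard element-wise operators that turn two node vectors into one edge vector.
EDGE_OPERATORS = {
    "hadamard": lambda u, v: [x * y for x, y in zip(u, v)],
    "average": lambda u, v: [(x + y) / 2 for x, y in zip(u, v)],
    "weighted_l1": lambda u, v: [abs(x - y) for x, y in zip(u, v)],
    "weighted_l2": lambda u, v: [(x - y) ** 2 for x, y in zip(u, v)],
}


def edge_features(embeddings, pairs, operator="hadamard"):
    """Build one feature vector per candidate author pair.

    embeddings: dict mapping node id to a list of floats (e.g. a learned node embedding).
    pairs: iterable of (node_a, node_b) candidate links.
    """
    op = EDGE_OPERATORS[operator]
    return [op(embeddings[a], embeddings[b]) for a, b in pairs]
```

In a link prediction setup of this kind, the resulting edge vectors would typically be fed to a binary classifier, with existing co-authorships as positive examples and sampled non-edges as negative ones.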
