Link prediction in co-authorship networks based on hybrid content similarity metric

Link prediction in online social networks aims to identify new interactions among members that are likely to occur in the future. The co-authorship network has been one of the main targets of link prediction research so far. Researchers have focused on analyzing and proposing solutions that give efficient recommendations of authors who could work together on a scientific project. To predict links between two authors in a co-authorship network precisely, it is preferable to design a similarity metric between them and then use it to determine the most likely co-author(s). However, related work has not integrated the content of the papers into the metric itself. This matters when considering collaboration between scientists, since authors with the same research interests are more likely to write a joint paper than authors working in different areas. In this paper, we propose a new content-similarity metric for link prediction in the co-authorship network, named LDAcosin. We present mathematical notions of link prediction in the co-authorship network and a link prediction algorithm based on topic modeling. The new metric is experimentally validated on a public bibliographic collection.
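
The abstract does not give the exact definition of LDAcosin, so the following is only a minimal sketch of the general idea it describes: represent each author by a topic distribution and score candidate co-authors by cosine similarity. The topic vectors, the author labels, and the helper names `cosine_similarity` and `rank_candidates` are all hypothetical; in practice the vectors would come from a topic model such as LDA fitted on the authors' papers.

```python
import math


def cosine_similarity(p, q):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm_p = math.sqrt(sum(a * a for a in p))
    norm_q = math.sqrt(sum(b * b for b in q))
    if norm_p == 0 or norm_q == 0:
        return 0.0  # convention: a zero vector is similar to nothing
    return dot / (norm_p * norm_q)


def rank_candidates(author, topics, coauthors):
    """Rank authors who are not yet co-authors of `author` by topic similarity.

    topics:    dict mapping author -> topic distribution (list of floats)
    coauthors: set of frozensets, each holding one existing co-author pair
    """
    scores = []
    for other, vec in topics.items():
        if other == author or frozenset((author, other)) in coauthors:
            continue  # skip self and links that already exist
        scores.append((other, cosine_similarity(topics[author], vec)))
    # Highest similarity first: these are the most likely new links.
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical per-author topic distributions (assumed, not from the paper).
topics = {
    "A": [0.7, 0.2, 0.1],
    "B": [0.6, 0.3, 0.1],
    "C": [0.1, 0.1, 0.8],
    "D": [0.2, 0.7, 0.1],
}
# Existing co-authorship links; A and D have already published together.
coauthors = {frozenset(("A", "D"))}

ranking = rank_candidates("A", topics, coauthors)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

With these assumed inputs, author B is ranked above author C for author A because B's topic mix is much closer to A's, while D is excluded because that link already exists.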
