Ensemble Link Predictor for Heterogeneous Complex Networks

Link prediction in complex networks aims to explore similarities between node pairs. Recently, link prediction has been considered in the presence of network heterogeneity which makes the majority of the homogeneous link prediction approaches infeasible. A meta-structure, known as meta-path, has been proposed to explore such networks. Generating good meta-paths and selecting the best of them introduce some new challenges to link prediction problem. In this paper, a new ensemble-based link prediction approach is proposed in heterogeneous complex networks. This approach consists of three steps: (i) a set of meta-paths are selected such that each of them represents a different semantic between the target node pairs; (ii) a feature vector is extracted for each node pair using each meta-path; (iii) an ensemble of learners would be established on different feature sets. The final link predictor is obtained after the ensemble aggregation. The results on DBLP network show that the proposed approach has more accurate predictions than a single meta-path based link predictor.

[1]  Fangzhao Wu,et al.  Social Spammer and Spam Message Co-Detection in Microblogging with Social Context Regularization , 2015, CIKM.

[2]  Yuan-Hai Shao,et al.  Weighted linear loss twin support vector machine for large-scale classification , 2015, Knowl. Based Syst..

[3]  Yizhou Sun,et al.  Recommendation in heterogeneous information networks with implicit user feedback , 2013, RecSys.

[4]  Yizhou Sun,et al.  Mining heterogeneous information networks: a structural analysis approach , 2013, SKDD.

[5]  Reynold Cheng,et al.  Discovering Meta-Paths in Large Heterogeneous Information Networks , 2015, WWW.

[6]  Ciprian Dobre,et al.  SPRINT-SELF: Social-Based Routing and Selfish Node Detection in Opportunistic Networks , 2015, Mob. Inf. Syst..

[7]  Jiawei Luo,et al.  Prioritization of potential candidate disease genes by topological similarity of protein-protein interaction network and phenotype data , 2015, J. Biomed. Informatics.

[8]  Philip S. Yu,et al.  Meta path-based collective classification in heterogeneous information networks , 2012, CIKM.

[9]  Philip S. Yu,et al.  Meta-path based multi-network collective link prediction , 2014, KDD.

[10]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[11]  Charu C. Aggarwal,et al.  Co-author Relationship Prediction in Heterogeneous Bibliographic Networks , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[12]  Jiawei Han,et al.  Citation Prediction in Heterogeneous Bibliographic Networks , 2012, SDM.

[13]  Reshma Khemchandani,et al.  Twin Support Vector Machines for Pattern Classification , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  José Fernando Rodrigues,et al.  Supervised-learning link recommendation in the DBLP co-authoring network , 2014, 2014 IEEE International Conference on Pervasive Computing and Communication Workshops (PERCOM WORKSHOPS).

[15]  Michael Ley,et al.  DBLP - Some Lessons Learned , 2009, Proc. VLDB Endow..

[16]  Qi Yu,et al.  Predicting Co-Author Relationship in Medical Co-Authorship Networks , 2014, PloS one.

[17]  Philip S. Yu,et al.  Collective Prediction of Multiple Types of Links in Heterogeneous Information Networks , 2014, 2014 IEEE International Conference on Data Mining.

[18]  Peng Wang,et al.  Link prediction in social networks: the state-of-the-art , 2014, Science China Information Sciences.

[19]  Yizhou Sun,et al.  Meta-Path-Based Search and Mining in Heterogeneous Information Networks , 2013 .

[20]  Philip S. Yu,et al.  PathSim , 2011, Proc. VLDB Endow..

[21]  Reda Alhajj,et al.  Time frame based link prediction in directed citation networks , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[22]  Philip S. Yu,et al.  Relevance search in heterogeneous networks , 2012, EDBT '12.

[23]  Ni Lao,et al.  Relational retrieval using a combination of path-constrained random walks , 2010, Machine Learning.

[24]  Hyun-Chul Kim,et al.  Constructing support vector machine ensemble , 2003, Pattern Recognit..