Mutual information model for link prediction in heterogeneous complex networks

Recently, a number of meta-path based similarity indices, such as PathSim, HeteSim, and random walk, have been proposed for link prediction in heterogeneous complex networks. However, these indices suffer from two major drawbacks. First, they depend primarily on the connectivity degrees of node pairs and do not consider the further information provided by the given meta-path. Second, most of them require a single, usually symmetric, meta-path to be chosen in advance, so employing a set of different meta-paths is not straightforward. To tackle these problems, we propose a mutual information model for link prediction in heterogeneous complex networks. The proposed model, called the Meta-path based Mutual Information Index (MMI), introduces meta-path based link entropy to estimate the link likelihood and can be applied to a set of available meta-paths. This estimation measures the amount of information carried through the paths rather than the amount of connectivity between the node pairs. Experimental results on a bibliographic network show that MMI obtains high prediction accuracy compared with other popular similarity indices.
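
To make the first drawback concrete, the connectivity-based scoring that such indices rely on can be sketched with PathSim under the symmetric meta-path Author-Paper-Author. This is a minimal illustration of the baseline only, not of MMI; the toy authorship data and the function names `apa_path_count` and `pathsim` are hypothetical and do not come from the paper.

```python
# Hypothetical toy data: author -> set of papers written (Author-Paper relation).
AUTHOR_PAPERS = {
    "alice": {"p1", "p2", "p3"},
    "bob": {"p1", "p2"},
    "carol": {"p3", "p4"},
}


def apa_path_count(x, y, author_papers=AUTHOR_PAPERS):
    """Count Author-Paper-Author path instances between authors x and y.

    With unweighted authorship links, this is the number of shared papers.
    """
    return len(author_papers[x] & author_papers[y])


def pathsim(x, y, author_papers=AUTHOR_PAPERS):
    """PathSim score: 2 * paths(x, y) / (paths(x, x) + paths(y, y))."""
    denom = apa_path_count(x, x, author_papers) + apa_path_count(y, y, author_papers)
    if denom == 0:
        return 0.0  # neither author has any paper
    return 2 * apa_path_count(x, y, author_papers) / denom


if __name__ == "__main__":
    for pair in [("alice", "bob"), ("alice", "carol"), ("bob", "carol")]:
        print(pair, pathsim(*pair))
```

The score is determined entirely by path counts along one fixed, symmetric meta-path, which is the dependence on connectivity that the abstract identifies as a limitation.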
