On fast and scalable recurring link’s prediction in evolving multi-graph streams

Abstract The link prediction task has found numerous applications in real-world scenarios. However, in most of the cases like interactions, purchases, mobility, etc., links can re-occur again and again across time. As a result, the data being generated is excessively large to handle, associated with the complexity and sparsity of networks. Therefore, we propose a very fast, memory-less, and dynamic sampling-based method for predicting recurring links for a successive future point in time. This method works by biasing the links exponentially based on their time of occurrence, frequency, and stability. To evaluate the efficiency of our method, we carried out rigorous experiments with massive real-world graph streams. Our empirical results show that the proposed method outperforms the state-of-the-art method for recurring links prediction. Additionally, we also empirically analyzed the evolution of links with the perspective of multi-graph topology and their recurrence probability over time.

[1]  João Gama,et al.  Biased Dynamic Sampling for Temporal Network Streams , 2018, COMPLEX NETWORKS.

[2]  Nazar Zaki,et al.  Link Prediction in Dynamic Social Networks: A Literature Review , 2018, 2018 IEEE 5th International Congress on Information Science and Technology (CiSt).

[3]  Luís Torgo,et al.  Evaluation Procedures for Forecasting with Spatio-Temporal Data , 2018, ECML/PKDD.

[4]  João Gama,et al.  Processing Evolving Social Networks for Change Detection Based on Centrality Measures , 2018, Studies in Big Data.

[5]  Mohammad Reza Meybodi,et al.  Link prediction in weighted social networks using learning automata , 2018, Eng. Appl. Artif. Intell..

[6]  Yongsub Lim,et al.  Memory-Efficient and Accurate Sampling for Counting Local Triangles in Graph Streams , 2018, ACM Trans. Knowl. Discov. Data.

[7]  Mykola Pechenizkiy,et al.  Clustering-Structure Representative Sampling from Graph Streams , 2017, COMPLEX NETWORKS.

[8]  Myra Spiliopoulou,et al.  Forgetting techniques for stream-based matrix factorization in recommender systems , 2017, Knowledge and Information Systems.

[9]  Ryan A. Rossi,et al.  On Sampling from Massive Graph Streams , 2017, Proc. VLDB Endow..

[10]  Junming Yin,et al.  Scalable Temporal Latent Space Inference for Link Prediction in Dynamic Social Networks , 2017, IEEE Transactions on Knowledge and Data Engineering.

[11]  Charu C. Aggarwal,et al.  Link prediction in graph streams , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[12]  João Gama,et al.  Sampling massive streaming call graphs , 2016, SAC.

[13]  Chengqi Zhang,et al.  Graph Ensemble Boosting for Imbalanced Noisy Graph Stream Classification , 2015, IEEE Transactions on Cybernetics.

[14]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[15]  Hung-Hsuan Chen,et al.  The predictive value of young and old links in a social network , 2013, DBSocial '13.

[16]  João Gama,et al.  On evaluating stream learning algorithms , 2012, Machine Learning.

[17]  Alexis Papadimitriou,et al.  Fast and accurate link prediction in social networking systems , 2012, J. Syst. Softw..

[18]  Purnamrita Sarkar,et al.  Nonparametric Link Prediction in Dynamic Networks , 2012, ICML.

[19]  Giulio Rossetti,et al.  Scalable Link Prediction on Multidimensional Networks , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[20]  Dino Pedreschi,et al.  Human mobility, social ties, and link prediction , 2011, KDD.

[21]  Mohammad Al Hasan,et al.  A Survey of Link Prediction in Social Networks , 2011, Social Network Data Analytics.

[22]  Hisashi Kashima,et al.  Fast and Scalable Algorithms for Semi-supervised Link Prediction on Static and Dynamic Graphs , 2010, ECML/PKDD.

[23]  Céline Rouveirol,et al.  Supervised Machine Learning Applied to Link Prediction in Bipartite Social Networks , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[24]  Yin Zhang,et al.  Scalable proximity estimation and link prediction in online social networks , 2009, IMC '09.

[25]  Srikanta J. Bedathur,et al.  Towards time-aware link prediction in evolving social networks , 2009, SNA-KDD '09.

[26]  Srinivasan Parthasarathy,et al.  Local Probabilistic Models for Link Prediction , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[27]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[28]  Mohammad Al Hasan,et al.  Link prediction using supervised learning , 2006 .

[29]  Padhraic Smyth,et al.  Prediction and ranking algorithms for event-based network data , 2005, SKDD.

[30]  Divyakant Agrawal,et al.  Efficient Computation of Frequent and Top-k Elements in Data Streams , 2005, ICDT.

[31]  A. Winsor Sampling techniques. , 2000, Nursing times.

[32]  Jeffrey Scott Vitter,et al.  Random sampling with a reservoir , 1985, TOMS.