Learning user distance from multiple social networks

In this paper, we propose an adaptive user distance measurement model to address the challenging problem of modeling user distance from multiple social networks. Previous works construct user distance model in a single social network, and dataset easily leads to over-fitting of the models due to the data sparseness of a single sparse network. We observe that people often simultaneously appear in multiple social networks, because different social networks (e.g., Facebook, LinkedIn, QQ, Douban, etc.) can provide complementary services. Thus, the knowledge from different social networks can help overcome the problem of data sparseness. However, knowledge cannot be directly obtained due to that it is from different social networks. Aiming to solve this problem, we construct an adaptive model to measure user distance from multiple social networks by employing the metric learning and the boosting technology. The basic idea of our model is to embed multiple networks into a potential feature space while retaining the topology of social networks. In the procedure of boosting, the negative effects caused by network differences and useless information can be avoided. To get the solution of our model, we formulate it as a convex optimization problem. Besides, we propose an Adaptive User Distance Measurement (AUDM) algorithm whose time complexity is linear to the number of the links. Finally, we verify the feasibility and effectiveness of AUDM on the problem of link prediction. Experiments on a real large-scale dataset show that AUDM outperforms the state-of-the-art algorithm.

[1]  Anton van den Hengel,et al.  Pedestrian Detection with Spatially Pooled Features and Structured Ensemble Learning , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Zhuowen Tu,et al.  Similarity network fusion for aggregating data types on a genomic scale , 2014, Nature Methods.

[3]  Massimiliano Pontil,et al.  Sparse coding for multitask and transfer learning , 2012, ICML.

[4]  Dechang Pi,et al.  Ensemble kernel method: SVM classification based on game theory , 2016 .

[5]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[6]  Ryosuke Saga,et al.  Text Comparison Visualization from Two Information Sources by Network Merging and Integration , 2014, KICSS.

[7]  Aníbal R. Figueiras-Vidal,et al.  Laplace Approximation for Divisive Gaussian Processes for Nonstationary Regression , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Wei Gao,et al.  Detecting Rumors from Microblogs with Recurrent Neural Networks , 2016, IJCAI.

[9]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[10]  Chengqi Zhang,et al.  Self-adaptive attribute weighting for Naive Bayes classification , 2015, Expert Syst. Appl..

[11]  Noriyoshi Sukegawa,et al.  Maximizing Barber’s bipartite modularity is also hard , 2013, Optimization Letters.

[12]  Qiong Cao,et al.  Generalization bounds for metric and similarity learning , 2012, Machine Learning.

[13]  A. Enders,et al.  The long tail of social networking.: Revenue models of social networking sites , 2008 .

[14]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[15]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[16]  Arif Mahmood,et al.  Constrained Metric Learning by Permutation Inducing Isometries , 2016, IEEE Transactions on Image Processing.

[17]  Jieping Ye,et al.  Rank-One Matrix Pursuit for Matrix Completion , 2014, ICML.

[18]  Stephen Tyree,et al.  Non-linear Metric Learning , 2012, NIPS.

[19]  Gunnar Rätsch,et al.  Soft Margins for AdaBoost , 2001, Machine Learning.

[20]  Rongrong Ji,et al.  Low-Rank Similarity Metric Learning in High Dimensions , 2015, AAAI.

[21]  Srinivasan Parthasarathy,et al.  Dynamic inference of social roles in information cascades , 2015, Data Mining and Knowledge Discovery.

[22]  Jiwen Lu,et al.  Nonlinear metric learning for visual tracking , 2016, 2016 IEEE International Conference on Multimedia and Expo (ICME).

[23]  Xiao Yu,et al.  Discovering Structure in the Universe of Attribute Names , 2016, WWW.

[24]  Bert Huang,et al.  Learning a Distance Metric from a Network , 2011, NIPS.

[25]  Chengqi Zhang,et al.  Multi-graph-view Learning for Graph Classification , 2014, 2014 IEEE International Conference on Data Mining.

[26]  Nitish Srivastava,et al.  Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..

[27]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[28]  Nikos Paragios,et al.  Data fusion through cross-modality metric learning using similarity-sensitive hashing , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[29]  Qiang Yang,et al.  Boosting for transfer learning , 2007, ICML '07.

[30]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[31]  Min Tan,et al.  Robust object recognition via weakly supervised metric and template learning , 2016, Neurocomputing.

[32]  Xiaojin Zhu,et al.  Using Machine Teaching to Identify Optimal Training-Set Attacks on Machine Learners , 2015, AAAI.

[33]  William Nick Street,et al.  Ensemble Pruning Via Semi-definite Programming , 2006, J. Mach. Learn. Res..

[34]  Brian Kulis,et al.  Metric Learning: A Survey , 2013, Found. Trends Mach. Learn..

[35]  Jia Wu,et al.  Artificial immune system for attribute weighted Naive Bayes classification , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[36]  Elad Hazan,et al.  Introduction to Online Convex Optimization , 2016, Found. Trends Optim..

[37]  Kiyoung Choi,et al.  A design framework for hierarchical ensemble of multiple feature extractors and multiple classifiers , 2016, Pattern Recognit..

[38]  Nagehan Ilhan,et al.  Feature identification for predicting community evolution in dynamic social networks , 2016, Eng. Appl. Artif. Intell..

[39]  Kar-Ann Toh,et al.  Benchmarking a reduced multivariate polynomial pattern classifier , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.