Bi-directional Joint Inference for User Links and Attributes on Large Social Graphs

Users on social networks primarily do two things, connect to existing or new friends and exchange information. Recently, social media apps have become the primary information source for users to consume news stories. Users are inundated with a flood of information from social media networks shared by their friends and other content providers. For social media networks, it is important for us to study users' friendships and interests in order to best serve their need. However, given the nature of online applications, information on both sides is highly incomplete and noisy. Inspired by the well-known homophily phenomenon which found the two properties strongly interleaving, we propose to jointly learn them by leveraging their data redundancy and mutual reinforcement. Specifically, we exploit homophily by iteratively addressing smoothness on the graph in two directions, ie, from closeness to similarity (stronger links lead to more similar attributes), and vice versa. The two processes are done in a unified probabilistic framework through label propagation and graph construction. The refined user links and attributes are immediately useful for various tasks including link recommendation and content targeting on social networks.

[1]  Paulo Shakarian,et al.  Early Identification of Violent Criminal Gang Members , 2015, KDD.

[2]  Jure Leskovec,et al.  Learning to Discover Social Circles in Ego Networks , 2012, NIPS.

[3]  W. Reed The Pareto, Zipf and other power laws , 2001 .

[4]  Chun Chen,et al.  Efficient manifold ranking for image retrieval , 2011, SIGIR.

[5]  Din J. Wasem,et al.  Mining of Massive Datasets , 2014 .

[6]  Huan Liu,et al.  Relational learning via latent social dimensions , 2009, KDD.

[7]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[8]  Jiawei Han,et al.  A Unified Framework for Link Recommendation Using Random Walks , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[9]  Srinivasan Parthasarathy,et al.  Efficient community detection in large networks using content and links , 2012, WWW.

[10]  Shenghuo Zhu,et al.  Learning multiple graphs for document recommendations , 2008, WWW.

[11]  Deepayan Chakrabarti,et al.  Joint Inference of Multiple Label Types in Large Networks , 2014, ICML.

[12]  Wei Liu,et al.  Large Graph Construction for Scalable Semi-Supervised Learning , 2010, ICML.

[13]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[14]  Foster Provost,et al.  A Simple Relational Classifier , 2003 .

[15]  Jieping Ye,et al.  Geodesic Distance Function Learning via Heat Flow on Vector Fields , 2014, ICML.

[16]  Anand Rajaraman,et al.  Mining of Massive Datasets , 2011 .

[17]  Bernhard Schölkopf,et al.  Uncovering the Temporal Dynamics of Diffusion Networks , 2011, ICML.

[18]  Nitesh V. Chawla,et al.  New perspectives and methods in link prediction , 2010, KDD.

[19]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[20]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[21]  Qiaozhu Mei,et al.  PTE: Predictive Text Embedding through Large-scale Heterogeneous Text Networks , 2015, KDD.

[22]  Zoubin Ghahramani,et al.  Learning from labeled and unlabeled data with label propagation , 2002 .

[23]  Bernhard Schölkopf,et al.  Ranking on Data Manifolds , 2003, NIPS.

[24]  Jure Leskovec,et al.  Supervised random walks: predicting and recommending links in social networks , 2010, WSDM '11.

[25]  Kevin Chen-Chuan Chang,et al.  Graph-based Semi-supervised Learning: Realizing Pointwise Smoothness Probabilistically , 2014, ICML.

[26]  Kevin Chen-Chuan Chang,et al.  User profiling in an ego network: co-profiling attributes and relationships , 2014, WWW.

[27]  Ling Huang,et al.  Joint Link Prediction and Attribute Inference Using a Social-Attribute Network , 2014, TIST.

[28]  Shih-Fu Chang,et al.  Graph construction and b-matching for semi-supervised learning , 2009, ICML '09.

[29]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[30]  Nicola Barbieri,et al.  Who to follow and why: link prediction with explanations , 2014, KDD.

[31]  Olivia R. Liu Sheng,et al.  A Survey of Link Recommendation for Social Networks , 2015, ACM Trans. Manag. Inf. Syst..

[32]  Wotao Yin,et al.  A Block Coordinate Descent Method for Regularized Multiconvex Optimization with Applications to Nonnegative Tensor Factorization and Completion , 2013, SIAM J. Imaging Sci..

[33]  Alexander J. Smola,et al.  Like like alike: joint friendship and interest propagation in social networks , 2011, WWW.

[34]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[35]  References , 1971 .

[36]  Geoffrey E. Hinton,et al.  Exponential Family Harmoniums with an Application to Information Retrieval , 2004, NIPS.