Paper information - User behavior learning and transfer in composite social networks

Accurate prediction of user behaviors is important for many social media applications, including social marketing, personalization, and recommendation. A major challenge is that, although many previous works model user behavior from historical behavior logs alone, the available user behavior data or interactions between users and items in a given social network are usually very limited and sparse (e.g., ⩾ 99.9% empty), which makes models overfit the rare observations and fail to provide accurate predictions. We observe that many people are members of several social networks at the same time, such as Facebook, Twitter, and Tencent’s QQ. Importantly, users’ behaviors and interests in different networks influence one another. This provides an opportunity to leverage knowledge of user behaviors across networks by treating the overlapping users as bridges, in order to alleviate the data sparsity problem and enhance the predictive performance of user behavior modeling. Combining different networks “simply and naively” does not work well. In this article, we formulate the problem of modeling multiple networks as “adaptive composite transfer” and propose a framework called ComSoc. ComSoc first selects the most suitable networks inside a composite social network via a hierarchical Bayesian model, parameterized for individual users. It then builds topic models for user behavior prediction using both the relationships in the selected networks and related behavior data. By varying the relational regularization, we obtain different implementations, corresponding to different ways to transfer knowledge from composite social relations. To handle big data, we have implemented the algorithm using Map/Reduce. We demonstrate that the proposed composite network-based user behavior models significantly improve the predictive accuracy over a number of existing approaches on several real-world applications, including a very large social networking dataset from Tencent Inc.
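To make two ideas from the abstract concrete, the following is a minimal sketch under stated assumptions, not the ComSoc implementation. It shows how the sparsity of a user-item interaction log can be measured, and how a generic relational regularizer can penalize differences between the topic vectors of connected users, with a per-user weight for each network. The function names, the squared-distance penalty, and the fixed weights are all hypothetical; in the paper the per-user network selection comes from a hierarchical Bayesian model, which is not reproduced here.

```python
from typing import Dict, List, Set, Tuple


def sparsity(n_users: int, n_items: int, observed: Set[Tuple[str, str]]) -> float:
    """Fraction of user-item cells that have no observed interaction."""
    return 1.0 - len(observed) / (n_users * n_items)


def relational_penalty(
    theta: Dict[str, List[float]],
    networks: Dict[str, List[Tuple[str, str]]],
    weights: Dict[str, Dict[str, float]],
) -> float:
    """Weighted sum of squared distances between linked users' topic vectors.

    theta[u]          -- topic proportions of user u (assumed given)
    networks[name]    -- list of (u, v) edges in the network called `name`
    weights[u][name]  -- how much user u relies on that network (assumed
                         given here; hypothetical stand-in for learned values)
    """
    total = 0.0
    for name, edges in networks.items():
        for u, v in edges:
            sq_dist = sum((a - b) ** 2 for a, b in zip(theta[u], theta[v]))
            total += weights[u][name] * sq_dist
    return total


# Toy data: alice is close to bob in topic space and far from carol.
theta = {"alice": [0.9, 0.1], "bob": [0.8, 0.2], "carol": [0.1, 0.9]}
networks = {
    "friend": [("alice", "bob")],
    "follow": [("alice", "carol")],
}
# alice trusts only the "friend" network, so the "follow" edge is ignored.
selective = {"alice": {"friend": 1.0, "follow": 0.0}}
# Naive combination: every network counts equally.
naive = {"alice": {"friend": 1.0, "follow": 1.0}}

penalty_selective = relational_penalty(theta, networks, selective)
penalty_naive = relational_penalty(theta, networks, naive)
```

In this toy setting the naive combination incurs a much larger penalty than the selective one, because it forces alice toward a neighbor whose interests differ from hers. This is one way to read the abstract's remark that combining networks "simply and naively" does not work well.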
