Infer User Interests via Link Structure Regularization

Learning user interests from online social networks helps to better understand user behaviors and provides useful guidance to design user-centric applications. Apart from analyzing users' online content, it is also important to consider users' social connections in the social Web. Graph regularization methods have been widely used in various text mining tasks, which can leverage the graph structure information extracted from data. Previously, graph regularization methods operate under the cluster assumption that nearby nodes are more similar and nodes on the same structure (typically referred to as a cluster or a manifold) are likely to be similar. We argue that learning user interests from complex, sparse, and dynamic social networks should be based on the link structure assumption under which node similarities are evaluated based on the local link structures instead of explicit links between two nodes. We propose a regularization framework based on the relation bipartite graph, which can be constructed from any type of relations. Using Twitter as our case study, we evaluate our proposed framework from social networks built from retweet relations. Both quantitative and qualitative experiments show that our proposed method outperforms a few competitive baselines in learning user interests over a set of predefined topics. It also gives superior results compared to the baselines on retweet prediction and topical authority identification.

[1]  Raleigh North Haewoon, Kwak, Changhyun, Lee, Park, Hosung, and Moon, Sue. . What is Twitter, a Social Network or a News Media?. 19th International World Wide Web (WWW) Conference.April. , 2010 .

[2]  Rui Li,et al.  Exploring social tagging graph for web object classification , 2009, KDD.

[3]  Hai Yang,et al.  ACM Transactions on Intelligent Systems and Technology - Special Section on Urban Computing , 2014 .

[4]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[5]  Alexander J. Smola,et al.  Discovering geographical topics in the twitter stream , 2012, WWW.

[6]  Jie Tang,et al.  Who will follow you back?: reciprocal relationship prediction , 2011, CIKM '11.

[7]  Ciro Cattuto,et al.  Dynamical classes of collective attention in twitter , 2011, WWW.

[8]  Jun Zhu,et al.  User grouping behavior in online forums , 2009, KDD.

[9]  Ravi Kumar,et al.  Influence and correlation in social networks , 2008, KDD.

[10]  Thomas Hofmann,et al.  Semi-supervised Learning on Directed Graphs , 2004, NIPS.

[11]  Michael R. Lyu,et al.  Learning to recommend with social trust ensemble , 2009, SIGIR.

[12]  Brian D. Davison,et al.  Empirical study of topic modeling in Twitter , 2010, SOMA '10.

[13]  Arnold Neumaier,et al.  Solving Ill-Conditioned and Singular Linear Systems: A Tutorial on Regularization , 1998, SIAM Rev..

[14]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[15]  Jimeng Sun,et al.  Social influence analysis in large-scale networks , 2009, KDD.

[16]  Lei Yang,et al.  We know what @you #tag: does the dual role affect hashtag adoption? , 2012, WWW.

[17]  Jianyong Wang,et al.  Retweet or not?: personalized tweet re-ranking , 2013, WSDM.

[18]  Jon Kleinberg,et al.  Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter , 2011, WWW.

[19]  HeYulan,et al.  Infer User Interests via Link Structure Regularization , 2014 .

[20]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Chun Chen,et al.  Personalized tag recommendation using graph-based ranking on multi-type interrelated objects , 2009, SIGIR.

[22]  Hongfei Yan,et al.  Comparing Twitter and Traditional Media Using Topic Models , 2011, ECIR.

[23]  Xiao Li,et al.  Learning query intent from regularized click graphs , 2008, SIGIR '08.

[24]  Brian D. Davison,et al.  Predicting popular messages in Twitter , 2011, WWW.

[25]  Duncan J. Watts,et al.  Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[26]  Junghoo Cho,et al.  Topical semantics of twitter links , 2011, WSDM '11.

[27]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[28]  Rajat Raina,et al.  Learning relevance from heterogeneous social network and its application in online targeting , 2011, SIGIR.

[29]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[30]  Bernhard Schölkopf,et al.  Learning from labeled and unlabeled data on a directed graph , 2005, ICML.

[31]  Duncan J. Watts,et al.  Who says what to whom on twitter , 2011, WWW.

[32]  Brian D. Davison,et al.  Temporal Dynamics of User Interests in Tagging Systems , 2011, AAAI.

[33]  Chao Liu,et al.  Recommender systems with social regularization , 2011, WSDM '11.

[34]  Michael R. Lyu,et al.  Learning to recommend with trust and distrust relationships , 2009, RecSys '09.

[35]  Philip S. Yu,et al.  Mining Knowledge from Data: An Information Network Analysis Approach , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[36]  Jiawei Han,et al.  Learning search tasks in queries and web pages via graph regularization , 2011, SIGIR '11.

[37]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[38]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[39]  Brian D. Davison,et al.  A probabilistic model for personalized tag prediction , 2010, KDD.

[40]  Alexander J. Smola,et al.  Scalable distributed inference of dynamic user interests for behavioral targeting , 2011, KDD.

[41]  Songqing Chen,et al.  Analyzing patterns of user content generation in online social networks , 2009, KDD.

[42]  Deng Cai,et al.  Topic modeling with network regularization , 2008, WWW.

[43]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[44]  Thomas L. Griffiths,et al.  Probabilistic author-topic models for information discovery , 2004, KDD.