Relational Learning with Social Status Analysis

Relational learning has been proposed to cope with the interdependency among linked instances in social network analysis, which often adopts network connectivity and social media content for prediction. A common assumption in existing relational learning methods is that data instances are equally important. The algorithms developed based on the assumption may be significantly affected by outlier data and thus less robust. In the meantime, it has been well established in social sciences that actors are naturally of different social status in a social network. Motivated by findings from social sciences, in this paper, we investigate whether social status analysis could facilitate relational learning. Particularly, we propose a novel framework RESA to model social status using the network structure. It extracts robust and intrinsic latent social dimensions for social actors, which are further exploited as features for supervised learning models. The proposed method is applicable for real-world relational learning problems where noise exists. Extensive experiments are conducted on datasets obtained from real-world social media platforms. Empirical results demonstrate the effectiveness of RESA and further experiments are conducted to help understand the effects of parameter settings to the proposed model and how local social status works.

[1]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[2]  Foster J. Provost,et al.  Classification in Networked Data: a Toolkit and a Univariate Case Study , 2007, J. Mach. Learn. Res..

[3]  Jonathan Currie,et al.  Opti: Lowering the Barrier Between Open Source Optimizers and the Industrial MATLAB User , 2012 .

[4]  Lise Getoor,et al.  Link-Based Classification , 2003, Encyclopedia of Machine Learning and Data Mining.

[5]  Reza Zafarani,et al.  Sarcasm Detection on Twitter: A Behavioral Modeling Approach , 2015, WSDM.

[6]  Thomas S. Huang,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation. , 2011, IEEE transactions on pattern analysis and machine intelligence.

[7]  Xuan Li,et al.  Robust Nonnegative Matrix Factorization via Half-Quadratic Minimization , 2012, 2012 IEEE 12th International Conference on Data Mining.

[8]  Krishna P. Gummadi,et al.  Understanding and combating link farming in the twitter social network , 2012, WWW.

[9]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[10]  Albert-László Barabási,et al.  Scale-Free Networks: A Decade and Beyond , 2009, Science.

[11]  Ben Taskar,et al.  Introduction to Statistical Relational Learning (Adaptive Computation and Machine Learning) , 2007 .

[12]  Xia Wang,et al.  Who Will Follow Your Shop? Exploiting Multiple Information Sources in Finding Followers , 2013, DASFAA.

[13]  Reza Zafarani,et al.  Social Media Mining: An Introduction , 2014 .

[14]  Jimeng Sun,et al.  Social influence analysis in large-scale networks , 2009, KDD.

[15]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[16]  P. Bühlmann,et al.  The group lasso for logistic regression , 2008 .

[17]  M. Hogg,et al.  Social Identity and Self-Categorization Processes in Organizational Contexts , 2000 .

[18]  Guandong Xu,et al.  Social Media Mining and Social Network Analysis: Emerging Research , 2013 .

[19]  Liang Du,et al.  Heterogeneous Metric Learning with Content-Based Regularization for Software Artifact Retrieval , 2014, 2014 IEEE International Conference on Data Mining.

[20]  Feiping Nie,et al.  Early Active Learning via Robust Representation and Structured Sparsity , 2013, IJCAI.

[21]  Weifeng Liu,et al.  Correntropy: Properties and Applications in Non-Gaussian Signal Processing , 2007, IEEE Transactions on Signal Processing.

[22]  Xiaojun Wu,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[24]  Asuman E. Ozdaglar,et al.  Spread of (Mis)Information in Social Networks , 2009, Games Econ. Behav..

[25]  Jennifer Neville,et al.  An analysis of how ensembles of collective classifiers improve predictions in graphs , 2012, CIKM '12.

[26]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Jennifer Neville,et al.  Why collective inference improves relational classification , 2004, KDD.

[28]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[29]  Foster Provost,et al.  A Simple Relational Classifier , 2003 .

[30]  Huan Liu,et al.  Discovering Overlapping Groups in Social Media , 2010, 2010 IEEE International Conference on Data Mining.

[31]  Ben Taskar,et al.  Introduction to statistical relational learning , 2007 .

[32]  Huan Liu,et al.  ActNeT: Active Learning for Networked Texts in Microblogging , 2013, SDM.

[33]  Liang Du,et al.  Unsupervised Feature Selection with Adaptive Structure Learning , 2015, KDD.

[34]  Volker Tresp,et al.  Nonparametric Relational Learning for Social Network Analysis , 2008 .

[35]  Huan Liu,et al.  Unsupervised feature selection for linked social media data , 2012, KDD.

[36]  Feiping Nie,et al.  Exclusive Feature Learning on Arbitrary Structures via \ell_{1, 2}-norm , 2014, NIPS.

[37]  Nan Lin,et al.  SOCIAL NETWORKS AND STATUS ATTAINMENT , 1999 .

[38]  Jieping Ye,et al.  Moreau-Yosida Regularization for Grouped Tree Structure Learning , 2010, NIPS.

[39]  R. Tibshirani,et al.  Sparsity and smoothness via the fused lasso , 2005 .

[40]  Huan Liu,et al.  Relational learning via latent social dimensions , 2009, KDD.