Toward Relational Learning with Misinformation

Relational learning has been proposed to cope with the interdependency among linked instances in a network, and it is a fundamental tool to categorize social network users for various tasks. However, the emerging widespread of misinformation in social networks, information that is inaccurate or false, poses novel challenges to utilizing social media data. Malicious users may actively manipulate their content and characteristics, which easily lead to a noisy dataset. Hence, it is intricate for traditional relational learning approaches to deliver an accurate predictive model in the presence of misinformation. In this work, we precisely focus on the problem by proposing a joint framework that simultaneously constructs a relational learning model and mitigates the effect of misinformation by restraining anomalous points. Empirical results on real-world social media data prove the superiority of the proposed approach, Relational Learning with Misinformation (RLM), over traditional approaches on modeling social network users.

[1]  M. Hogg Social Identity Theory , 2016 .

[2]  Huan Liu,et al.  Toward Personalized Relational Learning , 2017, SDM.

[3]  Xuan Li,et al.  Robust Nonnegative Matrix Factorization via Half-Quadratic Minimization , 2012, 2012 IEEE 12th International Conference on Data Mining.

[4]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[5]  Claude Lemaréchal,et al.  Practical Aspects of the Moreau-Yosida Regularization: Theoretical Preliminaries , 1997, SIAM J. Optim..

[6]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[7]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[8]  Hyun Ah Song,et al.  FRAUDAR: Bounding Graph Fraud in the Face of Camouflage , 2016, KDD.

[9]  C. O’Brien Statistical Learning with Sparsity: The Lasso and Generalizations , 2016 .

[10]  Huan Liu,et al.  Relational learning via latent social dimensions , 2009, KDD.

[11]  Huan Liu,et al.  Relational Learning with Social Status Analysis , 2016, WSDM.

[12]  Jennifer Neville,et al.  Why collective inference improves relational classification , 2004, KDD.

[13]  P. Bühlmann,et al.  The group lasso for logistic regression , 2008 .

[14]  Huan Liu,et al.  Detecting Camouflaged Content Polluters , 2017, ICWSM.

[15]  Jennifer Neville,et al.  An analysis of how ensembles of collective classifiers improve predictions in graphs , 2012, CIKM '12.

[16]  Deli Zhao,et al.  Network Representation Learning with Rich Text Information , 2015, IJCAI.

[17]  David Carmel,et al.  Social media recommendation based on people and tags , 2010, SIGIR.

[18]  Catherine Tucker,et al.  Social Networks, Personalized Advertising, and Privacy Controls , 2013 .

[19]  Huan Liu,et al.  Adaptive Spammer Detection with Sparse Group Modeling , 2017, ICWSM.

[20]  Alexander J. Smola,et al.  Parallelized Stochastic Gradient Descent , 2010, NIPS.

[21]  Xiaojun Wu,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Jieping Ye,et al.  Moreau-Yosida Regularization for Grouped Tree Structure Learning , 2010, NIPS.

[23]  Volker Tresp,et al.  Nonparametric Relational Learning for Social Network Analysis , 2008 .

[24]  Huan Liu,et al.  Attributed Network Embedding for Learning in a Dynamic Environment , 2017, CIKM.

[25]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[26]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[27]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.