Context Specificity Matters: Profile Attributes Prediction for Social Network Users

Analysis of social network user profiles plays an important role in various applications such as recommender systems and marketing services. In this paper we consider the task of profile prediction given a sample of a social graph. We introduce a new node property named context specificity, which allows to enhance node attribute prediction methods by emphasizing more useful nodes and edges. Particularly, we show how to apply it to several existing methods: label propagation, classification based on node embeddings, graph convolutional neural network. Our experiments prove that context specificity feature improves methods quality in most considered tasks. Additionally, we suggest a new supervised method based on attribute values distributions that is competitive with state of the art approaches. We also describe and share two new datasets, representing a sample of VK social network.

[1]  Sunghwan Mac Kim,et al.  Demographic Inference on Twitter using Recursive Neural Networks , 2017, ACL.

[2]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[3]  Steven Skiena,et al.  Exact Age Prediction in Social Networks , 2015, WWW.

[4]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Alex Smola,et al.  Deep Graph Library: Towards Efficient and Scalable Deep Learning on Graphs , 2019, ArXiv.

[6]  A. Gomzin,et al.  A method of automatically estimating user age using social connections , 2016 .

[7]  Oleg U. Ivanov,et al.  Learning Representations in Directed Networks , 2015, AIST.

[8]  L. Takac DATA ANALYSIS IN PUBLIC SOCIAL NETWORKS , 2012 .

[9]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[11]  Krishna P. Gummadi,et al.  You are who you know: inferring user profiles in online social networks , 2010, WSDM '10.

[12]  Nikolaos Aletras,et al.  Predicting Twitter User Socioeconomic Attributes with Network and Language Information , 2018, HT.

[13]  Y. S. Trofimovich,et al.  Approaches to estimate location of social network users based on social graph , 2016 .

[14]  Jure Leskovec,et al.  Representation Learning on Graphs: Methods and Applications , 2017, IEEE Data Eng. Bull..

[15]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[16]  Palash Goyal,et al.  Graph Embedding Techniques, Applications, and Performance: A Survey , 2017, Knowl. Based Syst..

[17]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[18]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Nikolaos Aletras,et al.  An analysis of the user occupational class through Twitter content , 2015, ACL.

[20]  Christos Faloutsos,et al.  Sampling from large graphs , 2006, KDD '06.

[21]  Zoubin Ghahramani,et al.  Learning from labeled and unlabeled data with label propagation , 2002 .

[22]  Katja Filippova,et al.  User Demographics and Language in an Implicit Social Network , 2012, EMNLP.