Relational Learning Models for Social Network Analysis

Social networks have been studied for nearly half a century by sociologists to analyze interactions between people. Nowadays, with the advent of Web 2.0, social networks have moved from being an abstract concept to actual online applications such as Facebook, Twitter and Linkedin, which are used daily by people to create and maintain relationships with friends, co-workers and other acquaintances. However, online social networks allow their users to do more than just maintain friendships: people can generally create and share content in various forms, from simple textual messages (called posts) to photos, videos, audios and much more. Several approaches in Social Network Analysis have been proposed in the years to extract knowledge from social networks, addressing tasks that ranges from understanding how users create and modify their relationships, to finding the most influential people in a group, to understanding the ideas and opinions expressed by people in their posts. Many techniques from the field of Machine Learning have been used to address these problems. While some of them exploit the relationships among users, others focus on the content generated by the users, typically by analyzing the textual content written in the posts. These approaches, however, are generally unable to exploit both, in this way ignoring a consistent part of information available in social networks. The field of Relational Learning tries to overcome this limitation, by extending traditional approaches in order to use both sources of information, and thus achieve better performances. In this thesis, I propose new Relational Learning approaches that address two tasks in Social Network Analysis. The first task is Community Discovery, which objective is to detect groups of users that share strong connections (e.g. working in the same company, attended the same school, etc.) or sharing the same interests. While this task is generally addressed by considering only the network structure, adding the user content can allow to increase the performance. The second task is Opinion Detection, which objective is to infer the opinion of users about a specific topic (politics, likeness of a brand). This task is typically addressed using user textual content, but the relationships can provide additional insights that allow to improve the inference of users’ opinions. The experimental investigations reveal that network structure and user-generated content provide complementary information, and that using both sources of data can improve the performance of algorithms in both community discovery (a structure-based task) and opinion detection (a content-based task). iii

[1]  Elisabetta Fersini,et al.  Web Page Classification through Probabilistic Relational Models , 2013, Int. J. Pattern Recognit. Artif. Intell..

[2]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[3]  Ulrik Brandes,et al.  On Finding Graph Clusterings with Maximum Modularity , 2007, WG.

[4]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[5]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[6]  Adrian E. Raftery,et al.  Bayesian Model Averaging: A Tutorial , 2016 .

[7]  Srinivasan Parthasarathy,et al.  Efficient community detection in large networks using content and links , 2012, WWW.

[8]  Ben Taskar,et al.  Introduction to Statistical Relational Learning (Adaptive Computation and Machine Learning) , 2007 .

[9]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[10]  Nigel Collier,et al.  Sentiment Analysis using Support Vector Machines with Diverse Information Sources , 2004, EMNLP.

[11]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  V Latora,et al.  Efficient behavior of small-world networks. , 2001, Physical review letters.

[13]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[14]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[15]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[16]  Ellen Spertus,et al.  Smokey: Automatic Recognition of Hostile Messages , 1997, AAAI/IAAI.

[17]  Simon Price,et al.  Inductive Logic Programming , 2000, Lecture Notes in Computer Science.

[18]  Steven Skiena,et al.  Trading Strategies to Exploit Blog and News Sentiment , 2010, ICWSM.

[19]  Philip S. Yu,et al.  Community detection in incomplete information networks , 2012, WWW.

[20]  Anupam Joshi,et al.  Identifying and characterizing user communities on Twitter during crisis events , 2012, DUBMMSM '12.

[21]  Ben Taskar,et al.  Probabilistic Classification and Clustering in Relational Data , 2001, IJCAI.

[22]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[23]  Martin Ester,et al.  Opinion digger: an unsupervised opinion miner from unstructured product reviews , 2010, CIKM.

[24]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[25]  J. A. Barnes Class and Committees in a Norwegian Island Parish , 1954 .

[26]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[27]  Kristina Lerman,et al.  The Role of Social Media in the Discussion of Controversial Topics , 2013, 2013 International Conference on Social Computing.

[28]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[29]  Piotr Indyk,et al.  Enhanced hypertext categorization using hyperlinks , 1998, SIGMOD '98.

[30]  Long Jiang,et al.  User-level sentiment analysis incorporating social networks , 2011, KDD.

[31]  Manfred Jaeger,et al.  Relational Bayesian Networks , 1997, UAI.

[32]  Stanford,et al.  Learning to Discover Social Circles in Ego Networks , 2012 .

[33]  Jie Tang,et al.  Detecting Community Kernels in Large Social Networks , 2011, 2011 IEEE 11th International Conference on Data Mining.

[34]  Jonathon Read,et al.  Using Emoticons to Reduce Dependency in Machine Learning Techniques for Sentiment Classification , 2005, ACL.

[35]  Kristina Lerman,et al.  Tripartite graph clustering for dynamic sentiment analysis on social media , 2014, SIGMOD Conference.

[36]  Jun'ichi Tatemura Virtual reviewers for collaborative exploration of movie reviews , 2000, IUI '00.

[37]  Fang Wu,et al.  Finding communities in linear time: a physics approach , 2003, ArXiv.

[38]  Santosh S. Vempala,et al.  On clusterings-good, bad and spectral , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[39]  Wojciech Gryc,et al.  Leveraging Textual Sentiment Analysis with Social Network Modelling , 2014 .

[40]  Hong Chen,et al.  Detecting network communities using regularized spectral clustering algorithm , 2014, Artificial Intelligence Review.

[41]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[42]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[43]  Elisabetta Fersini,et al.  Bayesian Model Averaging and Model Selection for Polarity Classification , 2013, NLDB.

[44]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[45]  Bin Wu,et al.  Community detection in large-scale social networks , 2007, WebKDD/SNA-KDD '07.

[46]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[47]  Charu C. Aggarwal,et al.  Community Detection with Edge Content in Social Media Networks , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[48]  Maurice Bruynooghe,et al.  Logical Bayesian Networks and Their Relation to Other Probabilistic Logical Models , 2005, BNAIC.

[49]  Jennifer Neville,et al.  Why collective inference improves relational classification , 2004, KDD.

[50]  Charu C. Aggarwal,et al.  Social Network Data Analytics , 2011 .

[51]  Mike Y. Chen,et al.  Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web , 2001 .

[52]  Bernardo A. Huberman,et al.  E-Mail as Spectroscopy: Automated Discovery of Community Structure within Organizations , 2005, Inf. Soc..

[53]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[54]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[55]  Bo Hu,et al.  User Features and Social Networks for Topic Modeling in Online Social Media , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[56]  Nagarajan Natarajan,et al.  Community detection in content-sharing social networks , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[57]  David S. Johnson,et al.  Some Simplified NP-Complete Graph Problems , 1976, Theor. Comput. Sci..

[58]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[59]  Eeti Jain,et al.  Using Mahout for clustering similar Twitter users: Performance evaluation of k-means and its comparison with fuzzy k-means , 2014, 2014 International Conference on Computer and Communication Technology (ICCCT).

[60]  Jennifer Neville,et al.  Linkage and Autocorrelation Cause Feature Selection Bias in Relational Learning , 2002, ICML.

[61]  Claudio Castellano,et al.  Community Structure in Graphs , 2007, Encyclopedia of Complexity and Systems Science.

[62]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[63]  Lawrence B. Holder,et al.  Mining Graph Data , 2006 .

[64]  Steven Skiena,et al.  The Wisdom of Bookies? Sentiment Analysis Versus. the NFL Point Spread , 2010, ICWSM.

[65]  Jure Leskovec,et al.  Planetary-scale views on a large instant-messaging network , 2008, WWW.

[66]  Luc De Raedt,et al.  Towards digesting the alphabet-soup of statistical relational learning , 2008 .

[67]  Daniel Kifer,et al.  What Is an Opinion About? Exploring Political Standpoints Using Opinion Scoring Model , 2010, AAAI.

[68]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[69]  Shlomo Argamon,et al.  Using appraisal groups for sentiment analysis , 2005, CIKM '05.

[70]  Ludovic Denoyer,et al.  Learning latent representations of nodes for classifying in heterogeneous social networks , 2014, WSDM.

[71]  Yang Zhang,et al.  Community Discovery in Twitter Based on User Interests , 2012 .

[72]  Judea Pearl,et al.  Probabilistic reasoning using graphs , 1987, International Conference on Information Processing and Management of Uncertainty.

[73]  Noah A. Smith,et al.  What's Worthy of Comment? Content and Comment Volume in Political Blogs , 2010, ICWSM.

[74]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[75]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[76]  Lise Getoor,et al.  Active Surveying: A Probabilistic Approach for Identifying Key Opinion Leaders , 2011, IJCAI.

[77]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, Web Intelligence.

[78]  Michael L. Wick,et al.  SampleRank : Learning Preferences from Atomic Gradients , 2009 .

[79]  Yihong Gong,et al.  Combining content and link for classification using matrix factorization , 2007, SIGIR.

[80]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[81]  Yun Chi,et al.  Combining link and content for community detection: a discriminative approach , 2009, KDD.

[82]  T. Huston,et al.  Interpersonal attraction and relationships. , 1978, Annual review of psychology.

[83]  Marie desJardins,et al.  Generative Models for Clustering: The Next Generation , 2008, AAAI Spring Symposium: Social Information Processing.

[84]  G. Caldarelli,et al.  Detecting communities in large networks , 2004, cond-mat/0402499.

[85]  Kathleen M. Carley A Theory of Group Stability , 1991 .

[86]  Kathleen M. Carley,et al.  On the Coevolution of Stereotype, Culture, and Social Relationships , 2014 .

[87]  Saso Dzeroski,et al.  Multi-relational data mining: an introduction , 2003, SKDD.

[88]  Kristian Kersting,et al.  An inductive logic programming approach to statistical relational learning , 2006, AI Commun..

[89]  Heng Ji,et al.  Exploring and inferring user–user pseudo‐friendship for sentiment analysis with heterogeneous networks , 2014, Stat. Anal. Data Min..

[90]  Thomas G. Dietterich The Handbook of Brain Theory and Neural Networks , 2002 .

[91]  Jason Baldridge,et al.  Twitter Polarity Classification with Label Propagation over Lexical Links and the Follower Graph , 2011, ULNLP@EMNLP.

[92]  Loren Terveen,et al.  PHOAKS: a system for sharing recommendations , 1997, CACM.

[93]  Noah A. Smith,et al.  Movie Reviews and Revenues: An Experiment in Text Regression , 2010, NAACL.

[94]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[95]  Ronen Feldman,et al.  Identifying and Following Expert Investors in Stock Microblogs , 2011, EMNLP.

[96]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[97]  Huan Liu,et al.  Exploiting social relations for sentiment analysis in microblogging , 2013, WSDM.

[98]  Takashi Washio,et al.  State of the art of graph-based data mining , 2003, SKDD.

[99]  Irina Rish,et al.  An empirical study of the naive Bayes classifier , 2001 .

[100]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[101]  P. Erdos,et al.  Some further statistical properties of the digits in Cantor's series , 1959 .

[102]  Ben Taskar,et al.  Probabilistic Entity-Relationship Models, PRMs, and Plate Models , 2007 .

[103]  Francesco Archetti,et al.  A probabilistic relational approach for web document clustering , 2010, Inf. Process. Manag..

[104]  H. Faili,et al.  A non-parametric LDA-based induction method for sentiment analysis , 2012, The 16th CSI International Symposium on Artificial Intelligence and Signal Processing (AISP 2012).

[105]  Bart De Moor,et al.  Clustering by heterogeneous data fusion : framework and applications , 2008, ICONIP 2008.

[106]  Xiaohui Yu,et al.  ARSA: a sentiment-aware model for predicting sales performance using blogs , 2007, SIGIR.

[107]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[108]  S. Džeroski,et al.  Relational Data Mining , 2001, Springer Berlin Heidelberg.

[109]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.