Extracting Social Dimensions Using Fiedler Embedding

We present and evaluate a Fiedler embedding representation for multi-label classification of social media data. Networked data, such as data from social media, contains instances of multiple types that are related through different types of links. Because of this network structure, the instances are no longer independent and identically distributed (i.i.d.). Relational learning improves classification performance by leveraging the correlation between the labels of linked instances. However, instances in a network can be linked for different causal reasons, so treating all links homogeneously limits the performance of relational classifiers on such datasets. Social-dimension-based approaches address this problem by extracting a feature space that captures the prominent interaction patterns in the network. We propose an alternative low-dimensional social feature representation that is extracted from edge-based social dimensions using Fiedler embedding. This embedded feature space encodes the relations between people and their connections (nodes and links). Experiments on two real-world social media datasets demonstrate that the proposed framework offers a better feature representation for multi-label classification problems on social media.
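To make the idea of a Fiedler embedding concrete, the following is a minimal sketch, not the authors' implementation. It assumes the generic spectral construction: build the graph Laplacian L = D - A and use the eigenvectors belonging to the smallest non-zero eigenvalues as node coordinates. The function name `fiedler_embedding`, the toy graph, and the choice to leave the eigenvectors unscaled are all illustrative assumptions. The graph construction used in the paper, which involves edge-based social dimensions, is not reproduced here.

```python
import numpy as np


def fiedler_embedding(adjacency, k):
    """Return a k-dimensional spectral embedding of the nodes of a graph.

    Hypothetical helper: assumes a connected, undirected graph given as a
    symmetric non-negative adjacency matrix.
    """
    A = np.asarray(adjacency, dtype=float)
    degrees = A.sum(axis=1)
    laplacian = np.diag(degrees) - A  # unnormalized Laplacian L = D - A
    # eigh returns the eigenvalues of a symmetric matrix in ascending order.
    eigvals, eigvecs = np.linalg.eigh(laplacian)
    # Skip the first eigenvector (constant, eigenvalue 0 on a connected graph)
    # and keep the next k as coordinates, one row per node.
    return eigvals[1:k + 1], eigvecs[:, 1:k + 1]


# Toy graph (an assumption for illustration): two triangles, {0, 1, 2} and
# {3, 4, 5}, joined by the single bridge edge 2-3.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
adjacency = np.zeros((6, 6))
for i, j in edges:
    adjacency[i, j] = adjacency[j, i] = 1.0

eigenvalues, embedding = fiedler_embedding(adjacency, k=2)

# The first coordinate (the Fiedler vector) takes one sign on the first
# triangle and the opposite sign on the second.
print(np.sign(embedding[:, 0]))
```

In a classification setting, the rows of `embedding` would serve as low-dimensional features for the corresponding nodes. Which entities are embedded and how many dimensions are kept are choices this sketch does not attempt to match to the paper.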
