Unsupervised Feature Selection in Signed Social Networks

The rapid growth of social media services brings a large amount of high-dimensional social media data at an unprecedented rate. Feature selection is powerful to prepare high-dimensional data by finding a subset of relevant features. A vast majority of existing feature selection algorithms for social media data exclusively focus on positive interactions among linked instances such as friendships and user following relations. However, in many real-world social networks, instances may also be negatively interconnected. Recent work shows that negative links have an added value over positive links in advancing many learning tasks. In this paper, we study a novel problem of unsupervised feature selection in signed social networks and propose a novel framework SignedFS. In particular, we provide a principled way to model positive and negative links for user latent representation learning. Then we embed the user latent representations into feature selection when label information is not available. Also, we revisit the principle of homophily and balance theory in signed social networks and incorporate the signed graph regularization into the feature selection framework to capture the first-order and the second-order proximity among users in signed social networks. Experiments on two real-world signed social networks demonstrate the effectiveness of our proposed framework. Further experiments are conducted to understand the impacts of different components of SignedFS.

[1]  David G. Stork,et al.  Pattern Classification , 1973 .

[2]  Huan Liu,et al.  Is distrust the negation of trust?: the value of distrust in social media , 2014, HT.

[3]  Deng Cai,et al.  Laplacian Score for Feature Selection , 2005, NIPS.

[4]  Mohamed S. Kamel,et al.  An Efficient Greedy Method for Unsupervised Feature Selection , 2011, 2011 IEEE 11th International Conference on Data Mining.

[5]  Huan Liu,et al.  Reconstruction-based Unsupervised Feature Selection: An Embedded Approach , 2017, IJCAI.

[6]  Charu C. Aggarwal,et al.  A Survey of Signed Network Mining in Social Media , 2015, ACM Comput. Surv..

[7]  Huan Liu,et al.  Feature Selection with Linked Data in Social Media , 2012, SDM.

[8]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[9]  Jing Liu,et al.  Unsupervised Feature Selection Using Nonnegative Spectral Analysis , 2012, AAAI.

[10]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[11]  Huan Liu,et al.  Unsupervised feature selection for linked social media data , 2012, KDD.

[12]  Jiming Liu,et al.  Community Mining from Signed Social Networks , 2007, IEEE Transactions on Knowledge and Data Engineering.

[13]  Liu Huan,et al.  Toward Time-Evolving Feature Selection on Dynamic Networks , 2016 .

[14]  Lada A. Adamic,et al.  Power-Law Distribution of the World Wide Web , 2000, Science.

[15]  Nagarajan Natarajan,et al.  Exploiting longer cycles for link prediction in signed networks , 2011, CIKM '11.

[16]  Jiawei Han,et al.  Towards feature selection in network , 2011, CIKM '11.

[17]  Hiroshi Motoda,et al.  Computational Methods of Feature Selection , 2007 .

[18]  Cosma Rohilla Shalizi,et al.  Homophily and Contagion Are Generically Confounded in Observational Social Network Studies , 2010, Sociological methods & research.

[19]  E. Xing,et al.  Statistical Estimation of Correlated Genome Associations to a Quantitative Trait Network , 2009, PLoS genetics.

[20]  Charu C. Aggarwal,et al.  Recommendations in Signed Social Networks , 2016, WWW.

[21]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[22]  Lei Xie,et al.  FASCINATE: Fast Cross-Layer Dependency Inference on Multi-layered Networks , 2016, KDD.

[23]  Ramanathan V. Guha,et al.  Propagation of trust and distrust , 2004, WWW '04.

[24]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[25]  Kewei Cheng,et al.  Unsupervised Sentiment Analysis with Signed Social Networks , 2017, AAAI.

[26]  Kewei Cheng,et al.  FeatureMiner: A Tool for Interactive Feature Selection , 2016, CIKM.

[27]  Charu C. Aggarwal,et al.  Negative Link Prediction in Social Media , 2014, WSDM.

[28]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[29]  Huan Liu,et al.  Unsupervised Streaming Feature Selection in Social Media , 2015, CIKM.

[30]  F. Heider Attitudes and cognitive organization. , 1946, The Journal of psychology.

[31]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[32]  Sahin Albayrak,et al.  Spectral Analysis of Signed Graphs for Clustering, Prediction and Visualization , 2010, SDM.

[33]  Huan Liu,et al.  Robust Unsupervised Feature Selection on Networked Data , 2016, SDM.

[34]  Deng Cai,et al.  Unsupervised feature selection for multi-cluster data , 2010, KDD.

[35]  Huan Liu,et al.  Spectral feature selection for supervised and unsupervised learning , 2007, ICML '07.

[36]  Huan Liu,et al.  Challenges of Feature Selection for Big Data Analytics , 2016, IEEE Intelligent Systems.

[37]  Huan Liu,et al.  Toward Time-Evolving Feature Selection on Dynamic Networks , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[38]  Jing Liu,et al.  A comparative analysis of evolutionary and memetic algorithms for community detection from signed social networks , 2013, Soft Computing.