Toward Time-Evolving Feature Selection on Dynamic Networks

Recent years have witnessed the prevalence of networked data in various domains. Among them, a large number of networks are not only topologically structured but also have a rich set of features on nodes. These node features are usually of high dimensionality with noisy, irrelevant and redundant information, which may impede the performance of other learning tasks. Feature selection is useful to alleviate these critical issues. Nonetheless, a vast majority of existing feature selection algorithms are predominantly designed in a static setting. In reality, real-world networks are naturally dynamic, characterized by both topology and content changes. It is desirable to capture these changes to find relevant features tightly hinged with network structure continuously, which is of fundamental importance for many applications such as disaster relief and viral marketing. In this paper, we study a novel problem of time-evolving feature selection for dynamic networks in an unsupervised scenario. Specifically, we propose a TeFS framework by leveraging the temporal evolution property of dynamic networks to update the feature selection results incrementally. Experimental results show the superiority of TeFS over the state-of-the-art batch-mode unsupervised feature selection algorithms.

[1]  Huan Liu,et al.  Feature Selection with Linked Data in Social Media , 2012, SDM.

[2]  Jing Liu,et al.  Unsupervised Feature Selection Using Nonnegative Spectral Analysis , 2012, AAAI.

[3]  Huan Liu,et al.  Unsupervised feature selection for linked social media data , 2012, KDD.

[4]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[5]  Xin Wang,et al.  Recommending Groups to Users Using User-Group Engagement and Time-Dependent Matrix Factorization , 2016, AAAI.

[6]  Philip S. Yu,et al.  Efficient Partial Order Preserving Unsupervised Feature Selection on Networks , 2015, SDM.

[7]  Jian Pei,et al.  SNOC: Streaming Network Node Classification , 2014, 2014 IEEE International Conference on Data Mining.

[8]  Hanghang Tong,et al.  Fast Eigen-Functions Tracking on Dynamic Graphs , 2015, SDM.

[9]  Huan Liu,et al.  Discovering Overlapping Groups in Social Media , 2010, 2010 IEEE International Conference on Data Mining.

[10]  Charu C. Aggarwal,et al.  Evolutionary Network Analysis , 2014, ACM Comput. Surv..

[11]  Chih-Jen Lin,et al.  Projected Gradient Methods for Nonnegative Matrix Factorization , 2007, Neural Computation.

[12]  Huan Liu,et al.  Robust Unsupervised Feature Selection on Networked Data , 2016, SDM.

[13]  Huan Liu,et al.  Spectral feature selection for supervised and unsupervised learning , 2007, ICML '07.

[14]  Kewei Cheng,et al.  Feature Selection , 2016, ACM Comput. Surv..

[15]  Lise Getoor,et al.  Collective Classification in Network Data , 2008, AI Mag..

[16]  Philip S. Yu,et al.  Colibri: fast mining of large static and dynamic graphs , 2008, KDD.

[17]  James D. Westaby Dynamic Network Theory: How Social Networks Influence Goal Pursuit , 2011 .

[18]  Jingrui He,et al.  Rare Category Detection on Time-Evolving Graphs , 2015, 2015 IEEE International Conference on Data Mining.

[19]  Jennifer Neville,et al.  Randomization tests for distinguishing social influence and homophily effects , 2010, WWW '10.

[20]  Deng Cai,et al.  Laplacian Score for Feature Selection , 2005, NIPS.

[21]  Jiawei Han,et al.  Towards feature selection in network , 2011, CIKM '11.

[22]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[23]  Huan Liu,et al.  Unsupervised Streaming Feature Selection in Social Media , 2015, CIKM.