Link Prediction-Based Multi-label Classification on Networked Data

In this paper, we study the problem of multi-label classification on networked data, where each instance in the network is assigned multiple labels and the connections between instances are driven by various causal reasons. Networked data extracted from social media or web pages may not accurately reflect the real-life relationships between users. By mining links that exist but have not yet been observed in the network, potential relations between users can be discovered, which helps predict the users' labels more accurately. We propose a link prediction-based multi-label relational neighbor classifier that employs social context features (LP-SCRN). It first predicts missing links in the network and then weights each link according to the similarity between the social features of the nodes it connects. In addition, by capturing the potential correlation between nodes, we expand each node's neighbor set and refine the multi-label relational classifier. Experiments on two real-world datasets demonstrate that the proposed method improves the performance of multi-label classification on networked data.
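The three steps named above (predict missing links, weight links by social-feature similarity, classify from the expanded neighbor set) can be illustrated with a minimal Python sketch. It is not the LP-SCRN implementation. The common-neighbors link predictor, the cosine similarity measure, the `min_common` threshold, the toy graph, and the two-dimensional feature vectors are all assumptions chosen for illustration; the abstract does not specify any of them.

```python
from itertools import combinations
from math import sqrt


def predict_links(adj, min_common=2):
    """Propose unlinked node pairs sharing at least `min_common` neighbors.

    Common-neighbors scoring is an assumed stand-in for the link predictor.
    """
    proposed = set()
    for u, v in combinations(sorted(adj), 2):
        if v not in adj[u] and len(adj[u] & adj[v]) >= min_common:
            proposed.add((u, v))
    return proposed


def with_links(adj, links):
    """Return a copy of the adjacency map with the extra links added."""
    expanded = {node: set(nbrs) for node, nbrs in adj.items()}
    for u, v in links:
        expanded[u].add(v)
        expanded[v].add(u)
    return expanded


def cosine(a, b):
    """Cosine similarity between two feature vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def label_scores(node, adj, features, labels):
    """Similarity-weighted vote over the labels of a node's labeled neighbors."""
    votes, total = {}, 0.0
    for nbr in adj[node]:
        if nbr not in labels:
            continue
        weight = cosine(features[node], features[nbr])
        total += weight
        for label in labels[nbr]:
            votes[label] = votes.get(label, 0.0) + weight
    return {label: v / total for label, v in votes.items()} if total else {}


# Hypothetical toy network: a square a-b-d-c-a with no diagonals.
adj = {"a": {"b", "c"}, "b": {"a", "d"}, "c": {"a", "d"}, "d": {"b", "c"}}
# Hypothetical social-feature vectors and known label sets.
features = {"a": [1, 0], "b": [1, 0], "c": [0, 1], "d": [1, 1]}
labels = {"b": {"sports"}, "c": {"music"}, "d": {"sports", "music"}}

predicted = predict_links(adj)            # step 1: find missing links
expanded = with_links(adj, predicted)     # expand each node's neighbor set
scores = label_scores("a", expanded, features, labels)  # steps 2 and 3
```

In this toy case the predicted link between `a` and `d` adds a third labeled neighbor for `a`, so `d`'s labels contribute to the vote in proportion to its feature similarity with `a`. A real system would replace the assumed predictor and similarity measure with the ones defined in the paper.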
