Multi-label collective classification in multi-attribute multi-relational network data

Classical machine learning techniques assume that data are i.i.d., but real-world data is inherently relational and can generally be represented as a graph or some variant of one. The importance of modeling relational data is evident from its growing presence in many domains: telecom networks, the WWW, social networks, organizational networks, images, protein sequences, etc. The field has recently received considerable attention in various communities, under different themes depending on the problem addressed and the nature of the solution proposed. Collective classification is one such popular approach: it uses a local classifier that embeds a node's own attributes and its neighbors' information in a feature vector, and classifies the nodes in an iterative procedure. Despite this growing popularity, little attention has been paid to datasets with multiple attributes and multi-relational (MAMR) networks in multi-label scenarios. In MAMR data, nodes can be represented using multiple types of attributes (attribute views), and there are multiple link types between the nodes. For example, Twitter users can be represented by their tweets, shared URLs, hashtags, and list memberships, and they can be connected by follower, followed-by, and retweet links. In addition, in many networks nodes are associated with more than one label. For instance, Twitter users can be tagged with one or more labels from a set L, where L contains the movie genres that a user might like. Motivated by this, we propose a learning technique for multi-label collective classification that uses multiple attribute views on multi-relational network data and captures complex label correlations within and across attribute and relationship types. We empirically evaluate the proposed approach on Twitter and MovieLens datasets and show that it performs better than state-of-the-art approaches.
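The generic collective classification loop described in the abstract (a local classifier over a node's own attributes plus neighbor label information, applied iteratively) can be sketched as follows. This is a minimal illustration under stated assumptions, not the method proposed in the paper: the per-label nearest-centroid classifier is a stand-in for the local classifier, the neighbor features are simple per-relation label fractions, and all function names and the toy data are hypothetical.

```python
from math import dist

def neighbor_label_fractions(node, adjacency, current_labels, label_set):
    """Fraction of a node's labelled neighbors (one link type) carrying each label."""
    labelled = [n for n in adjacency.get(node, []) if n in current_labels]
    if not labelled:
        return [0.0] * len(label_set)
    return [sum(l in current_labels[n] for n in labelled) / len(labelled)
            for l in label_set]

def build_vector(node, attrs, relations, current_labels, label_set):
    """Own attributes followed by one block of neighbor features per link type."""
    vec = list(attrs[node])
    for name in sorted(relations):
        vec += neighbor_label_fractions(node, relations[name], current_labels, label_set)
    return vec

def centroid(vectors):
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def train(vectors, label_sets, label_set):
    """Assumed stand-in local classifier: one positive/negative centroid pair per label."""
    model = {}
    for l in label_set:
        pos = [v for v, ls in zip(vectors, label_sets) if l in ls]
        neg = [v for v, ls in zip(vectors, label_sets) if l not in ls]
        model[l] = (centroid(pos) if pos else None, centroid(neg) if neg else None)
    return model

def predict(model, vec):
    """Assign every label whose positive centroid is closer than its negative one."""
    out = set()
    for l, (pos, neg) in model.items():
        if pos is not None and (neg is None or dist(vec, pos) < dist(vec, neg)):
            out.add(l)
    return out

def iterative_classification(attrs, relations, known, label_set, max_iters=10):
    unknown = [n for n in range(len(attrs)) if n not in known]
    # Train once on labelled nodes, using only the known labels of their neighbors.
    train_vecs = [build_vector(n, attrs, relations, known, label_set) for n in known]
    model = train(train_vecs, [known[n] for n in known], label_set)
    # Bootstrap: first guess for unlabelled nodes from known neighbor labels only.
    current = dict(known)
    for n in unknown:
        current[n] = predict(model, build_vector(n, attrs, relations, known, label_set))
    # Iterate: recompute neighbor features from current predictions until stable.
    for _ in range(max_iters):
        snapshot = dict(current)
        updated = {n: predict(model, build_vector(n, attrs, relations, snapshot, label_set))
                   for n in unknown}
        if all(updated[n] == current[n] for n in unknown):
            break
        current.update(updated)
    return {n: current[n] for n in unknown}

# Hypothetical toy data: one numeric attribute per user, two link types, two genres.
label_set = ["action", "comedy"]
attrs = [[1.0], [1.0], [0.0], [0.0], [0.9], [0.1]]
relations = {
    "follows": {0: [1, 4], 1: [0, 4], 4: [0, 1], 2: [3, 5], 3: [2, 5], 5: [2, 3]},
    "retweets": {0: [4], 4: [0], 3: [5], 5: [3]},
}
known = {0: {"action"}, 1: {"action"}, 2: {"comedy"}, 3: {"comedy"}}
predicted = iterative_classification(attrs, relations, known, label_set)
print(predicted)
```

Each label is predicted independently here, so a node can receive zero, one, or several labels. Modeling correlations between labels and across attribute views, which the abstract names as the contribution, is deliberately left out of this sketch.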
