HOPF: Higher Order Propagation Framework for Deep Collective Classification

Given a graph where every node has certain attributes associated with it and some nodes have labels associated with them, Collective Classification (CC) is the task of assigning labels to every unlabeled node using information from the node as well as its neighbors. It is often the case that a node is not only influenced by its immediate neighbors but also by higher order neighbors, multiple hops away. Recent state-of-the-art models for CC learn end-to-end differentiable variations of Weisfeiler-Lehman (WL) kernels to aggregate multi-hop neighborhood information. In this work, we propose a Higher Order Propagation Framework, HOPF, which provides an iterative inference mechanism for these powerful differentiable kernels. Such a combination of classical iterative inference mechanism with recent differentiable kernels allows the framework to learn graph convolutional filters that simultaneously exploit the attribute and label information available in the neighborhood. Further, these iterative differentiable kernels can scale to larger hops beyond the memory limitations of existing differentiable kernels. We also show that existing WL kernel-based models suffer from the problem of Node Information Morphing where the information of the node is morphed or overwhelmed by the information of its neighbors when considering multiple hops. To address this, we propose a specific instantiation of HOPF, called the NIP models, which preserves the node information at every propagation step. The iterative formulation of NIP models further helps in incorporating distant hop information concisely as summaries of the inferred labels. We do an extensive evaluation across 11 datasets from different domains. We show that existing CC models do not provide consistent performance across datasets, while the proposed NIP model with iterative inference is more robust.

[1]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[2]  Lise Getoor,et al.  Collective entity resolution in relational data , 2007, TKDD.

[3]  Charu C. Aggarwal,et al.  Social Network Data Analytics , 2011 .

[4]  Christos Faloutsos,et al.  Fast Random Walk with Restart and Its Applications , 2006, Sixth International Conference on Data Mining (ICDM'06).

[5]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[6]  C. Lee Giles,et al.  Advances in Social Network Mining and Analysis, Second International Workshop, SNAKDD 2008, Las Vegas, NV, USA, August 24-27, 2008, Revised Selected Papers , 2010, SNAKDD.

[7]  Paul N. Bennett,et al.  Overcoming Relational Learning Biases to Accurately Predict Preferences in Large Scale Networks , 2015, WWW.

[8]  Kristian Kersting,et al.  Glocalized Weisfeiler-Lehman Graph Kernels: Global-Local Feature Maps of Graphs , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[9]  Lise Getoor,et al.  Query-driven Active Surveying for Collective Classification , 2012 .

[10]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[11]  Roman Garnett,et al.  Propagation kernels: efficient graph kernels from propagated information , 2015, Machine Learning.

[12]  Samuel S. Schoenholz,et al.  Neural Message Passing for Quantum Chemistry , 2017, ICML.

[13]  Graham Cormode,et al.  Node Classification in Social Networks , 2011, Social Network Data Analytics.

[14]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[15]  Lise Getoor,et al.  Link-Based Classification , 2003, Encyclopedia of Machine Learning and Data Mining.

[16]  Doina Precup,et al.  Iterative Multilevel MRF Leveraging Context and Voxel Information for Brain Tumour Segmentation in MRI , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Huan Liu,et al.  Discovering Overlapping Groups in Social Media , 2010, 2010 IEEE International Conference on Data Mining.

[18]  Byron Boots,et al.  Learning to Filter with Predictive State Inference Machines , 2015, ICML.

[19]  Christos Faloutsos,et al.  Using ghost edges for classification in sparsely labeled networks , 2008, KDD.

[20]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[21]  David W. Aha,et al.  Semi-Supervised Collective Classification via Hybrid Label Regularization , 2012, ICML.

[22]  David W. Aha,et al.  Labels or attributes?: rethinking the neighbors for collective classification in sparsely-labeled networks , 2013, CIKM.

[23]  Pramodita Sharma 2012 , 2013, Les 25 ans de l’OMC: Une rétrospective en photos.

[24]  Tina Eliassi-Rad,et al.  Leveraging Label-Independent Features for Classification in Sparsely Labeled Networks: An Empirical Study , 2008, SNAKDD.

[25]  Rok Sosic,et al.  SNAP , 2016, ACM Trans. Intell. Syst. Technol..

[26]  Jennifer Neville,et al.  Collective Classification with Relational Dependency Networks , 2003 .

[27]  Tsvi Kuflik,et al.  Workshop on information heterogeneity and fusion in recommender systems (HetRec 2010) , 2010, RecSys '10.

[28]  Kurt Mehlhorn,et al.  Weisfeiler-Lehman Graph Kernels , 2011, J. Mach. Learn. Res..

[29]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[30]  Jennifer Neville,et al.  Iterative Classification in Relational Data , 2000 .

[31]  Tsvi Kuflik,et al.  Second workshop on information heterogeneity and fusion in recommender systems (HetRec2011) , 2011, RecSys '11.

[32]  Kalyan Moy Gupta,et al.  Cautious Inference in Collective Classification , 2007, AAAI.

[33]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[34]  Jennifer Neville,et al.  Why collective inference improves relational classification , 2004, KDD.

[35]  Jennifer Neville,et al.  Deep Collective Inference , 2017, AAAI.

[36]  Lise Getoor,et al.  Collective Classification in Network Data , 2008, AI Mag..

[37]  Ruslan Salakhutdinov,et al.  Revisiting Semi-Supervised Learning with Graph Embeddings , 2016, ICML.

[38]  David A. Clausi,et al.  Combining local and global features for image segmentation using iterative classification and region merging , 2005, The 2nd Canadian Conference on Computer and Robot Vision (CRV'05).