Rep the Set: Neural Networks for Learning Set Representations

In several domains, data objects can be decomposed into sets of simpler objects. It is then natural to represent each object as the set of its components or parts. Many conventional machine learning algorithms are unable to process this kind of representations, since sets may vary in cardinality and elements lack a meaningful ordering. In this paper, we present a new neural network architecture, called RepSet, that can handle examples that are represented as sets of vectors. The proposed model computes the correspondences between an input set and some hidden sets by solving a series of network flow problems. This representation is then fed to a standard neural network architecture to produce the output. The architecture allows end-to-end gradient-based learning. We demonstrate RepSet on classification tasks, including text categorization, and graph classification, and we show that the proposed neural network achieves performance better or comparable to state-of-the-art algorithms.

[1]  Tony Jebara,et al.  A Kernel Between Sets of Vectors , 2003, ICML.

[2]  Mathias Niepert,et al.  Learning Convolutional Neural Networks for Graphs , 2016, ICML.

[3]  Trevor Darrell,et al.  Approximate Correspondences in High Dimensions , 2006, NIPS.

[4]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[6]  Ian D. Reid,et al.  DeepSetNet: Predicting Sets with Deep Neural Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[7]  Alexander J. Smola,et al.  Deep Sets , 2017, 1703.06114.

[8]  Daniel R. Figueiredo,et al.  struc2vec: Learning Node Representations from Structural Identity , 2017, KDD.

[9]  Michalis Vazirgiannis,et al.  Matching Node Embeddings for Graph Similarity , 2017, AAAI.

[10]  Paolo Frasconi,et al.  Shift Aggregate Extract Networks , 2017, Front. Robot. AI.

[11]  Matt J. Kusner,et al.  Supervised Word Mover's Distance , 2016, NIPS.

[12]  Jiaxin Li,et al.  SO-Net: Self-Organizing Network for Point Cloud Analysis , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[13]  Yixin Chen,et al.  An End-to-End Deep Learning Architecture for Graph Classification , 2018, AAAI.

[14]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[15]  Yee Whye Teh,et al.  Set Transformer , 2018, ICML.

[16]  A. Debnath,et al.  Structure-activity relationship of mutagenic aromatic and heteroaromatic nitro compounds. Correlation with molecular orbital energies and hydrophobicity. , 1991, Journal of medicinal chemistry.

[17]  Ian D. Reid,et al.  Joint Learning of Set Cardinality and State Distribution , 2017, AAAI.

[18]  David Haussler,et al.  Exploiting Generative Models in Discriminative Classifiers , 1998, NIPS.

[19]  Ana Margarida de Jesus,et al.  Improving Methods for Single-label Text Categorization , 2007 .

[20]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[21]  Trevor Darrell,et al.  The Pyramid Match Kernel: Efficient Learning with Sets of Features , 2007, J. Mach. Learn. Res..

[22]  Siwei Lyu,et al.  A Kernel Between Unordered Sets of Data: The Gaussian Mixture Approach , 2005, ECML.

[23]  Yijian Xiang,et al.  RetGK: Graph Kernels based on Return Probabilities of Random Walks , 2018, NeurIPS.

[24]  Jure Leskovec,et al.  Hierarchical Graph Representation Learning with Differentiable Pooling , 2018, NeurIPS.

[25]  Tony Jebara,et al.  Probability Product Kernels , 2004, J. Mach. Learn. Res..

[26]  Matt J. Kusner,et al.  From Word Embeddings To Document Distances , 2015, ICML.

[27]  Samy Bengio,et al.  Order Matters: Sequence to sequence for sets , 2015, ICLR.

[28]  Pinar Yanardag,et al.  Deep Graph Kernels , 2015, KDD.

[29]  Hans-Peter Kriegel,et al.  Protein function prediction via graph kernels , 2005, ISMB.