The skew spectrum of graphs

The central issue in representing graph-structured data instances in learning algorithms is designing features which are invariant to permuting the numbering of the vertices. We present a new system of invariant graph features which we call the skew spectrum of graphs. The skew spectrum is based on mapping the adjacency matrix of any (weigted, directed, unlabeled) graph to a function on the symmetric group and computing bispectral invariants. The reduced form of the skew spectrum is computable in O(n3) time, and experiments show that on several benchmark datasets it can outperform state of the art graph kernels.

[1]  Hans-Peter Kriegel,et al.  Shortest-path kernels on graphs , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[2]  Leonidas J. Guibas,et al.  Efficient Inference for Distributions on Permutations , 2007, NIPS.

[3]  John Shawe-Taylor,et al.  Symmetries and discriminability in feedforward network architectures , 1993, IEEE Trans. Neural Networks.

[4]  Michael Clausen,et al.  Fast Generalized Fourier Transforms , 1989, Theor. Comput. Sci..

[5]  Tony Jebara,et al.  Multi-object tracking with representations of the symmetric group , 2007, AISTATS.

[6]  Risi Kondor,et al.  A novel set of rotationally and translationally invariant features for images based on the non-commutative bispectrum , 2007, cs/0701127.

[7]  G. James,et al.  The Representation Theory of the Symmetric Group , 2009 .

[8]  Zaïd Harchaoui,et al.  Image Classification with Segmentation Graph Kernels , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Daniel N. Rockmore,et al.  Some applications of generalized FFT's , 1997, Groups and Computation.

[10]  R. Kakarala A group-theoretic approach to the triple correlation , 1993, [1993 Proceedings] IEEE Signal Processing Workshop on Higher-Order Statistics.

[11]  Ramakrishna Kakarala,et al.  Triple correlation on groups , 1992 .

[12]  Hans-Peter Kriegel,et al.  Protein function prediction via graph kernels , 2005, ISMB.

[13]  Thomas Gärtner,et al.  A survey of kernels for structured data , 2003, SKDD.

[14]  R. Kondor The skew spectrum of functions on finite groups and their homogeneous spaces , 2007, 0712.4259.

[15]  P. Diaconis Group representations in probability and statistics , 1988 .

[16]  George Karypis,et al.  Comparison of descriptor spaces for chemical compound retrieval and classification , 2006, Sixth International Conference on Data Mining (ICDM'06).

[17]  Ravi Kumar,et al.  Structure and evolution of online social networks , 2006, KDD '06.

[18]  T. Ideker,et al.  Modeling cellular machinery through biological network comparison , 2006, Nature Biotechnology.

[19]  Thomas Gärtner,et al.  On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.

[20]  Antje Chang,et al.  BRENDA , the enzyme database : updates and major new developments , 2003 .

[21]  D. Bonchev Chemical Graph Theory: Introduction and Fundamentals , 1991 .

[22]  Michael Collins,et al.  Convolution Kernels for Natural Language , 2001, NIPS.

[23]  A. Debnath,et al.  Structure-activity relationship of mutagenic aromatic and heteroaromatic nitro compounds. Correlation with molecular orbital energies and hydrophobicity. , 1991, Journal of medicinal chemistry.