Graph kernels

We present a unified framework to study graph kernels, special cases of which include the random walk (Gärtner et al., 2003; Borgwardt et al., 2005) and marginalized (Kashima et al., 2003, 2004; Mahé et al., 2004) graph kernels. Through reduction to a Sylvester equation we improve the time complexity of kernel computation between unlabeled graphs with n vertices from O(n^6) to O(n^3). We find a spectral decomposition approach even more efficient when computing entire kernel matrices. For labeled graphs we develop conjugate gradient and fixed-point methods that take O(dn^3) time per iteration, where d is the size of the label set. By extending the necessary linear algebra to Reproducing Kernel Hilbert Spaces (RKHS) we obtain the same result for d-dimensional edge kernels, and O(n^4) in the infinite-dimensional case; on sparse graphs these algorithms only take O(n^2) time per iteration in all cases. Experiments on graphs from bioinformatics and other application domains show that these techniques can speed up computation of the kernel by an order of magnitude or more. We also show that certain rational kernels (Cortes et al., 2002, 2003, 2004) when specialized to graphs reduce to our random walk graph kernel. Finally, we relate our framework to R-convolution kernels (Haussler, 1999) and provide a kernel that is close to the optimal assignment kernel of Fröhlich et al. (2006) yet provably positive semi-definite.
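To make the complexity gap concrete, the following is a minimal sketch, not the authors' implementation. It assumes the geometric random walk kernel in the form q^T (I - lam * W)^{-1} p, where W is the Kronecker product of the two adjacency matrices, with uniform starting and stopping distributions p and q. The function names, the decay factor `lam`, and the two toy graphs are hypothetical choices made for illustration. The first function builds the product graph explicitly and solves a linear system of size n^2, which costs O(n^6). The second runs a fixed-point iteration that never forms the product graph, relying on the identity (A1 ⊗ A2) vec(X) = vec(A1 X A2^T) for row-major vec, so each iteration costs O(n^3).

```python
import numpy as np


def rw_kernel_direct(A1, A2, lam):
    """Geometric random walk kernel via the explicit product graph, O(n^6)."""
    n1, n2 = A1.shape[0], A2.shape[0]
    N = n1 * n2
    W = np.kron(A1, A2)          # adjacency matrix of the direct product graph
    p = np.full(N, 1.0 / N)      # assumed uniform starting distribution
    q = np.full(N, 1.0 / N)      # assumed uniform stopping distribution
    x = np.linalg.solve(np.eye(N) - lam * W, p)
    return float(q @ x)


def rw_kernel_fixed_point(A1, A2, lam, tol=1e-12, max_iter=1000):
    """Same kernel via the iteration X <- P + lam * A1 X A2^T, O(n^3) per step.

    Converges when lam is smaller than 1 / (rho(A1) * rho(A2)), where rho
    denotes the spectral radius.
    """
    n1, n2 = A1.shape[0], A2.shape[0]
    P = np.full((n1, n2), 1.0 / (n1 * n2))   # p reshaped to an n1 x n2 matrix
    Q = np.full((n1, n2), 1.0 / (n1 * n2))   # q reshaped the same way
    X = P.copy()
    for _ in range(max_iter):
        X_next = P + lam * (A1 @ X @ A2.T)   # product graph is never formed
        converged = np.max(np.abs(X_next - X)) < tol
        X = X_next
        if converged:
            break
    return float(np.sum(Q * X))


# Two small unlabeled graphs: a triangle and a path on three vertices.
triangle = np.array([[0, 1, 1], [1, 0, 1], [1, 1, 0]], dtype=float)
path = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)
lam = 0.1  # below 1 / (2 * sqrt(2)), so the iteration converges

k_direct = rw_kernel_direct(triangle, path, lam)
k_fixed = rw_kernel_fixed_point(triangle, path, lam)
print(k_direct, k_fixed)
```

Both functions should agree up to the iteration tolerance. The Sylvester equation and spectral decomposition routes mentioned in the abstract exploit the same Kronecker structure but are not shown here.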

[1]  H. Wiener Structural determination of paraffin boiling points. , 1947, Journal of the American Chemical Society.

[2]  H. Hotelling A Generalized T Test and Measure of Multivariate Dispersion , 1951 .

[3]  R. Fortet,et al.  Convergence de la répartition empirique vers la répartition théorique , 1953 .

[4]  H. Piaggio Kernel Functions , 1955, Nature.

[5]  P. Kelly A congruence theorem for trees. , 1957 .

[6]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[7]  Stephen Warshall,et al.  A Theorem on Boolean Matrices , 1962, JACM.

[8]  Stephen J. Garland,et al.  Algorithm 97: Shortest path , 1962, Commun. ACM.

[9]  V. Vapnik Pattern recognition using generalized portrait method , 1963 .

[10]  P. Bickel A Distribution Free Version of the Smirnov Two Sample Test in the $p$-Variate Case , 1969 .

[11]  Frank Harary,et al.  Graph Theory , 2016 .

[12]  Robert L. Hemminger,et al.  On reconstructing a graph , 1969 .

[13]  P. Stockmeyer,et al.  On Reconstruction of Matrices , 1971 .

[14]  E. Lawler A PROCEDURE FOR COMPUTING THE K BEST SOLUTIONS TO DISCRETE OPTIMIZATION PROBLEMS AND ITS APPLICATION TO THE SHORTEST PATH PROBLEM , 1972 .

[15]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[16]  C. Baker Joint measures and cross-covariance operators , 1973 .

[17]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[18]  Daniel J. Lehmann,et al.  Algebraic Structures for Transitive Closure , 1976, Theor. Comput. Sci..

[19]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[20]  J. Friedman,et al.  Multivariate generalizations of the Wald--Wolfowitz and Smirnov two-sample tests , 1979 .

[21]  Jean Berstel,et al.  Transductions and context-free languages , 1979, Teubner Studienbücher : Informatik.

[22]  R. J. Serfling Approximation Theorems of Mathematical Statistics , 1980 .

[23]  Horst Bunke,et al.  Inexact graph matching for structural pattern recognition , 1983, Pattern Recognit. Lett..

[24]  Robert E. Tarjan,et al.  Fibonacci heaps and their uses in improved network optimization algorithms , 1984, JACM.

[25]  C. Berg,et al.  Harmonic Analysis on Semigroups , 1984 .

[26]  Arto Salomaa,et al.  Semirings, Automata and Languages , 1985 .

[27]  Arto Salomaa,et al.  Semirings, Automata, Languages , 1985, EATCS Monographs on Theoretical Computer Science.

[28]  Dieter Jungnickel,et al.  Graphen, Netzwerke und Algorithmen , 1987 .

[29]  Colin McDiarmid,et al.  Surveys in Combinatorics, 1989: On the method of bounded differences , 1989 .

[30]  D. Bonchev Chemical Graph Theory: Introduction and Fundamentals , 1991 .

[31]  A. Debnath,et al.  Structure-activity relationship of mutagenic aromatic and heteroaromatic nitro compounds. Correlation with molecular orbital energies and hydrophobicity. , 1991, Journal of medicinal chemistry.

[32]  E. Giné,et al.  On the Bootstrap of $U$ and $V$ Statistics , 1992 .

[33]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[34]  Alan J. Laub,et al.  Solution of the Sylvester matrix equation AXBT + CXDT = E , 1992, TOMS.

[35]  N. H. Anderson,et al.  Two-sample test statistics for measuring discrepancies between two multivariate probability density functions using kernel-based density estimates , 1994 .

[36]  R. Agrawal Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[37]  N. L. Johnson,et al.  Continuous Univariate Distributions. , 1995 .

[38]  Michael Riley,et al.  Speech Recognition by Composition of Weighted Finite Automata , 1996, ArXiv.

[39]  Daphne Koller,et al.  Toward Optimal Feature Selection , 1996, ICML.

[40]  N. Pitsianis The Kronecker Product in Approximation and Fast Transform Generation , 1997 .

[41]  R. Dipert The mathematical structure of the world: The world as graph , 1997 .

[42]  X. Yao,et al.  Structure of the carboxy-terminal fragment of the apo-biotin carboxyl carrier subunit of Escherichia coli acetyl-CoA carboxylase. , 1997, Biochemistry.

[43]  Brendan D. McKay,et al.  Small graphs are reconstructible , 1997, Australas. J Comb..

[44]  Yuri Gurevich,et al.  From Invariants to Canonization , 2001, Bull. EATCS.

[45]  Paul S. Bradley,et al.  Feature Selection via Concave Minimization and Support Vector Machines , 1998, ICML.

[46]  Horst Bunke,et al.  A graph distance metric based on the maximal common subgraph , 1998, Pattern Recognit. Lett..

[47]  Alin Deutsch,et al.  A Query Language for XML , 1999, Comput. Networks.

[48]  Christopher J. C. Burges A Tutorial on Support Vector Machines for Pattern Recognition , 1998 .

[49]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[50]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[51]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[52]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[53]  N. Henze,et al.  On the multivariate runs test , 1999 .

[54]  David Haussler,et al.  Convolution kernels on discrete structures , 1999 .

[55]  Horst Bunke,et al.  Error Correcting Graph Matching: On the Influence of the Underlying Cost Function , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[56]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[57]  W. Imrich,et al.  Product Graphs: Structure and Recognition , 2000 .

[58]  Sayan Mukherjee,et al.  Feature Selection for SVMs , 2000, NIPS.

[59]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[60]  Horst Bunke,et al.  Recent developments in graph matching , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[61]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[62]  G. W. Stewart,et al.  The decompositional approach to matrix computation , 2000, Comput. Sci. Eng..

[63]  Abraham Kandel,et al.  On the Minimum Common Supergraph of Two Graphs , 2000, Computing.

[64]  Hannu Toivonen,et al.  Statistical evaluation of the predictive toxicology challenge , 2000 .

[65]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[66]  Stephen J. Wright,et al.  Numerical Optimization (Springer Series in Operations Research and Financial Engineering) , 2000 .

[67]  C. Loan The ubiquitous Kronecker product , 2000 .

[68]  Nello Cristianini,et al.  Classification using String Kernels , 2000 .

[69]  Peter L. Bartlett,et al.  Rademacher and Gaussian Complexities: Risk Bounds and Structural Results , 2003, J. Mach. Learn. Res..

[70]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[71]  Ina Koch,et al.  Enumerating all connected maximal common subgraphs in two graphs , 2001, Theor. Comput. Sci..

[72]  George Karypis,et al.  Frequent subgraph discovery , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[73]  Luc De Raedt,et al.  Molecular feature mining in HIV data , 2001, KDD '01.

[74]  William Bialek,et al.  Entropy and Inference, Revisited , 2001, NIPS.

[75]  Ingo Steinwart,et al.  On the Influence of the Kernel on the Consistency of Support Vector Machines , 2002, J. Mach. Learn. Res..

[76]  Marc G. Genton,et al.  Classes of Kernels for Machine Learning: A Statistics Perspective , 2002, J. Mach. Learn. Res..

[77]  Gabriel Valiente,et al.  A graph distance metric combining maximum common subgraph and minimum common supergraph , 2001, Pattern Recognit. Lett..

[78]  Mehryar Mohri,et al.  Rational Kernels , 2002, NIPS.

[79]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[80]  Ehud Gudes,et al.  Computing frequent graph patterns from semistructured data , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[81]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[82]  Mehryar Mohri,et al.  Semiring Frameworks and Algorithms for Shortest-Distance Problems , 2002, J. Autom. Lang. Comb..

[83]  Kiyoshi Asai,et al.  Marginalized kernels for biological sequences , 2002, ISMB.

[84]  John D. Lafferty,et al.  Diffusion Kernels on Graphs and Other Discrete Input Spaces , 2002, ICML.

[85]  Ioannis Xenarios,et al.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions , 2002, Nucleic Acids Res..

[86]  Dudley,et al.  Real Analysis and Probability: Measurability: Borel Isomorphism and Analytic Sets , 2002 .

[87]  Alexander J. Smola,et al.  Fast Kernels for String and Tree Matching , 2002, NIPS.

[88]  P. Hall,et al.  Permutation tests for equality of distributions in high‐dimensional settings , 2002 .

[89]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[90]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[91]  L. Hood,et al.  A Genomic Regulatory Network for Development , 2002, Science.

[92]  R. Tibshirani,et al.  Diagnosis of multiple cancer types by shrunken centroids of gene expression , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[93]  Risi Kondor,et al.  Diffusion kernels on graphs and other discrete structures , 2002, ICML 2002.

[94]  Christian Borgelt,et al.  Mining molecular fragments: finding relevant substructures of molecules , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[95]  Roberto Todeschini,et al.  Handbook of Molecular Descriptors , 2002 .

[96]  Mehryar Mohri,et al.  Positive Definite Rational Kernels , 2003, COLT.

[97]  Z N Oltvai,et al.  Evolutionary conservation of motif constituents in the yeast protein interaction network , 2003, Nature Genetics.

[98]  Horst Bunke,et al.  Graph-Based Tools for Data Mining and Machine Learning , 2003, MLDM.

[99]  Hisashi Kashima,et al.  Marginalized Kernels Between Labeled Graphs , 2003, ICML.

[100]  Ashwin Srinivasan,et al.  Statistical Evaluation of the Predictive Toxicology Challenge 2000-2001 , 2003, Bioinform..

[101]  Thomas Gärtner,et al.  A survey of kernels for structured data , 2003, SKDD.

[102]  J. Whisstock,et al.  Prediction of protein function from protein sequence and structure , 2003, Quarterly Reviews of Biophysics.

[103]  J. Gasteiger,et al.  Chemoinformatics: A Textbook , 2003 .

[104]  E. Ordentlich,et al.  Inequalities for the L1 Deviation of the Empirical Distribution , 2003 .

[105]  Alexander J. Smola,et al.  Kernels and Regularization on Graphs , 2003, COLT.

[106]  Thomas Gärtner,et al.  On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.

[107]  Takashi Washio,et al.  State of the art of graph-based data mining , 2003, SKDD.

[108]  James R. Knight,et al.  A Protein Interaction Map of Drosophila melanogaster , 2003, Science.

[109]  Bernhard Schölkopf,et al.  Use of the Zero-Norm with Linear Models and Kernel Methods , 2003, J. Mach. Learn. Res..

[110]  Sundar Vishwanathan,et al.  Kernel Methods Fast Algorithms and real life applications , 2003 .

[111]  H. Kubinyi Drug research: myths, hype and reality , 2003, Nature Reviews Drug Discovery.

[112]  Jan Ramon,et al.  Expressivity versus efficiency of graph kernels , 2003 .

[113]  Trevor Hastie,et al.  Class Prediction by Nearest Shrunken Centroids, with Applications to DNA Microarrays , 2003 .

[114]  Wei Wang,et al.  Efficient mining of frequent subgraphs in the presence of isomorphism , 2003, Third IEEE International Conference on Data Mining.

[115]  Bernhard Schölkopf,et al.  Kernel Methods in Computational Biology , 2005 .

[116]  P. Dobson,et al.  Distinguishing enzyme structures from non-enzymes without alignments. , 2003, Journal of molecular biology.

[117]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[118]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[119]  Thomas Gärtner,et al.  Cyclic pattern kernels for predictive graph mining , 2004, KDD.

[120]  Susumu Goto,et al.  The KEGG resource for deciphering the genome , 2004, Nucleic Acids Res..

[121]  Antje Chang,et al.  New Developments , 2003 .

[122]  Tatsuya Akutsu,et al.  Extensions of marginalized graph kernels , 2004, ICML.

[123]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[124]  Michael I. Jordan,et al.  Dimensionality Reduction for Supervised Learning with Reproducing Kernel Hilbert Spaces , 2004, J. Mach. Learn. Res..

[125]  George Karypis,et al.  GREW - a scalable frequent subgraph discovery algorithm , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[126]  S. L. Wong,et al.  A Map of the Interactome Network of the Metazoan C. elegans , 2004, Science.

[127]  Uri Alon,et al.  Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs , 2004, Bioinform..

[128]  Joost N. Kok,et al.  A quickstart in frequent structure mining can make a difference , 2004, KDD.

[129]  Joos Vandewalle,et al.  Computation of the Canonical Decomposition by Means of a Simultaneous Generalized Schur Decomposition , 2005, SIAM J. Matrix Anal. Appl..

[130]  R. Tibshirani,et al.  Use of gene-expression profiling to identify prognostic subclasses in adult acute myeloid leukemia. , 2004, The New England journal of medicine.

[131]  Mehryar Mohri,et al.  Rational Kernels: Theory and Algorithms , 2004, J. Mach. Learn. Res..

[132]  A. E. Hirsh,et al.  Coevolution of gene expression among interacting proteins , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[133]  H. Kashima,et al.  Kernels for graphs , 2004 .

[134]  Mario Vento,et al.  Thirty Years Of Graph Matching In Pattern Recognition , 2004, Int. J. Pattern Recognit. Artif. Intell..

[135]  Sebastian Wernicke,et al.  A Faster Algorithm for Detecting Network Motifs , 2005, WABI.

[136]  Felix Naumann,et al.  DogmatiX tracks down duplicates in XML , 2005, SIGMOD '05.

[137]  Terry Caelli,et al.  Graphical models for graph matching: Approximate models and optimal algorithms , 2005, Pattern Recognit. Lett..

[138]  Bernhard Schölkopf,et al.  Measuring Statistical Dependence with Hilbert-Schmidt Norms , 2005, ALT.

[139]  Roland Eils,et al.  Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes , 2005, BMC Bioinformatics.

[140]  Alexander J. Smola,et al.  Joint Regularization , 2005, ESANN.

[141]  Paolo Frasconi,et al.  Weighted decomposition kernels , 2005, ICML.

[142]  László Györfi,et al.  On the asymptotic properties of a nonparametric $L_1$-test statistic of homogeneity , 2005, IEEE Transactions on Information Theory.

[143]  Hans-Peter Kriegel,et al.  Shortest-path kernels on graphs , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[144]  Andreas Zell,et al.  Optimal assignment kernels for attributed molecular graphs , 2005, ICML.

[145]  Horst Bunke,et al.  Self-organizing maps for learning the edit costs in graph matching , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[146]  Fernando Pereira,et al.  Weighted Automata in Text and Speech Processing , 2005, ArXiv.

[147]  Byoung-Tak Zhang,et al.  Construction of phylogenetic trees by kernel-based comparative analysis of metabolic networks , 2006, BMC Bioinformatics.

[148]  Yiming Yang,et al.  Analysis of recursive gene selection approaches from microarray data , 2005, Bioinform..

[149]  P. Rosenbaum An exact distribution‐free test comparing two multivariate distributions based on adjacency , 2005 .

[150]  George Karypis,et al.  Frequent substructure-based approaches for classifying chemical compounds , 2003, IEEE Transactions on Knowledge and Data Engineering.

[151]  Hans-Peter Kriegel,et al.  Protein function prediction via graph kernels , 2005, ISMB.

[152]  Hui Lu,et al.  Correlation between gene expression profiles and protein-protein interactions within and across genomes , 2005, Bioinform..

[153]  George Karypis,et al.  Finding Frequent Patterns in a Large Sparse Graph , 2005, Data Mining and Knowledge Discovery.

[154]  Thomas Hofmann,et al.  Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[155]  Pierre Baldi,et al.  Graph kernels for chemical informatics , 2005, Neural Networks.

[156]  Ali Esmaili,et al.  Probability and Random Processes , 2005, Technometrics.

[157]  S. L. Wong,et al.  Towards a proteome-scale map of the human protein–protein interaction network , 2005, Nature.

[158]  George Karypis,et al.  Comparison of descriptor spaces for chemical compound retrieval and classification , 2006, Sixth International Conference on Data Mining (ICDM'06).

[159]  Igor Jurisica,et al.  Efficient estimation of graphlet frequency distributions in protein-protein interaction networks , 2006, Bioinform..

[160]  S. V. N. Vishwanathan,et al.  Fast Computation of Graph Kernels , 2006, NIPS.

[161]  T. Ideker,et al.  Modeling cellular machinery through biological network comparison , 2006, Nature Biotechnology.

[162]  Alexander J. Smola,et al.  Kernel extrapolation , 2006, Neurocomputing.

[163]  D. Huson,et al.  Application of phylogenetic networks in evolutionary studies. , 2006, Molecular biology and evolution.

[164]  Conrad Sanderson,et al.  An Efficient Alternative to SVM Based Recursive Feature Elimination with Applications in Natural Language Processing and Bioinformatics , 2006, Australian Conference on Artificial Intelligence.

[165]  Hans-Peter Kriegel,et al.  Class Prediction from Time Series Gene Expression Profiles Using Dynamical Systems Kernels , 2005, Pacific Symposium on Biocomputing.

[166]  L. Ein-Dor,et al.  Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[167]  Bernhard Schölkopf,et al.  A Kernel Method for the Two-Sample-Problem , 2006, NIPS.

[168]  Hans-Peter Kriegel,et al.  Pattern Mining in Frequent Dynamic Subgraphs , 2006, Sixth International Conference on Data Mining (ICDM'06).

[169]  Meng Xu,et al.  NetAlign: a web-based tool for comparison of protein interaction networks , 2006, Bioinform..

[170]  Hans-Peter Kriegel,et al.  Integrating structured biological data by Kernel Maximum Mean Discrepancy , 2006, ISMB.

[171]  Bernhard Schölkopf,et al.  Correcting Sample Selection Bias by Unlabeled Data , 2006, NIPS.

[172]  Alfred O. Hero,et al.  A binary linear programming formulation of the graph edit distance , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[173]  Wei-Po Lee,et al.  Differential evolutionary conservation of motif modes in the yeast protein interaction network , 2006, BMC Genomics.

[174]  Andreas Zell,et al.  Kernel Functions for Attributed Molecular Graphs – A New Similarity‐Based Approach to ADME Prediction in Classification and Regression , 2006 .

[175]  Ravi Kumar,et al.  Structure and evolution of online social networks , 2006, KDD '06.

[176]  Horst Bunke,et al.  Bridging the Gap between Graph Edit Distance and Kernel Machines , 2007, Series in Machine Perception and Artificial Intelligence.

[177]  Gabriele Ausiello,et al.  MINT: the Molecular INTeraction database , 2006, Nucleic Acids Res..

[178]  Hans-Peter Kriegel,et al.  An Efficient Sampling Scheme For Comparison of Large Graphs , 2007, MLG.

[179]  Kaspar Riesen,et al.  Bipartite Graph Matching for Computing the Edit Distance of Graphs , 2007, GbRPR.

[180]  Jiawei Han,et al.  Discriminative Frequent Pattern Analysis for Effective Classification , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[181]  J. Y. Yen,et al.  Finding the K Shortest Loopless Paths in a Network , 2007 .

[182]  Horst Bunke,et al.  Automatic learning of cost functions for graph edit distance , 2007, Inf. Sci..

[183]  Bernhard Schölkopf,et al.  A Kernel Approach to Comparing Distributions , 2007, AAAI.

[184]  Hans-Peter Kriegel,et al.  Graph Kernels For Disease Outcome Prediction From Protein-Protein Interaction Networks , 2006, Pacific Symposium on Biocomputing.

[185]  Le Song,et al.  A dependence maximization view of clustering , 2007, ICML '07.

[186]  Le Song,et al.  Gene selection via the BAHSIC family of algorithms , 2007, ISMB/ECCB.

[187]  Natasa Przulj,et al.  Biological network comparison using graphlet degree distribution , 2007, Bioinform..

[188]  Le Song,et al.  Supervised feature selection via dependence estimation , 2007, ICML '07.

[189]  Karsten M. Borgwardt,et al.  The skew spectrum of graphs , 2008, ICML '08.

[190]  Jean-Philippe Vert,et al.  The optimal assignment kernel is not positive definite , 2008, ArXiv.

[191]  Oleg Verbitsky,et al.  From Invariants to Canonization in Parallel , 2006, CSR.

[192]  Kurt Mehlhorn,et al.  Efficient graphlet kernels for large graph comparison , 2009, AISTATS.