Kernels on Structured Objects Through Nested Histograms

We propose a family of kernels for structured objects which is based on the bag-of-components paradigm. However, rather than decomposing each complex object into the single histogram of its components, we use for each object a family of nested histograms, where each histogram in this hierarchy describes the object seen from an increasingly granular perspective. We use this hierarchy of histograms to define elementary kernels which can detect coarse and fine similarities between the objects. We compute through an efficient averaging trick a mixture of such specific kernels, to propose a final kernel value which weights efficiently local and global matches. We propose experimental results on an image retrieval experiment which show that this mixture is an effective template procedure to be used with kernels on histograms.

[1]  Guy Lebanon Sequential Document Representations and Simplicial Curves , 2006, UAI.

[2]  Kenji Fukumizu,et al.  Semigroup Kernels on Measures , 2005, J. Mach. Learn. Res..

[3]  Jason Weston,et al.  Mismatch String Kernels for SVM Protein Classification , 2002, NIPS.

[4]  Patrick Haffner,et al.  Support vector machines for histogram-based image classification , 1999, IEEE Trans. Neural Networks.

[5]  Vapnik,et al.  SVMs for Histogram Based Image Classification , 1999 .

[6]  Nello Cristianini,et al.  Learning the Kernel Matrix with Semidefinite Programming , 2002, J. Mach. Learn. Res..

[7]  B. Schölkopf,et al.  Accurate Splice Site Detection for Caenorhabditis elegans , 2004 .

[8]  Shun-ichi Amari,et al.  Methods of information geometry , 2000 .

[9]  Matthias Hein,et al.  Hilbertian Metrics and Positive Definite Kernels on Probability Measures , 2005, AISTATS.

[10]  Thorsten Joachims,et al.  Learning to classify text using support vector machines - methods, theory and algorithms , 2002, The Kluwer international series in engineering and computer science.

[11]  C. Berg,et al.  Harmonic Analysis on Semigroups , 1984 .

[12]  Olivier Catoni,et al.  Statistical learning theory and stochastic optimization , 2004 .

[13]  David Haussler,et al.  Convolution kernels on discrete structures , 1999 .

[14]  John D. Lafferty,et al.  Diffusion Kernels on Statistical Manifolds , 2005, J. Mach. Learn. Res..

[15]  Trevor Darrell,et al.  The pyramid match kernel: discriminative classification with sets of image features , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[16]  Jean-Philippe Vert,et al.  A novel representation of protein sequences for prediction of subcellular location using support vector machines , 2005, Protein science : a publication of the Protein Society.

[17]  Tony Jebara,et al.  A Kernel Between Sets of Vectors , 2003, ICML.

[18]  Bernhard Schölkopf,et al.  Kernel Methods in Computational Biology , 2005 .

[19]  Frans M. J. Willems,et al.  The context-tree weighting method: basic properties , 1995, IEEE Trans. Inf. Theory.

[20]  Jean-Philippe Vert,et al.  The context-tree kernel for strings , 2005, Neural Networks.