Fast Global Alignment Kernels

We propose novel approaches to cast the widely used family of Dynamic Time Warping (DTW) distances and similarities as positive definite kernels for time series. To this end, we provide new theoretical insights on the family of Global Alignment kernels introduced by Cuturi et al. (2007) and propose alternative kernels that are both positive definite and faster to compute. We provide experimental evidence that these alternatives are both faster and more efficient in classification tasks than other kernels based on the DTW formalism.
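
To make the contrast between the two formalisms concrete, the following is a minimal sketch of the standard textbook recursions, not the faster kernels proposed in this paper. DTW keeps only the cost of the cheapest alignment, whereas a global alignment kernel sums exp(-cost) over every alignment. The function names, the list-based sequence representation, and the squared-difference local cost are assumptions made for illustration. Whether the resulting kernel is positive definite depends on the choice of local cost, which this sketch does not address.

```python
import math


def squared_diff(a, b):
    """Hypothetical local cost between two scalar observations."""
    return (a - b) ** 2


def dtw_cost(x, y, dist=squared_diff):
    """Cost of the cheapest alignment between sequences x and y (classic DTW)."""
    n, m = len(x), len(y)
    inf = float("inf")
    # D[i][j] holds the minimal cost of aligning x[:i] with y[:j].
    D = [[inf] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            D[i][j] = dist(x[i - 1], y[j - 1]) + min(
                D[i - 1][j], D[i][j - 1], D[i - 1][j - 1]
            )
    return D[n][m]


def global_alignment_kernel(x, y, dist=squared_diff):
    """Sum of exp(-cost) over all alignments between x and y."""
    n, m = len(x), len(y)
    # M[i][j] holds the summed score of all alignments of x[:i] with y[:j].
    M = [[0.0] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 1.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.exp(-dist(x[i - 1], y[j - 1]))
            # Same three predecessors as DTW, but summed instead of minimized.
            M[i][j] = local * (M[i - 1][j] + M[i][j - 1] + M[i - 1][j - 1])
    return M[n][m]


if __name__ == "__main__":
    a = [0.0, 1.0, 2.0]
    b = [0.0, 1.5, 2.0, 2.0]
    print("DTW cost:", dtw_cost(a, b))
    print("Global alignment kernel:", global_alignment_kernel(a, b))
```

Both functions fill an (n+1) x (m+1) table, so this basic form costs O(nm) time per pair of series. With a local cost that is identically zero, the kernel value reduces to the number of possible alignments, which makes a convenient sanity check.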
