Robust time-series retrieval using probabilistic adaptive segmental alignment

Traditional pairwise sequence alignment is based on matching individual samples from two sequences, under time monotonicity constraints. However, in many application settings, matching subsequences (segments) instead of individual samples may bring in additional robustness to noise or local non-causal perturbations. This paper presents an approach to segmental sequence alignment that jointly segments and aligns two sequences, generalizing the traditional per-sample alignment. To accomplish this task, we introduce a distance metric between segments based on average pairwise distances and then present a modified pair-HMM (PHMM) that incorporates the proposed distance metric to solve the joint segmentation and alignment task. We also propose a relaxation to our model that improves the computational efficiency of the generic segmental PHMM. Our results demonstrate that this new measure of sequence similarity can lead to improved classification performance, while being resilient to noise, on a variety of sequence retrieval problems, from EEG to motion sequence classification.

[1]  AghabozorgiSaeed,et al.  Time-series clustering - A decade review , 2015 .

[2]  Eamonn J. Keogh A decade of progress in indexing and mining large time series databases , 2006, VLDB.

[3]  Vladimir Pavlovic,et al.  Isotonic CCA for sequence alignment and activity recognition , 2011, 2011 International Conference on Computer Vision.

[4]  Fernando De la Torre,et al.  Unsupervised Temporal Commonality Discovery , 2012, ECCV.

[5]  Henrik André-Jönsson,et al.  Using Signature Files for Querying Time-Series Data , 1997, PKDD.

[6]  Hayko Riemenschneider,et al.  Bag of Optical Flow Volumes for Image Sequence Recognition , 2009, BMVC.

[7]  U. Hoffmann,et al.  A Boosting Approach to P300 Detection with Application to Brain-Computer Interfaces , 2005, Conference Proceedings. 2nd International IEEE EMBS Conference on Neural Engineering, 2005..

[8]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, ICPR 2004.

[9]  Dimitrios Gunopulos,et al.  Discovering similar multidimensional trajectories , 2002, Proceedings 18th International Conference on Data Engineering.

[10]  Max Crochemore,et al.  Algorithms and Theory of Computation Handbook , 2010 .

[11]  Fernando Henrique Lopes da Silva,et al.  The hemodynamic response of the alpha rhythm: An EEG/fMRI study , 2007, NeuroImage.

[12]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[13]  Melanie Hilario,et al.  Distances and (Indefinite) Kernels for Sets of Objects , 2006, Sixth International Conference on Data Mining (ICDM'06).

[14]  Jignesh M. Patel,et al.  An efficient and accurate method for evaluating time series similarity , 2007, SIGMOD '07.

[15]  Eamonn J. Keogh,et al.  Accelerating the discovery of unsupervised-shapelets , 2015, Data Mining and Knowledge Discovery.

[16]  Tony Jebara,et al.  A Kernel Between Sets of Vectors , 2003, ICML.

[17]  Ying Wah Teh,et al.  Time-series clustering - A decade review , 2015, Inf. Syst..

[18]  Tido Röder,et al.  Documentation Mocap Database HDM05 , 2007 .

[19]  Franklin C. Crow,et al.  Summed-area tables for texture mapping , 1984, SIGGRAPH.

[20]  Mikhail J. Atallah,et al.  Algorithms and Theory of Computation Handbook , 2009, Chapman & Hall/CRC Applied Algorithms and Data Structures series.

[21]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[22]  Eamonn J. Keogh,et al.  Time series shapelets: a new primitive for data mining , 2009, KDD.

[23]  Hui Ding,et al.  Querying and mining of time series data: experimental comparison of representations and distance measures , 2008, Proc. VLDB Endow..

[24]  Serge J. Belongie,et al.  Behavior recognition via sparse spatio-temporal features , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[25]  Michael L. Lightstone,et al.  A new efficient approach for the removal of impulse noise from highly corrupted images , 1996, IEEE Trans. Image Process..

[26]  Jehoshua Bruck,et al.  Coding for delay-insensitive communication with partial synchronization , 1994, IEEE Trans. Inf. Theory.

[27]  Vladimir Pavlovic,et al.  A New Adaptive Segmental Matching Measure for Human Activity Recognition , 2013, 2013 IEEE International Conference on Computer Vision.

[28]  Fernando De la Torre,et al.  Canonical Time Warping for Alignment of Human Behavior , 2009, NIPS.

[29]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[30]  Paul Lukowicz,et al.  On general purpose time series similarity measures and their use as kernel functions in support vector machines , 2014, Inf. Sci..

[31]  Ronen Basri,et al.  Actions as space-time shapes , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[32]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.

[33]  Christoph H. Lampert,et al.  Efficient Subwindow Search: A Branch and Bound Framework for Object Localization , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Luc Van Gool,et al.  Variations of a Hough-Voting Action Recognition System , 2010, ICPR Contests.

[35]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[36]  Lei Chen,et al.  On The Marriage of Lp-norms and Edit Distance , 2004, VLDB.

[37]  Mario A. Nascimento,et al.  Proceedings of the Thirtieth international conference on Very large data bases - Volume 30 , 2004 .

[38]  Michael S. Ryoo,et al.  Human activity prediction: Early recognition of ongoing activities from streaming videos , 2011, 2011 International Conference on Computer Vision.

[39]  Vladimir Pavlovic,et al.  Improved sequence classification using adaptive segmental sequence alignment , 2012, ACML.

[40]  Lei Chen,et al.  Robust and fast similarity search for moving object trajectories , 2005, SIGMOD '05.