Correlation optimized warping and dynamic time warping as preprocessing methods for chromatographic data

Two different algorithms for time‐alignment as a preprocessing step in linear factor models are studied. Correlation optimized warping and dynamic time warping are both presented in the literature as methods that can eliminate shift‐related artifacts from measurements by correcting a sample vector towards a reference. In this study both the theoretical properties and the practical implications of using signal warping as preprocessing for chromatographic data are investigated. The connection between the two algorithms is also discussed. The findings are illustrated by means of a case study of principal component analysis on a real data set, including manifest retention time artifacts, of extracts from coffee samples stored under different packaging conditions for varying storage times. We concluded that for the data presented here dynamic time warping with rigid slope constraints and correlation optimized warping are superior to unconstrained dynamic time warping; both considerably simplify interpretation of the factor model results. Unconstrained dynamic time warping was found to be too flexible for this chromatographic data set, resulting in an overcompensation of the observed shifts and suggesting the unsuitability of this preprocessing method for this type of signals. Copyright © 2004 John Wiley & Sons, Ltd.

[1]  Athanassios Kassidas Fault Detection and Diagnosis in Dynamic Multivariable Chemical Processes Using Speech Recognition Methods , 1997 .

[2]  C. Posten,et al.  Supervision of bioprocesses using a dynamic time warping algorithm , 1996 .

[3]  L. Rabiner,et al.  Performance trade‐offs in dynamic time warping algorithms for isolated word recognition , 1979 .

[4]  Thomas L. Isenhour,et al.  Time-warping algorithm applied to chromatographic peak matching gas chromatography/Fourier transform infrared/mass spectrometry , 1987 .

[5]  H. Sakoe,et al.  Two-level DP-matching--A dynamic programming-based pattern matching algorithm for connected word recognition , 1979 .

[6]  George M. Church,et al.  Aligning gene expression time series with time warping algorithms , 2001, Bioinform..

[7]  A K Smilde,et al.  Influence of temperature on vibrational spectra and consequences for the predictive ability of multivariate models. , 1998, Analytical chemistry.

[8]  Rasmus Bro,et al.  Chemometrics in food science—a demonstration of the feasibility of a highly exploratory, inductive evaluation strategy of fundamental scientific significance , 1998 .

[9]  J. Carstensen,et al.  Aligning of single and multiple wavelength chromatographic profiles for chemometric data analysis using correlation optimised warping , 1998 .

[10]  D. Massart,et al.  A comparison of two algorithms for warping of analytical signals , 2002 .

[11]  G. Dunteman Principal Components Analysis , 1989 .

[12]  K. Markides,et al.  Chromatographic alignment by warping and dynamic programming as a pre-processing tool for PARAFAC modelling of liquid chromatography-mass spectrometry data. , 2002, Journal of chromatography. A.

[13]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[14]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[15]  T. F. Moran,et al.  Characterization of normal human cells by pyrolysis gas chromatography mass spectrometry. , 1979, Biomedical mass spectrometry.

[16]  A. Smilde,et al.  Dynamic time warping of spectroscopic BATCH data , 2003 .

[17]  Aaron E. Rosenberg,et al.  Performance tradeoffs in dynamic time warping algorithms for isolated word recognition , 1980 .

[18]  C. Myers,et al.  A level building dynamic time warping algorithm for connected word recognition , 1981 .

[19]  P. A. Taylor,et al.  Synchronization of batch trajectories using dynamic time warping , 1998 .

[20]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[21]  P. A. Taylor,et al.  Off-line diagnosis of deterministic faults in continuous dynamic multivariable processes using speech recognition methods , 1998 .

[22]  J. Edward Jackson,et al.  A User's Guide to Principal Components. , 1991 .