Adaptive binning: An improved binning method for metabolomics data using the undecimated wavelet transform

Statistical analysis of metabolomic datasets can lead to erroneous interpretation of results when the spectra are misaligned. Pre-processing methods for peak alignment and data averaging (binning, also called bucketing) are therefore used to improve data quality. Here we introduce adaptive binning, an improved method that uses the undecimated wavelet transform to correct for variation in chemical shifts in nuclear magnetic resonance (NMR) spectroscopy data. Applied to theoretical and metabolomics NMR spectra, adaptive binning significantly increases the ratio of inter-class to intra-class variation and improves data interpretability compared with conventional binning.
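The abstract does not spell out the algorithm, so the following is only a minimal sketch of how wavelet-guided binning might look, not the authors' implementation. It assumes that a reference spectrum is smoothed with an undecimated (à trous) transform using a [1, 2, 1]/4 kernel, that bin edges are placed at local minima of the smoothed reference, and that each spectrum is then summed within those bins. The function names, the kernel, and the `levels` parameter are illustrative assumptions.

```python
import numpy as np

def atrous_smooth(x, levels=3):
    """Undecimated (a trous) smoothing: no downsampling, so the output keeps
    the input length. Kernel [1, 2, 1]/4 is an assumed choice."""
    a = np.asarray(x, dtype=float)
    for j in range(levels):
        step = 2 ** j  # gap between kernel taps doubles at each level
        p = np.pad(a, step, mode="reflect")
        a = 0.25 * p[:-2 * step] + 0.5 * p[step:-step] + 0.25 * p[2 * step:]
    return a

def adaptive_bin_edges(reference, levels=3):
    """Place bin edges at local minima of the smoothed reference spectrum."""
    s = atrous_smooth(reference, levels)
    interior = np.arange(1, len(s) - 1)
    is_min = (s[1:-1] < s[:-2]) & (s[1:-1] <= s[2:])
    return np.concatenate(([0], interior[is_min], [len(s)]))

def bin_spectrum(spectrum, edges):
    """Sum intensities within each [edges[i], edges[i+1]) interval."""
    return np.add.reduceat(np.asarray(spectrum, dtype=float), edges[:-1])

# Synthetic example: two spectra whose peaks are slightly shifted.
grid = np.arange(1000)
def peak(center, width=20.0):
    return np.exp(-0.5 * ((grid - center) / width) ** 2)

spec_a = peak(300) + peak(700)
spec_b = peak(305) + peak(695)           # same peaks, small shift
reference = (spec_a + spec_b) / 2.0      # assumed reference: the mean spectrum

edges = adaptive_bin_edges(reference)
binned_a = bin_spectrum(spec_a, edges)
binned_b = bin_spectrum(spec_b, edges)
```

Because the edges follow minima of the reference rather than a fixed grid, a small shift in peak position tends to stay inside one bin instead of splitting the peak across two, which is the motivation the abstract gives for adaptive over conventional binning.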
