Multiscale Processing of Mass Spectrometry Data

This work addresses the problem of extracting signal content from protein mass spectrometry data. A multiscale decomposition of these spectra is used to focus on local scale-based structure by defining scale-specific features. Quantification of features is accompanied by an efficient method for calculating the location of features which avoids estimation of signal-to-noise ratios or bandwidths. Scale-based histograms serve as spectral-density-like functions indicating the regions of high density of features in the data. These regions provide bins within which features are quantified and compared across samples. As a preliminary step, the locations of prominent features within coarse-scale bins may be used for a crude registration of spectra. The multiscale decomposition, the scale-based feature definition, the calculation of feature locations, and subsequent quantification of features are carried out by way of a translation-invariant wavelet analysis.

[1]  Todd R. Ogden,et al.  Wavelet Methods for Time Series Analysis , 2002 .

[2]  Anestis Antoniadis,et al.  Wavelet Estimators in Nonparametric Regression: A Comparative Simulation Study , 2001 .

[3]  Jeffrey S. Morris,et al.  Quality control and peak finding for proteomics data collected from nipple aspirate fluid by surface-enhanced laser desorption and ionization. , 2003, Clinical chemistry.

[4]  Stéphane Mallat,et al.  Singularity detection and processing with wavelets , 1992, IEEE Trans. Inf. Theory.

[5]  J. Potter,et al.  A data-analytic strategy for protein biomarker discovery: profiling of high-dimensional proteomic data for cancer detection. , 2003, Biostatistics.

[6]  S. Mallat A wavelet tour of signal processing , 1998 .

[7]  Donald B. Percival,et al.  The WMTSA Wavelet Toolkit for Data Analysis in the Geosciences , 2003 .

[8]  Jeffrey S. Morris,et al.  Feature extraction and quantification for mass spectrometry in biomedical applications using the mean spectrum , 2005, Bioinform..

[9]  Philip R. Gafken,et al.  Evaluation of matrix‐assisted laser desorption/ionization‐time of flight mass spectrometry proteomic profiling: identification of alpha 2‐HS glycoprotein B‐chain as a biomarker of diet , 2005, Proteomics.

[10]  Li Hsu,et al.  Partially Supervised Learning Using an EM‐Boosting Algorithm , 2004, Biometrics.

[11]  Ziding Feng,et al.  Quantifying Peptide Signal in MALDI-TOF Mass Spectrometry Data* , 2005, Molecular & Cellular Proteomics.

[12]  Robert Tibshirani,et al.  Sample classification from protein mass spectrometry, by 'peak probability contrasts' , 2004, Bioinform..