Normalizing and integrating metabolomics data.

Metabolomics research often requires the use of multiple analytical platforms, batches of samples, and laboratories, any of which can introduce a component of unwanted variation. In addition, every experiment is subject to within-platform and other experimental variation, which often includes unwanted biological variation. Such variation must be removed in order to focus on the biological information of interest. We present a broadly applicable method for the removal of unwanted variation arising from various sources for the identification of differentially abundant metabolites and, hence, for the systematic integration of data on the same quantities from different sources. We illustrate the versatility and the performance of the approach in four applications, and we show that it has several advantages over the existing normalization methods.

[1]  J. Lindon,et al.  Scaling and normalization effects in NMR spectroscopic metabonomic data sets. , 2006, Analytical chemistry.

[2]  Eleanor C. Saunders,et al.  Central carbon metabolism of Leishmania parasites , 2010, Parasitology.

[3]  Jairus Bowne,et al.  Comprehensive profiling and quantitation of amine group containing metabolites. , 2011, Analytical chemistry.

[4]  Johann A. Gagnon-Bartsch,et al.  Using control genes to correct for unwanted variation in microarray data. , 2012, Biostatistics.

[5]  Oliver Stegle,et al.  Accounting for Non-genetic Factors Improves the Power of eQTL Studies , 2008, RECOMB.

[6]  M. Sjöström,et al.  Design of experiments: an efficient strategy to identify factors influencing extraction and derivatization of Arabidopsis thaliana samples in metabolomic studies with gas chromatography/mass spectrometry. , 2004, Analytical biochemistry.

[7]  A. Smilde,et al.  Large-scale human metabolomics studies: a strategy for data (pre-) processing and validation. , 2006, Analytical chemistry.

[8]  Joachim Selbig,et al.  Metabolite fingerprinting: detecting biological features by independent component analysis , 2004, Bioinform..

[9]  Ute Roessner,et al.  What is metabolomics all about? , 2009, BioTechniques.

[10]  J. D. Morrison,et al.  Computer methods in analytical mass spectrometry. Identification of an unknown compound in a catalog , 1968 .

[11]  Joshua D. Knowles,et al.  Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry , 2011, Nature Protocols.

[12]  Matej Oresic,et al.  Normalization method for metabolomics data using optimal selection of multiple internal standards , 2007, BMC Bioinformatics.

[13]  John D. Storey,et al.  Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis , 2007, PLoS genetics.

[14]  Kazuki Saito,et al.  Compensation for systematic cross-contribution improves normalization of mass spectrometry based metabolomics data. , 2009, Analytical chemistry.

[15]  T. Shaler,et al.  Quantification of proteins and metabolites by mass spectrometry without isotopic labeling or spiked standards. , 2003, Analytical chemistry.

[16]  Joachim Selbig,et al.  A gentle guide to the analysis of metabolomic data. , 2007, Methods in molecular biology.