Normalization method for metabolomics data using optimal selection of multiple internal standards

BackgroundSuccess of metabolomics as the phenotyping platform largely depends on its ability to detect various sources of biological variability. Removal of platform-specific sources of variability such as systematic error is therefore one of the foremost priorities in data preprocessing. However, chemical diversity of molecular species included in typical metabolic profiling experiments leads to different responses to variations in experimental conditions, making normalization a very demanding task.ResultsWith the aim to remove unwanted systematic variation, we present an approach that utilizes variability information from multiple internal standard compounds to find optimal normalization factor for each individual molecular species detected by metabolomics approach (NOMIS). We demonstrate the method on mouse liver lipidomic profiles using Ultra Performance Liquid Chromatography coupled to high resolution mass spectrometry, and compare its performance to two commonly utilized normalization methods: normalization by l2 norm and by retention time region specific standard compound profiles. The NOMIS method proved superior in its ability to reduce the effect of systematic error across the full spectrum of metabolite peaks. We also demonstrate that the method can be used to select best combinations of standard compounds for normalization.ConclusionDepending on experiment design and biological matrix, the NOMIS method is applicable either as a one-step normalization method or as a two-step method where the normalization parameters, influenced by variabilities of internal standard compounds and their correlation to metabolites, are first calculated from a study conducted in repeatability conditions. The method can also be used in analytical development of metabolomics methods by helping to select best combinations of standard compounds for a particular biological matrix and analytical platform.

[1]  Joachim Selbig,et al.  Metabolite fingerprinting: detecting biological features by independent component analysis , 2004, Bioinform..

[2]  J. van der Greef,et al.  The role of analytical sciences in medical systems biology. , 2004, Current opinion in chemical biology.

[3]  Matej Oresic,et al.  MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data , 2006, Bioinform..

[4]  P. Diggle Analysis of Longitudinal Data , 1995 .

[5]  Edmond de Hoffmann,et al.  Comprar Mass Spectrometry: Principles and Applications | Vincent Stroobant | 9780470033104 | Wiley , 2007 .

[6]  A. Smilde,et al.  Large-scale human metabolomics studies: a strategy for data (pre-) processing and validation. , 2006, Analytical chemistry.

[7]  Matej Orešič,et al.  The role of metabolomics in systems biology : a new vision for drug discovery and development , 2003 .

[8]  R. Goodacre,et al.  Metabolic Profiling: Its Role in Biomarker Discovery and Gene Function Analysis , 2003, Springer US.

[9]  D. Kell,et al.  Metabolomics by numbers: acquiring and understanding global metabolite data. , 2004, Trends in biotechnology.

[10]  Eric K. Neumann,et al.  Phenotype Characterisation Using Integrated Gene Transcript, Protein and Metabolite Profiling , 2004, Applied bioinformatics.

[11]  Jürgen Kurths,et al.  Observing and Interpreting Correlations in Metabolic Networks , 2003, Bioinform..

[12]  Tommi S. Jaakkola,et al.  Maximum-likelihood estimation of optimal scaling factors for expression array normalization , 2001, SPIE BiOS.

[13]  J. Trottier,et al.  Book Review: Mass Spectrometry: Principles and Applications. E. de Hoffman, J. Charette and W. Stroobant. Wiley, Chichester 1996. ISBN 0 471 96697 5 , 1997 .

[14]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[15]  M. Maffei,et al.  Positional cloning of the mouse obese gene and its human homologue , 1995, Nature.

[16]  D. Kell,et al.  A functional genomics strategy that uses metabolome data to reveal the phenotype of silent mutations , 2001, Nature Biotechnology.

[17]  A. B. Robinson,et al.  Quantitative analysis of urine vapor and breath by gas-liquid partition chromatography. , 1971, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Matej Orešič,et al.  The Role of Metabolomics in Systems Biology , 2003 .

[19]  Matej Oresic,et al.  Processing methods for differential analysis of LC/MS profile data , 2005, BMC Bioinformatics.

[20]  M. Orešič,et al.  Metabolomic approaches to phenotype characterization and applications to complex diseases , 2006, Expert review of molecular diagnostics.

[21]  J. D. Morrison,et al.  Computer methods in analytical mass spectrometry. Identification of an unknown compound in a catalog , 1968 .

[22]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .

[23]  M. Maffei,et al.  Positional cloning of the mouse obese gene and its human homologue , 1994, Nature.

[24]  Yizeng Liang,et al.  Preprocessing of analytical profiles in the presence of homoscedastic or heteroscedastic noise , 1994 .

[25]  R. A. van den Berg,et al.  Centering, scaling, and transformations: improving the biological information content of metabolomics data , 2006, BMC Genomics.

[26]  T. Shaler,et al.  Quantification of proteins and metabolites by mass spectrometry without isotopic labeling or spiked standards. , 2003, Analytical chemistry.