Intensity drift removal in LC/MS metabolomics by common variance compensation

Liquid chromatography coupled to mass spectrometry (LC/MS) has become widely used in metabolomics. Several artefacts have been identified during the acquisition step in large LC/MS metabolomics experiments, including ion suppression, carryover, and changes in sensitivity and intensity, and several sources have been identified as responsible for these effects. Among them, drift in peak intensity is one of the most frequent and may even constitute the main source of variance in the data, leading to misleading statistical results when the samples are analysed. In this article, we propose a methodology that addresses this issue by applying a common variance analysis before data normalization. The methodology was tested and compared with four other methods by calculating the Dunn and Silhouette indices of the quality control classes. The results showed that the proposed methodology performed better than each of the other four methods. As far as we know, this is the first time that this kind of approach has been applied in the metabolomics context.

AVAILABILITY AND IMPLEMENTATION: The source code of the methods is available as the R package intCor at http://b2slab.upc.edu/software-and-downloads/intensity-drift-correction/.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
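The idea summarised in the abstract (estimate the variance shared by the quality control classes, remove it, then check how well the QC classes separate) can be illustrated with a small sketch. This is not the intCor implementation: it uses Python with NumPy rather than R, it approximates the common variance by the single leading principal component of the pooled within-class scatter, and it evaluates only the Dunn index. All function names, the simulated data, and the drift model are assumptions made for illustration.

```python
import numpy as np


def drift_direction(X, labels):
    """Leading direction of the pooled within-class scatter.

    Assumption: a one-component stand-in for a common variance analysis,
    not the estimator used in the paper.
    """
    centered = np.vstack(
        [X[labels == c] - X[labels == c].mean(axis=0) for c in np.unique(labels)]
    )
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[0]


def remove_component(X, direction):
    """Project the given unit direction out of the mean-centred data."""
    mean = X.mean(axis=0)
    Xc = X - mean
    return Xc - np.outer(Xc @ direction, direction) + mean


def dunn_index(X, labels):
    """Smallest between-class distance divided by largest within-class distance."""
    dists = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    same = labels[:, None] == labels[None, :]
    return dists[~same].min() / dists[same].max()


def simulate(n_per_class=20, n_features=50, seed=0):
    """Two hypothetical QC classes with a linear drift along injection order."""
    rng = np.random.default_rng(seed)
    labels = np.repeat([0, 1], n_per_class)
    centers = rng.normal(0.0, 1.0, size=(2, n_features))
    X = centers[labels] + rng.normal(0.0, 0.1, size=(labels.size, n_features))
    order = rng.permutation(labels.size)  # randomised injection order
    drift_vec = rng.normal(0.0, 1.0, size=n_features)
    drift_vec /= np.linalg.norm(drift_vec)
    X = X + np.outer(5.0 * order / labels.size, drift_vec)
    return X, labels, drift_vec


if __name__ == "__main__":
    X, labels, _ = simulate()
    direction = drift_direction(X, labels)
    X_corrected = remove_component(X, direction)
    print("Dunn index before correction:", dunn_index(X, labels))
    print("Dunn index after correction: ", dunn_index(X_corrected, labels))
```

In this toy setting the drift dominates the within-class variance, so removing one component is enough to tighten the QC clusters. Real data may need more than one component and a proper common principal component estimate across classes.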
