A Reference Feature based method for Quantification and Identification of LC-MS based untargeted metabolomics

Batch inconsistency is a major problem when applying LC-MS based untargeted metabolomics in real-time analysis situation such as clinical diagnosis or health monitoring. And inefficiency of collecting MS2 is a major problem for metabolite identification. Here, we developed a reference-feature based quantification and identification strategy (RFQI). In RFQI, samples are individually profiled using a pre-fixed reference feature table. Quantification results show that RFQI improves features’ overlap rate and reduce variance across batches significantly in real-time-analysis mode, and can find more than 4-fold numbers of features. Besides, RFQI collects MS2 from consecutive increasing samples for metabolite identification of pre-fixed features, thus it can effectively compensate for the poor efficiency of MS2 collection in data-dependent acquisition mode. In summary, RFQI can make full advantage of consecutive increasing samples in real-time analysis situation, both for quantification and identification.

[1]  E. Marcotte,et al.  Chromatographic alignment of ESI-LC-MS proteomics data sets by ordered bijective interpolated warping. , 2006, Analytical chemistry.

[2]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[3]  Steffen Neumann,et al.  Highly sensitive feature detection for high resolution LC/MS , 2008, BMC Bioinformatics.

[4]  J. Lindon,et al.  Systems biology: Metabonomics , 2008, Nature.

[5]  Johan Trygg,et al.  Chemometrics in metabolomics--a review in human disease diagnosis. , 2010, Analytica chimica acta.

[6]  Joshua D. Knowles,et al.  Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry , 2011, Nature Protocols.

[7]  Jun Feng Xiao,et al.  Metabolite identification and quantitation in LC-MS/MS-based metabolomics. , 2012, Trends in analytical chemistry : TRAC.

[8]  Lin Shi,et al.  Large-scale untargeted LC-MS metabolomics data correction using between-batch feature alignment and cluster-based within-batch signal intensity drift correction , 2016, Metabolomics.

[9]  Gary J Patti,et al.  Systems-Level Annotation of a Metabolomics Data Set Reduces 25 000 Features to Fewer than 1000 Unique Metabolites. , 2017, Analytical chemistry.

[10]  Björn Egert,et al.  Metabolite patterns predicting sex and age in participants of the Karlsruhe Metabolomics and Nutrition (KarMeN) study , 2017, PloS one.

[11]  Juan Daniel Sanjuan-Herráez,et al.  Evaluation of batch effect elimination using quality control replicates in LC-MS metabolite profiling. , 2018, Analytica chimica acta.

[12]  Fan Zhang,et al.  WaveICA: A novel algorithm to remove batch effects for large-scale untargeted metabolomics data based on wavelet analysis. , 2019, Analytica chimica acta.

[13]  Xin Xiong,et al.  Metabolic reaction network-based recursive metabolite annotation for untargeted metabolomics , 2019, Nature Communications.

[14]  Tobias Depke,et al.  CluMSID: an R package for similarity-based clustering of tandem mass spectra to aid feature annotation in metabolomics , 2019, Bioinform..