mQTL.NMR: an integrated suite for genetic mapping of quantitative variations of (1)H NMR-based metabolic profiles.

High-throughput (1)H nuclear magnetic resonance (NMR) is an increasingly popular robust approach for qualitative and quantitative metabolic profiling, which can be used in conjunction with genomic techniques to discover novel genetic associations through metabotype quantitative trait locus (mQTL) mapping. There is therefore a crucial necessity to develop specialized tools for an accurate detection and unbiased interpretability of the genetically determined metabolic signals. Here we introduce and implement a combined chemoinformatic approach for objective and systematic analysis of untargeted (1)H NMR-based metabolic profiles in quantitative genetic contexts. The R/Bioconductor mQTL.NMR package was designed to (i) perform a series of preprocessing steps restoring spectral dependency in collinear NMR data sets to reduce the multiple testing burden, (ii) carry out robust and accurate mQTL mapping in human cohorts as well as in rodent models, (iii) statistically enhance structural assignment of genetically determined metabolites, and (iv) illustrate results with a series of visualization tools. Built-in flexibility and implementation in the powerful R/Bioconductor framework allow key preprocessing steps such as peak alignment, normalization, or dimensionality reduction to be tailored to specific problems. The mQTL.NMR package is freely available with its source code through the Comprehensive R/Bioconductor repository and its own website ( http://www.ican-institute.org/tools/ ). It represents a significant advance to facilitate untargeted metabolomic data processing and quantitative analysis and their genetic mapping.

[1]  Daniel Raftery,et al.  Interdependence of signal processing and analysis of urine 1H NMR spectra for metabolic profiling. , 2009, Analytical chemistry.

[2]  E Holmes,et al.  Automatic reduction of NMR spectroscopic data for statistical and pattern recognition classification of samples. , 1994, Journal of pharmaceutical and biomedical analysis.

[3]  Hao Wu,et al.  R/qtl: QTL Mapping in Experimental Crosses , 2003, Bioinform..

[4]  Ib M. Skovgaard,et al.  Mapping Quantitative Trait Loci by an Extension of the Haley–Knott Regression Method Using Estimating Equations , 2006, Genetics.

[5]  L. Liang,et al.  Mapping complex disease traits with global gene expression , 2009, Nature Reviews Genetics.

[6]  J. Lindon,et al.  'Metabonomics': understanding the metabolic responses of living systems to pathophysiological stimuli via multivariate statistical analysis of biological NMR spectroscopic data. , 1999, Xenobiotica; the fate of foreign compounds in biological systems.

[7]  Christian Gieger,et al.  A genome-wide perspective of genetic variation in human metabolism , 2010, Nature Genetics.

[8]  J. Lindon,et al.  Metabonomics: a platform for studying drug toxicity and gene function , 2002, Nature Reviews Drug Discovery.

[9]  M. Bihoreau,et al.  Untargeted metabolome quantitative trait locus mapping associates variation in urine glycerate to mutant glycerate kinase. , 2012, Journal of proteome research.

[10]  J. Trygg,et al.  Evaluation of the orthogonal projection on latent structure model limitations caused by chemical shift variability and improved visualization of biomarker changes in 1H NMR spectroscopic metabonomic studies. , 2005, Analytical chemistry.

[11]  M. Spraul,et al.  Precision high-throughput proton NMR spectroscopy of human urine, serum, and plasma for large-scale metabolic phenotyping. , 2014, Analytical chemistry.

[12]  Markus Perola,et al.  Metabonomic, transcriptomic, and genomic variation of a population cohort , 2010, Molecular systems biology.

[13]  C. Gieger,et al.  Human metabolic individuality in biomedical and pharmaceutical research , 2011, Nature.

[14]  David S. Wishart,et al.  MetaboAnalyst 2.0—a comprehensive server for metabolomic data analysis , 2012, Nucleic Acids Res..

[15]  T. Ebbels,et al.  Metabolic profiling, metabolomic and metabonomic procedures for NMR spectroscopy of urine, plasma, serum and tissue extracts , 2007, Nature Protocols.

[16]  Tao Wang,et al.  Automics: an integrated platform for NMR-based metabonomics spectral processing and data analysis , 2009, BMC Bioinformatics.

[17]  I. Schuppe-Koistinen,et al.  Peak alignment of NMR signals by means of a genetic algorithm , 2003 .

[18]  Manuel Desco,et al.  A novel R-package graphic user interface for the analysis of metabonomic profiles , 2009, BMC Bioinformatics.

[19]  T. Ebbels,et al.  Recursive segment-wise peak alignment of biological (1)h NMR spectra for improved metabolic biomarker recovery. , 2009, Analytical chemistry.

[20]  Dimitrios Spiliotopoulos,et al.  muma, An R Package for Metabolomics Univariate and Multivariate Statistical Analysis , 2013 .

[21]  M. Dumas,et al.  Statistical recoupling prior to significance testing in nuclear magnetic resonance based metabonomics. , 2009, Analytical chemistry.

[22]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[23]  Gregory D. Tredwell,et al.  Between-person comparison of metabolite fitting for NMR-based quantitative metabolomics. , 2011, Analytical chemistry.

[24]  Yurii S. Aulchenko,et al.  BIOINFORMATICS APPLICATIONS NOTE doi:10.1093/bioinformatics/btm108 Genetics and population analysis GenABEL: an R library for genome-wide association analysis , 2022 .

[25]  Dominique Gauguier,et al.  Direct quantitative trait locus mapping of mammalian metabolic phenotypes in diabetic and normoglycemic rat models , 2007, Nature Genetics.

[26]  I. Wilson,et al.  An NMR‐based metabonomic approach to investigate the biochemical consequences of genetic strain differences: application to the C57BL10J and Alpk:ApfCD mouse , 2000, FEBS letters.

[27]  Xavier Correig,et al.  Focus: a robust workflow for one-dimensional NMR spectral analysis. , 2014, Analytical chemistry.

[28]  P. Elliott,et al.  High-throughput 1H NMR-based metabolic analysis of human serum and urine for large-scale epidemiological studies: validation study. , 2008, International journal of epidemiology.

[29]  Elaine Holmes,et al.  High-resolution magic-angle-spinning NMR spectroscopy for metabolic profiling of intact tissues , 2010, Nature Protocols.

[30]  Erik Johansson,et al.  Using chemometrics for navigating in the large data sets of genomics, proteomics, and metabonomics (gpm) , 2004, Analytical and bioanalytical chemistry.

[31]  Royston Goodacre,et al.  PYCHEM: a multivariate analysis package for python , 2006, Bioinform..

[32]  Christophe Croux,et al.  TOMCAT: A MATLAB toolbox for multivariate calibration techniques , 2007 .

[33]  Peter Donnelly,et al.  A Genome-Wide Metabolic QTL Analysis in Europeans Implicates Two Loci Shaped by Recent Positive Selection , 2011, PLoS genetics.

[34]  J. Lindon,et al.  Scaling and normalization effects in NMR spectroscopic metabonomic data sets. , 2006, Analytical chemistry.

[35]  Daniel S Waterman,et al.  The influence of EDTA and citrate anticoagulant addition to human plasma on information recovery from NMR-based metabolic profiling studies. , 2010, Molecular bioSystems.

[36]  R. Spang,et al.  State-of-the art data normalization methods improve NMR-based metabolomic analysis , 2011, Metabolomics.

[37]  M. Spraul,et al.  750 MHz 1H and 1H-13C NMR spectroscopy of human blood plasma. , 1995, Analytical chemistry.

[38]  H. Senn,et al.  Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures. Application in 1H NMR metabonomics. , 2006, Analytical chemistry.