LipidFrag: Improving reliability of in silico fragmentation of lipids and application to the Caenorhabditis elegans lipidome

Lipid identification is a major bottleneck in high-throughput lipidomics studies. However, tools for the analysis of lipid tandem MS spectra are rather limited. While the comparison against spectra in reference libraries is one of the preferred methods, these libraries are far from being complete. In order to improve identification rates, the in silico fragmentation tool MetFrag was combined with Lipid Maps and lipid-class specific classifiers which calculate probabilities for lipid class assignments. The resulting LipidFrag workflow was trained and evaluated on different commercially available lipid standard materials, measured with data dependent UPLC-Q-ToF-MS/MS acquisition. The automatic analysis was compared against manual MS/MS spectra interpretation. With the lipid class specific models, identification of the true positives was improved especially for cases where candidate lipids from different lipid classes had similar MetFrag scores by removing up to 56% of false positive results. This LipidFrag approach was then applied to MS/MS spectra of lipid extracts of the nematode Caenorhabditis elegans. Fragments explained by LipidFrag match known fragmentation pathways, e.g., neutral losses of lipid headgroups and fatty acid side chain fragments. Based on prediction models trained on standard lipid materials, high probabilities for correct annotations were achieved, which makes LipidFrag a good choice for automated lipid data analysis and reliability testing of lipid identifications.

[1]  Xianlin Han,et al.  Automated lipid identification and quantification by multidimensional mass spectrometry-based shotgun lipidomics. , 2009, Analytical chemistry.

[2]  G. van Meer,et al.  Cellular lipidomics , 2005, The EMBO journal.

[3]  William Stafford Noble,et al.  Posterior error probabilities and false discovery rates: two sides of the same coin. , 2008, Journal of proteome research.

[4]  Johannes Griss,et al.  Greazy: Open-Source Software for Automated Phospholipid Tandem Mass Spectrometry Identification. , 2016, Analytical chemistry.

[5]  Christer S. Ejsing,et al.  Comprehensive Lipidome Analysis by Shotgun Lipidomics on a Hybrid Quadrupole-Orbitrap-Linear Ion Trap Mass Spectrometer , 2014, Journal of The American Society for Mass Spectrometry.

[6]  Matthias Müller-Hannemann,et al.  In silico fragmentation for computer assisted identification of metabolite mass spectra , 2010, BMC Bioinformatics.

[7]  Fong-Fu Hsu,et al.  Charge-driven fragmentation processes in diacyl glycerophosphatidic acids upon low-energy collisional activation. A mechanistic proposal , 2000, Journal of the American Society for Mass Spectrometry.

[8]  R. Murphy,et al.  Detection of the abundance of diacylglycerol and triacylglycerol molecular species in cells using neutral loss mass spectrometry. , 2007, Analytical biochemistry.

[9]  Juan Antonio Vizcaíno,et al.  Shorthand notation for lipid structures derived from mass spectrometry , 2013, Journal of Lipid Research.

[10]  Karsten Suhre,et al.  MassTRIX: mass translator into pathways , 2008, Nucleic Acids Res..

[11]  M. Hirai,et al.  MassBank: a public repository for sharing mass spectral data for life sciences. , 2010, Journal of mass spectrometry : JMS.

[12]  Michael Witting,et al.  The Caenorhabditis elegans lipidome: A primer for lipid analysis in Caenorhabditis elegans. , 2016, Archives of biochemistry and biophysics.

[13]  John Turk,et al.  Characterization of ceramides by low energy collisional-activated dissociation tandem mass spectrometry with negative-ion electrospray ionization , 2002, Journal of the American Society for Mass Spectrometry.

[14]  Xin Lu,et al.  Multiple reaction monitoring-ion pair finder: a systematic approach to transform nontargeted mode to pseudotargeted mode for metabolomics study based on liquid chromatography-mass spectrometry. , 2015, Analytical chemistry.

[15]  R. Taguchi,et al.  High-resolution analysis by nano-electrospray ionization Fourier transform ion cyclotron resonance mass spectrometry for the identification of molecular species of phospholipids and their oxidized metabolites. , 2004, Rapid communications in mass spectrometry : RCM.

[16]  Christer S. Ejsing,et al.  Charting molecular composition of phosphatidylcholines by fatty acid scanning and ion trap MS3 fragmentation Published, JLR Papers in Press, August 16, 2003. DOI 10.1194/jlr.D300020-JLR200 , 2003, Journal of Lipid Research.

[17]  M. Witting,et al.  Optimizing a ultrahigh pressure liquid chromatography-time of flight-mass spectrometry approach using a novel sub-2μm core-shell particle for in depth lipidomic profiling of Caenorhabditis elegans. , 2014, Journal of chromatography. A.

[18]  Oliver Fiehn,et al.  LipidBlast - in-silico tandem mass spectrometry database for lipid identification , 2013, Nature Methods.

[19]  T. Hankemeier,et al.  RPLC-ion-trap-FTMS method for lipid profiling of plasma: method validation and application to p53 mutant mouse model. , 2008, Journal of proteome research.

[20]  Zhixiang Yan,et al.  Improved data-dependent acquisition for untargeted metabolomics using gas-phase fractionation with staggered mass range. , 2015, Analytical chemistry.

[21]  M. Witting,et al.  Chapter 17 – Transcriptome and Metabolome Data Integration—Technical Perquisites for Successful Data Fusion and Visualization , 2014 .

[22]  Michael C. Thomas,et al.  Structural characterization of glycerophospholipids by combinations of ozone- and collision-induced dissociation mass spectrometry: the next step towards "top-down" lipidomics. , 2014, The Analyst.

[23]  Gerd Schmitz,et al.  Shotgun lipidomics by tandem mass spectrometry under data-dependent acquisition control. , 2007, Methods in enzymology.

[24]  M. Mann,et al.  More than 100,000 detectable peptide species elute in single shotgun proteomics runs but the majority is inaccessible to data-dependent LC-MS/MS. , 2011, Journal of proteome research.

[25]  Masanori Arita,et al.  MS-DIAL: Data Independent MS/MS Deconvolution for Comprehensive Metabolome Analysis , 2015, Nature Methods.

[26]  M. Caffrey,et al.  LIPIDAT: a database of lipid phase transition temperatures and enthalpy changes. DMPC data subset analysis. , 1992, Chemistry and physics of lipids.

[27]  Alan Bridge,et al.  The SwissLipids knowledgebase for lipid biology , 2015, Bioinform..

[28]  Zeeshan Ahmed,et al.  Lipid-Pro: a computational lipid identification solution for untargeted lipidomics on data-independent acquisition tandem mass spectrometry platforms , 2015, Bioinform..

[29]  T. Hankemeier,et al.  Comprehensive LC-MS E lipidomic analysis using a shotgun approach and its application to biomarker detection and identification in osteoarthritis patients. , 2010, Journal of proteome research.

[30]  Michael Witting,et al.  MassTRIX Reloaded: Combined Analysis and Visualization of Transcriptome and Metabolome Data , 2012, PloS one.

[31]  E. Yasugi,et al.  How to Search the Glycolipid data in “LIPIDBANK for Web”, the Newly Developed Lipid Database in Japan , 2000 .

[32]  Xianlin Han,et al.  Accurate Quantification of Lipid Species by Electrospray Ionization Mass Spectrometry — Meets a Key Challenge in Lipidomics , 2011, Metabolites.

[33]  A. Shevchenko,et al.  Systematic screening for novel lipids by shotgun lipidomics. , 2014, Analytical chemistry.

[34]  Joseph M. Foster,et al.  LipidHome: A Database of Theoretical Lipids Optimized for High Throughput Mass Spectrometry Lipidomics , 2013, PloS one.

[35]  Xianlin Han,et al.  Shotgun lipidomics: electrospray ionization mass spectrometric analysis and quantitation of cellular lipidomes directly from crude extracts of biological samples. , 2005, Mass spectrometry reviews.

[36]  David W. Russell,et al.  LMSD: LIPID MAPS structure database , 2006, Nucleic Acids Res..

[37]  Vasant R. Marur,et al.  Separation of cis-trans phospholipid isomers using reversed phase LC with high resolution MS detection. , 2012, Analytical chemistry.