Integrative Analysis of Proteomic, Glycomic, and Metabolomic Data for Biomarker Discovery

Studies associating changes in the levels of multiple biomolecules including proteins, glycans, glycoproteins, and metabolites with the onset of cancer have been widely investigated to identify clinically relevant diagnostic biomarkers. Advances in liquid or gas chromatography mass spectrometry (LC-MS, GC-MS) have enabled high-throughput qualitative and quantitative analysis of these biomolecules. While results from separate analyses of different biomolecules have been reported widely, the mutual information obtained by partly or fully combining them has been relatively unexplored. In this study, we investigate integrative analysis of proteins, N-glycans, and metabolites to take advantage of complementary information to improve the ability to distinguish cancer cases from controls. Specifically, support vector machine-recursive feature elimination algorithm is utilized to select a panel of proteins, N-glycans, and metabolites based on LC-MS and GC-MS data previously acquired by the analysis of blood samples from two cohorts in a liver cancer study. Improved performances are observed by integrative analysis compared to separate proteomic, glycomic, and metabolomic studies in distinguishing liver cancer cases from patients with liver cirrhosis.

[1]  Habtom W. Ressom,et al.  GC-MS Based Plasma Metabolomics for Identification of Candidate Biomarkers for Hepatocellular Carcinoma in Egyptian Cohort , 2015, PloS one.

[2]  E. Diamandis,et al.  Strategies for discovering novel cancer biomarkers through utilization of emerging technologies , 2008, Nature Clinical Practice Oncology.

[3]  Rui Zhu,et al.  LC‐MS/MS‐based serum proteomics for identification of candidate biomarkers for hepatocellular carcinoma , 2015, Proteomics.

[4]  Brendan MacLean,et al.  Bioinformatics Applications Note Gene Expression Skyline: an Open Source Document Editor for Creating and Analyzing Targeted Proteomics Experiments , 2022 .

[5]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[6]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .

[7]  Yue Joseph Wang,et al.  SIMAT: GC-SIM-MS data analysis tool , 2015, BMC Bioinformatics.

[8]  J. Esko,et al.  The sweet and sour of cancer: glycans as novel therapeutic targets , 2005, Nature Reviews Cancer.

[9]  Habtom W. Ressom,et al.  GPA: An algorithm for LC/MS based glycan profile annotation , 2013, 2013 IEEE International Conference on Bioinformatics and Biomedicine.

[10]  Habtom W. Ressom,et al.  Integrative analysis of LC-MS based glycomic and proteomic data , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[11]  Habtom W. Ressom,et al.  LC–MS Profiling of N-Glycans Derived from Human Serum Samples for Biomarker Discovery in Hepatocellular Carcinoma , 2014, Journal of proteome research.

[12]  C. Van Steenkiste,et al.  Alteration of protein glycosylation in liver diseases. , 2009, Journal of hepatology.