Clinical implications of recent advances in proteogenomics

ABSTRACT Proteogenomics, the alliance of proteomics, transcriptomics, genomics and bioinformatics, was first proposed for refining genome annotation using experimental data acquired on gene products. With high-throughput analysis of proteins made possible with next-generation tandem mass spectrometers, proteogenomics is greatly improving human genome annotation per se, and is helping to decrypt the numerous gene and protein modifications occurring during development, aging, illness and cancer progression. Further efforts are required to obtain a comprehensive picture of human genes, their products, functions, and drift over time or in reaction to microbiota and pathogen stimuli. This should be performed not only to obtain a general overview of the human population, but also to gain specific information at the individual level. This review focuses on the clinical implications of proteogenomics: novel biological insights into fundamental biology, better characterization of pathogens and parasites, discovery of novel diagnostic approaches for cancer, and personalized medicine.

[1]  Andrew N Hoofnagle,et al.  Current and future applications of mass spectrometry to the clinical laboratory. , 2011, American journal of clinical pathology.

[2]  L. Mirny,et al.  High-Resolution Mapping of the Spatial Organization of a Bacterial Chromosome , 2013, Science.

[3]  Nandini A. Sahasrabuddhe,et al.  A proteogenomic approach to map the proteome of an unsequenced pathogen – Leishmania donovani , 2012, Proteomics.

[4]  D. Mccormick Sequence the Human Genome , 1986, Bio/Technology.

[5]  Reinhard Guthke,et al.  A review on computational systems biology of pathogen–host interactions , 2015, Front. Microbiol..

[6]  Shivashankar H. Nagaraj,et al.  PGTools: A Software Suite for Proteogenomic Data Analysis and Visualization. , 2015, Journal of proteome research.

[7]  J. Armengaud,et al.  Expanding the Known Repertoire of Virulence Factors Produced by Bacillus cereus through Early Secretome Profiling in Three Redox Conditions , 2010, Molecular & Cellular Proteomics.

[8]  Samuel H. Payne,et al.  Proteogenomic strategies for identification of aberrant cancer peptides using large‐scale next‐generation sequencing data , 2014, Proteomics.

[9]  Proteogenomics of selective susceptibility to endotoxin using circulating acute phase biomarkers and bioassay development in sheep: a review , 2014, Proteome Science.

[10]  Joshua N. Adkins,et al.  Comparative Omics-Driven Genome Annotation Refinement: Application across Yersiniae , 2012, PloS one.

[11]  J. Gribben,et al.  Empirical inference of circuitry and plasticity in a kinase signaling network , 2015, Proceedings of the National Academy of Sciences.

[12]  Lloyd M. Smith,et al.  Proteoform: a single term describing protein complexity , 2013, Nature Methods.

[13]  Philipp F. Lange,et al.  Annotating N Termini for the Human Proteome Project: N Termini and Nα-Acetylation Status Differentiate Stable Cleaved Protein Species from Degradation Remnants in the Human Erythrocyte Proteome , 2014, Journal of proteome research.

[14]  Christine Schaeffer-Reiss,et al.  An improved stable isotope N-terminal labeling approach with light/heavy TMPP to automate proteogenomics data validation: dN-TOP. , 2013, Journal of proteome research.

[15]  J. Armengaud,et al.  Proteogenomic biomarkers for identification of Francisella species and subspecies by matrix-assisted laser desorption ionization-time-of-flight mass spectrometry. , 2014, Analytical chemistry.

[16]  L. Blanchard,et al.  RNA Sequencing and Proteogenomics Reveal the Importance of Leaderless mRNAs in the Radiation-Tolerant Bacterium Deinococcus deserti , 2014, Genome biology and evolution.

[17]  Roman M. Ženka,et al.  Proteomic detection of immunoglobulin light chain variable region peptides from amyloidosis patient biopsies. , 2015, Journal of proteome research.

[18]  Albert J R Heck,et al.  Toward full peptide sequence coverage by dual fragmentation combining electron-transfer and higher-energy collision dissociation tandem mass spectrometry. , 2012, Analytical chemistry.

[19]  A. Pascual-Montano,et al.  Proteogenomics Dashboard for the Human Proteome Project. , 2015, Journal of proteome research.

[20]  RajuRajesh,et al.  Neglected Tropical Diseases and Omics Science: Proteogenomics Analysis of the Promastigote Stage of Leishmania major Parasite , 2014 .

[21]  Chen Chen,et al.  Screening of missing proteins in the human liver proteome by improved MRM-approach-based targeted proteomics. , 2014, Journal of proteome research.

[22]  L. Jensen,et al.  Mass Spectrometry of Human Leukocyte Antigen Class I Peptidomes Reveals Strong Effects of Protein Abundance and Turnover on Antigen Presentation* , 2015, Molecular & Cellular Proteomics.

[23]  Matthew D. Dun,et al.  Proteogenomics: emergence and promise , 2015, Cellular and Molecular Life Sciences.

[24]  Eric W Deutsch,et al.  State of the human proteome in 2013 as viewed through PeptideAtlas: comparing the kidney, urine, and plasma proteomes for the biology- and disease-driven Human Proteome Project. , 2014, Journal of proteome research.

[25]  Alexander V. Tyakht,et al.  Chromosome 18 transcriptome profiling and targeted proteome mapping in depleted plasma, liver tissue and HepG2 cells. , 2013, Journal of proteome research.

[26]  M. Wilkins,et al.  Tools to covisualize and coanalyze proteomic data with genomes and transcriptomes: validation of genes and alternative mRNA splicing. , 2014, Journal of proteome research.

[27]  J. Armengaud,et al.  Non-model organisms, a species endangered by proteogenomics. , 2014, Journal of proteomics.

[28]  William S Hancock,et al.  Protannotator: a semiautomated pipeline for chromosome-wise functional annotation of the "missing" human proteome. , 2014, Journal of proteome research.

[29]  J. Armengaud,et al.  N‐terminomics and proteogenomics, getting off to a good start , 2014, Proteomics.

[30]  Tony Pawson,et al.  Cell-Specific Information Processing in Segregating Populations of Eph Receptor Ephrin–Expressing Cells , 2009, Science.

[31]  Jean Armengaud,et al.  A perfect genome annotation is within reach with the proteomics and genomics alliance. , 2009, Current opinion in microbiology.

[32]  International Human Genome Sequencing Consortium Finishing the euchromatic sequence of the human genome , 2004 .

[33]  Frédéric Chalmel,et al.  Forty-Four Novel Protein-Coding Loci Discovered Using a Proteomics Informed by Transcriptomics (PIT) Approach in Rat Male Germ Cells1 , 2014, Biology of reproduction.

[34]  S. Gygi,et al.  Correlation between Protein and mRNA Abundance in Yeast , 1999, Molecular and Cellular Biology.

[35]  Mehdi Mesri,et al.  Linking cancer genome to proteome: NCI's investment into proteogenomics , 2014, Proteomics.

[36]  Alexey I Nesvizhskii,et al.  Mining Missing Membrane Proteins by High-pH Reverse-Phase StageTip Fractionation and Multiple Reaction Monitoring Mass Spectrometry. , 2015, Journal of proteome research.

[37]  Brian L. Frey,et al.  Discovery and Mass Spectrometric Analysis of Novel Splice-junction Peptides Using RNA-Seq* , 2013, Molecular & Cellular Proteomics.

[38]  K. Kuznetsova,et al.  Proteogenomics meets cancer immunology: mass spectrometric discovery and analysis of neoantigens , 2015, Expert review of proteomics.

[39]  B. Kuster,et al.  Mass-spectrometry-based draft of the human proteome , 2014, Nature.

[40]  William S Hancock,et al.  Proteogenomic analysis of human colon carcinoma cell lines LIM1215, LIM1899, and LIM2405. , 2013, Journal of proteome research.

[41]  V. Blinov,et al.  PPLine: An Automated Pipeline for SNP, SAP, and Splice Variant Detection in the Context of Proteogenomics. , 2015, Journal of proteome research.

[42]  M. Mann,et al.  Large-scale Proteomic Analysis of the Human Spliceosome References , 2006 .

[43]  Yixue Li,et al.  Integration of mass spectrometry and RNA‐Seq data to confirm human ab initio predicted genes and lncRNAs , 2014, Proteomics.

[44]  Pamela K. Kreeger,et al.  Cancer systems biology: a network modeling perspective , 2009, Carcinogenesis.

[45]  P. Boutros,et al.  Onco-proteogenomics: cancer proteomics joins forces with genomics , 2014, Nature Methods.

[46]  B. Shen,et al.  A proteogenomics approach integrating proteomics and ribosome profiling increases the efficiency of protein identification and enables the discovery of alternative translation start sites , 2014, Proteomics.

[47]  Lars Malmström,et al.  Quantitative proteogenomics of human pathogens using DIA-MS. , 2015, Journal of proteomics.

[48]  Samuel H. Payne,et al.  Proteogenomic Analysis of Bacteria and Archaea: A 46 Organism Case Study , 2011, PloS one.

[49]  Chen Chen,et al.  Identification of HPV integration and gene mutation in HeLa cell line by integrated analysis of RNA-Seq and MS/MS data. , 2015, Journal of proteome research.

[50]  Alessandro Sette,et al.  An open-source computational and data resource to analyze digital maps of immunopeptidomes , 2015, eLife.

[51]  W. Hsu,et al.  Decoding the disease-associated proteins encoded in the human chromosome 4. , 2013, Journal of proteome research.

[52]  A. Nesvizhskii,et al.  Metrics for the Human Proteome Project 2015: Progress on the Human Proteome and Guidelines for High-Confidence Protein Identification. , 2015, Journal of proteome research.

[53]  M. Snyder Q & A: the Snyderome , 2012, Genome Biology.

[54]  James E. Johnson,et al.  Flexible and Accessible Workflows for Improved Proteogenomic Analysis Using the Galaxy Framework , 2014, Journal of proteome research.

[55]  Vineet Bafna,et al.  Advanced Proteogenomic Analysis Reveals Multiple Peptide Mutations and Complex Immunoglobulin Peptides in Colon Cancer. , 2015, Journal of proteome research.

[56]  J. Armengaud,et al.  Proteogenomics of Gammarus fossarum to Document the Reproductive System of Amphipods* , 2014, Molecular & Cellular Proteomics.

[57]  M. Kube,et al.  Transcriptomics assisted proteomic analysis of Nicotiana occidentalis infected by Candidatus Phytoplasma mali strain AT , 2014, Proteomics.

[58]  J. Dekker,et al.  Capturing Chromosome Conformation , 2002, Science.

[59]  Arie Admon,et al.  The Human Immunopeptidome Project, a Suggestion for yet another Postgenome Next Big Thing , 2011, Molecular & Cellular Proteomics.

[60]  A. Pandey,et al.  Tropical Diseases and Omics Science : Proteogenomics Analysis of the Promastigote Stage of Leishmania major Parasite , 2014 .

[61]  R. Aebersold,et al.  A Combined Shotgun and Targeted Mass Spectrometry Strategy for Breast Cancer Biomarker Discovery. , 2015, Journal of proteome research.

[62]  Akhilesh Pandey,et al.  Proteogenomic analysis of human chromosome 9-encoded genes from human samples and lung cancer tissues. , 2014, Journal of proteome research.

[63]  R. Bast,et al.  Three Biomarkers Identified from Serum Proteomic Analysis for the Detection of Early Stage Ovarian Cancer , 2004, Cancer Research.

[64]  Jeffrey R. Whiteaker,et al.  Proteogenomic characterization of human colon and rectal cancer , 2014, Nature.

[65]  R. Branca,et al.  SpliceVista, a Tool for Splice Variant Identification and Visualization in Shotgun Proteomics Data* , 2014, Molecular & Cellular Proteomics.

[66]  Hugo Y. K. Lam,et al.  Personal Omics Profiling Reveals Dynamic Molecular and Medical Phenotypes , 2012, Cell.

[67]  J. Fernández-Irigoyen,et al.  New insights into the human brain proteome: Protein expression profiling of deep brain stimulation target areas. , 2015, Journal of proteomics.

[68]  K. Gevaert,et al.  Deep Proteome Coverage Based on Ribosome Profiling Aids Mass Spectrometry-based Protein and Peptide Discovery and Provides Evidence of Alternative Translation Products and Near-cognate Translation Initiation Events* , 2013, Molecular & Cellular Proteomics.

[69]  Harsh Pawar,et al.  A bioinformatics approach to reanalyze the genome annotation of kinetoplastid protozoan parasite Leishmania donovani. , 2014, Genomics.

[70]  Alvaro Sebastian Vaca Jacome,et al.  N‐terminome analysis of the human mitochondrial proteome , 2015, Proteomics.

[71]  H. Rodriguez,et al.  Proteogenomic convergence for understanding cancer pathways and networks , 2014, Clinical Proteomics.

[72]  C. Pineau,et al.  Human Spermatozoa as a Model for Detecting Missing Proteins in the Context of the Chromosome-Centric Human Proteome Project. , 2015, Journal of proteome research.

[73]  Samuel H. Payne,et al.  A proteogenomic update to Yersinia: enhancing genome annotation , 2010, BMC Genomics.

[74]  G. Omenn,et al.  A first step toward completion of a genome-wide characterization of the human proteome. , 2013, Journal of proteome research.

[75]  J. Armengaud Microbiology and proteomics, getting the best of both worlds! , 2013, Environmental microbiology.

[76]  E. Marcotte,et al.  Insights into the regulation of protein abundance from proteomic and transcriptomic analyses , 2012, Nature Reviews Genetics.

[77]  Akhilesh Pandey,et al.  Chromosome-centric human proteome project: deciphering proteins associated with glioma and neurodegenerative disorders on chromosome 12. , 2014, Journal of proteome research.

[78]  K. Yamaguchi,et al.  Identification of a novel protein isoform derived from cancer‐related splicing variants using combined analysis of transcriptome and proteome , 2011, Proteomics.

[79]  Shoba Ranganathan,et al.  Functional annotation of the human chromosome 7 "missing" proteins: a bioinformatics approach. , 2013, Journal of proteome research.

[80]  Vladimir Brusic,et al.  Tumor antigens as proteogenomic biomarkers in invasive ductal carcinomas , 2014, BMC Medical Genomics.

[81]  Q. Jin,et al.  A proteogenomic analysis of Shigella flexneri using 2D LC-MALDI TOF/TOF , 2011, BMC Genomics.

[82]  A. Pandey,et al.  Moving from unsequenced to sequenced genome: reanalysis of the proteome of Leishmania donovani. , 2014, Journal of proteomics.

[83]  Gary D Bader,et al.  A draft map of the human proteome , 2014, Nature.

[84]  Christopher G. Adda,et al.  Proteogenomic analysis reveals exosomes are more oncogenic than ectosomes , 2015, Oncotarget.

[85]  Shivakumar Keerthikumar,et al.  Proteogenomic analysis of Candida glabrata using high resolution mass spectrometry. , 2012, Journal of proteome research.

[86]  Richard D. Smith,et al.  Proteogenomics: needs and roles to be filled by proteomics in genome annotation. , 2008, Briefings in functional genomics & proteomics.

[87]  Ruedi Aebersold,et al.  Using data‐independent, high‐resolution mass spectrometry in protein biomarker research: Perspectives and clinical applications , 2015, Proteomics. Clinical applications.

[88]  Alain Gateau,et al.  Computational and Mass-Spectrometry-Based Workflow for the Discovery and Validation of Missing Human Proteins: Application to Chromosomes 2 and 14. , 2015, Journal of proteome research.

[89]  David Fenyö,et al.  Integrated Bottom-Up and Top-Down Proteomics of Patient-Derived Breast Tumor Xenografts* , 2015, Molecular & Cellular Proteomics.

[90]  J. De las Rivas,et al.  In Vitro Transcription/Translation System: A Versatile Tool in the Search for Missing Proteins. , 2015, Journal of proteome research.

[91]  N. Packer,et al.  Quantitative proteomic analysis of paired colorectal cancer and non-tumorigenic tissues reveals signature proteins and perturbed pathways involved in CRC progression and metastasis. , 2015, Journal of proteomics.

[92]  Erica M. Hartmann,et al.  N-Terminal-oriented Proteogenomics of the Marine Bacterium Roseobacter Denitrificans Och114 using N-Succinimidyloxycarbonylmethyl)tris(2,4,6-trimethoxyphenyl)phosphonium bromide (TMPP) Labeling and Diagonal Chromatography* , 2014, Molecular & Cellular Proteomics.

[93]  Tao Zhang,et al.  Systematic analysis of missing proteins provides clues to help define all of the protein-coding genes on human chromosome 1. , 2014, Journal of proteome research.

[94]  Dhanashree S. Kelkar,et al.  Proteogenomic analysis of pathogenic yeast Cryptococcus neoformans using high resolution mass spectrometry , 2014, Clinical Proteomics.

[95]  A. Pandey,et al.  Comprehensive proteomics analysis of glycosomes from Leishmania donovani. , 2015, Omics : a journal of integrative biology.

[96]  J. Armengaud,et al.  Magnetic immunoaffinity enrichment for selective capture and MS/MS analysis of N-terminal-TMPP-labeled peptides. , 2014, Journal of proteome research.

[97]  J. Armengaud,et al.  Proteogenomic insights into salt tolerance by a halotolerant alpha-proteobacterium isolated from an Andean saline spring. , 2014, Journal of proteomics.

[98]  J. Yates,et al.  Mining genomes: correlating tandem mass spectra of modified and unmodified peptides to sequences in nucleotide databases. , 1995, Analytical chemistry.

[99]  M. Mann,et al.  Stable Isotope Labeling by Amino Acids in Cell Culture, SILAC, as a Simple and Accurate Approach to Expression Proteomics* , 2002, Molecular & Cellular Proteomics.

[100]  J. Armengaud Power of positive thinking in quantitative proteomics , 2015, Proteomics.

[101]  R. Norton,et al.  Discovery by proteogenomics and characterization of an RF-amide neuropeptide from cone snail venom. , 2015, Journal of proteomics.

[102]  Srikanth S. Manda,et al.  Identification and characterization of proteins encoded by chromosome 12 as part of chromosome-centric human proteome project. , 2014, Journal of proteome research.

[103]  Daniel B. Goodman,et al.  Comparative proteogenomics: combining mass spectrometry and comparative genomics to analyze multiple genomes. , 2008, Genome research.

[104]  Jean Armengaud,et al.  Proteogenomics and systems biology: quest for the ultimate missing parts , 2010, Expert review of proteomics.

[105]  Sheng Gu,et al.  Amino acid residue specific stable isotope labeling for quantitative proteomics. , 2002, Rapid communications in mass spectrometry : RCM.

[106]  Victor Guryev,et al.  Genomic variability and protein species - Improving sequence coverage for proteogenomics. , 2016, Journal of proteomics.

[107]  P. Stadler,et al.  Identification of new protein coding sequences and signal peptidase cleavage sites of Helicobacter pylori strain 26695 by proteogenomics. , 2013, Journal of proteomics.

[108]  A. Nicholson,et al.  Mutations of the BRAF gene in human cancer , 2002, Nature.

[109]  David D. Shteynberg,et al.  State of the Human Proteome in 2014/2015 As Viewed through PeptideAtlas: Enhancing Accuracy and Coverage through the AtlasProphet. , 2015, Journal of proteome research.

[110]  Juan Antonio Vizcaíno,et al.  Quest for Missing Proteins: Update 2015 on Chromosome-Centric Human Proteome Project. , 2015, Journal of proteome research.

[111]  O. Poch,et al.  Ortho-proteogenomics: multiple proteomes investigation through orthology and a new MS-based protocol. , 2008, Genome research.

[112]  A. Podtelejnikov,et al.  Linking genome and proteome by mass spectrometry: large-scale identification of yeast proteins from two dimensional gels. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[113]  B. Alpha-Bazin,et al.  Time dynamics of the Bacillus cereus exoproteome are shaped by cellular oxidation , 2015, Front. Microbiol..

[114]  Young Mok Park,et al.  Proteogenomics of the human hippocampus: The road ahead. , 2015, Biochimica et biophysica acta.

[115]  H. Wiker,et al.  Improving genome annotation of enterotoxigenic Escherichia coli TW10598 by a label‐free quantitative MS/MS approach , 2015, Proteomics.

[116]  Eric W Deutsch,et al.  The state of the human proteome in 2012 as viewed through PeptideAtlas. , 2013, Journal of proteome research.

[117]  Jean Armengaud,et al.  Ribosomal proteins as biomarkers for bacterial identification by mass spectrometry in the clinical microbiology laboratory. , 2013, Journal of microbiological methods.

[118]  Damian Fermin,et al.  Novel gene and gene model detection using a whole genome open reading frame analysis in proteomics , 2006, Genome Biology.

[119]  A. Nesvizhskii Proteogenomics: concepts, applications and computational strategies , 2014, Nature Methods.

[120]  C. Dunyach-Rémy,et al.  Mass spectrometry: a revolution in clinical microbiology? , 2012, Clinical chemistry and laboratory medicine.

[121]  Masaru Tomita,et al.  Onco-proteogenomics: a novel approach to identify cancer-specific mutations combining proteomics and transcriptome deep sequencing , 2010, Genome Biology.

[122]  Subha Madhavan,et al.  The CPTAC Data Portal: A Resource for Cancer Proteomics Research. , 2015, Journal of proteome research.

[123]  Joel A. Kooren,et al.  A two‐step database search method improves sensitivity in peptide sequence matches for metaproteomics and proteogenomics studies , 2013, Proteomics.

[124]  M. Baudet,et al.  Proteomics-based Refinement of Deinococcus deserti Genome Annotation Reveals an Unwonted Use of Non-canonical Translation Initiation Codons , 2009, Molecular & Cellular Proteomics.

[125]  Ronald J. Moore,et al.  Blood Peptidome-Degradome Profile of Breast Cancer , 2010, PloS one.

[126]  Christopher M Overall,et al.  Identifying and quantifying proteolytic events and the natural N terminome by terminal amine isotopic labeling of substrates , 2011, Nature Protocols.

[127]  Alexander V. Tyakht,et al.  Application of Spiroplasma melliferum proteogenomic profiling for the discovery of virulence factors and pathogenicity mechanisms in host-associated spiroplasmas. , 2012, Journal of proteome research.

[128]  S. Hanash,et al.  A chromosome-centric human proteome project (C-HPP) to characterize the sets of proteins encoded in chromosome 17. , 2013, Journal of proteome research.

[129]  Andrew R. Jones,et al.  A large-scale proteogenomics study of apicomplexan pathogens—Toxoplasma gondii and Neospora caninum , 2015, Proteomics.

[130]  Alexander V. Tyakht,et al.  Chromosome 18 transcriptoproteome of liver tissue and HepG2 cells and targeted proteome mapping in depleted plasma: update 2013. , 2014, Journal of proteome research.

[131]  Ruedi Aebersold,et al.  A Mass Spectrometric-Derived Cell Surface Protein Atlas , 2015, PloS one.

[132]  A. Pascual-Montano,et al.  Surfing transcriptomic landscapes. A step beyond the annotation of chromosome 16 proteome. , 2014, Journal of proteome research.

[133]  Jacob D. Jaffe,et al.  Proteogenomic mapping as a complementary method to perform genome annotation , 2004, Proteomics.

[134]  E. Hsi,et al.  Onco-proteogenomics identifies urinary S100A9 and GRN as potential combinatorial biomarkers for early diagnosis of hepatocellular carcinoma , 2015, BBA clinical.