Deep Learning in Proteomics

Proteomics, the study of all the proteins in biological systems, is becoming a data‐rich science. Protein sequences and structures are comprehensively catalogued in online databases. With recent advancements in tandem mass spectrometry (MS) technology, protein expression and post‐translational modifications (PTMs) can be studied in a variety of biological systems at the global scale. Sophisticated computational algorithms are needed to translate the vast amount of data into novel biological insights. Deep learning automatically extracts data representations at high levels of abstraction from data, and it thrives in data‐rich scientific research domains. Here, a comprehensive overview of deep learning applications in proteomics, including retention time prediction, MS/MS spectrum prediction, de novo peptide sequencing, PTM prediction, major histocompatibility complex‐peptide binding prediction, and protein structure prediction, is provided. Limitations and the future directions of deep learning in proteomics are also discussed. This review will provide readers an overview of deep learning and how it can be used to analyze proteomics data.

[1]  U. Kück,et al.  Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications , 2019, mBio.

[2]  Cheng Chen,et al.  DNNAce: Prediction of prokaryote lysine acetylation sites through deep neural networks with multi-information fusion , 2020 .

[3]  Baozhen Shan,et al.  De novo peptide sequencing by deep learning , 2017, Proceedings of the National Academy of Sciences.

[4]  Tianyi Zhao,et al.  Peptide-Major Histocompatibility Complex Class I Binding Prediction Based on Deep Learning With Novel Feature , 2019, Front. Genet..

[5]  Philip E. Bourne,et al.  Immune epitope database analysis resource , 2012, Nucleic Acids Res..

[6]  J. Koenderink Q… , 2014, Les noms officiels des communes de Wallonie, de Bruxelles-Capitale et de la communaute germanophone.

[7]  Rojan Shrestha,et al.  Assessing the accuracy of contact predictions in CASP13 , 2019, Proteins.

[8]  Jürgen Cox,et al.  High-quality MS/MS spectrum prediction for data-dependent and data-independent acquisition data analysis , 2019, Nature Methods.

[9]  Haoyang Zeng,et al.  DeepLigand: accurate prediction of MHC class I ligands using peptide embedding , 2019, Bioinform..

[10]  Yu Xue,et al.  GPS-Palm: a deep learning-based graphic presentation system for the prediction of S-palmitoylation sites in proteins , 2020, Briefings Bioinform..

[11]  Duolin Wang,et al.  MusiteDeep: a deep-learning based webserver for protein post-translational modification site prediction and visualization , 2020, Nucleic Acids Res..

[12]  Kristian E. Swearingen,et al.  Generating high quality libraries for DIA MS with empirically corrected peptide predictions , 2020, Nature Communications.

[13]  Kuldip K. Paliwal,et al.  Capturing non‐local interactions by long short‐term memory bidirectional recurrent neural networks for improving prediction of protein secondary structure, backbone angles, contact numbers and solvent accessibility , 2017, Bioinform..

[14]  Joseph C. Wu,et al.  Splice-Junction-Based Mapping of Alternative Isoforms in the Human Proteome , 2019, Cell reports.

[15]  Lennart Martens,et al.  Updated MS²PIP web server delivers fast and accurate MS² peak intensity prediction for multiple fragmentation methods, instruments and labeling techniques , 2019, Nucleic Acids Res..

[16]  Yuxin Cui,et al.  DeepSeqPan, a novel deep convolutional neural network model for pan-specific class I HLA-peptide binding affinity prediction , 2018, Scientific Reports.

[17]  A. Millar,et al.  The Scope, Functions, and Dynamics of Posttranslational Protein Modifications. , 2019, Annual review of plant biology.

[18]  Michael J MacCoss,et al.  Improving tandem mass spectrum identification using peptide retention time prediction across diverse chromatography conditions. , 2007, Analytical chemistry.

[19]  Haixu Tang,et al.  Full-Spectrum Prediction of Peptides Tandem Mass Spectra using Deep Neural Network. , 2020, Analytical chemistry.

[20]  Hyunsoo Kim,et al.  Clinically Applicable Deep Learning Algorithm Using Quantitative Proteomic Data. , 2019, Journal of proteome research.

[21]  Wojciech Samek,et al.  UDSMProt: universal deep sequence models for protein classification , 2020, Bioinformatics.

[22]  J. Neefjes,et al.  Towards a systems understanding of MHC class I and MHC class II antigen presentation , 2011, Nature Reviews Immunology.

[23]  Pushmeet Kohli,et al.  Protein structure prediction using multiple deep neural networks in the 13th Critical Assessment of Protein Structure Prediction (CASP13) , 2019, Proteins.

[24]  Behnam Neyshabur,et al.  Predicting protein‐protein interactions through sequence‐based deep learning , 2018, Bioinform..

[25]  Brian Hutchinson,et al.  Predicting the Effect of Single and Multiple Mutations on Protein Structural Stability , 2018, Molecules.

[26]  Bing Zhang,et al.  DeepRescore: Leveraging Deep Learning to Improve Peptide Identification in Immunopeptidomics , 2020, Proteomics.

[27]  Hong-Bin Shen,et al.  ImPLoc: a multi-instance deep learning model for the prediction of protein subcellular localization based on immunohistochemistry images , 2019, Bioinform..

[28]  Randy J Read,et al.  Evaluation of template‐based modeling in CASP13 , 2019, Proteins.

[29]  A. Tramontano,et al.  Critical assessment of methods of protein structure prediction: Progress and new directions in round XI , 2016, Proteins.

[30]  Shoba Ranganathan,et al.  Mass spectrometry-based protein identification in proteomics - a review , 2020, Briefings Bioinform..

[31]  Russ B. Altman,et al.  Predicting HLA class II antigen presentation through integrated deep learning , 2019, Nature Biotechnology.

[32]  Jian Peng,et al.  Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields , 2015, Scientific Reports.

[33]  Jianjun Hu,et al.  DeepMHC: Deep Convolutional Neural Networks for High-performance peptide-MHC Binding Affinity Prediction , 2017, bioRxiv.

[34]  Hao Chi,et al.  MS/MS Spectrum Prediction for Modified Peptides Using pDeep2 Trained by Transfer Learning. , 2019, Analytical chemistry.

[35]  Mohammed AlQuraishi End-to-end differentiable learning of protein structure , 2018, bioRxiv.

[36]  Quanhui Wang,et al.  Expansion of the ion library for mining SWATH-MS data through fractionation proteomics. , 2014, Analytical chemistry.

[37]  Jennifer G. Abelin,et al.  Defining HLA-II Ligand Processing and Binding Rules with Mass Spectrometry Enhances Cancer Epitope Prediction. , 2019, Immunity.

[38]  Ningning He,et al.  Identification of Protein Lysine Crotonylation Sites by a Deep Learning Framework With Convolutional Neural Networks , 2020, IEEE Access.

[39]  Jürgen Cox,et al.  A systematic investigation into the nature of tryptic HCD spectra. , 2012, Journal of proteome research.

[40]  Wojciech Samek,et al.  USMPep: universal sequence models for major histocompatibility complex binding affinity prediction , 2020, BMC Bioinformatics.

[41]  David Gfeller,et al.  Predicting Antigen Presentation—What Could We Learn From a Million Peptides? , 2018, Front. Immunol..

[42]  O. Krokhin,et al.  Sequence-specific retention calculator. Algorithm for peptide retention prediction in ion-pair RP-HPLC: application to 300- and 100-A pore size C18 sorbents. , 2006, Analytical chemistry.

[43]  Byunghan Lee,et al.  Deep learning in bioinformatics , 2016, Briefings Bioinform..

[44]  Yu Xue,et al.  GPS 2.1: enhanced prediction of kinase-specific phosphorylation sites with an algorithm of motif length selection. , 2011, Protein engineering, design & selection : PEDS.

[45]  Alireza Nasiri,et al.  Attention mechanism-based deep learning pan-specific model for interpretable MHC-I peptide binding prediction , 2019, bioRxiv.

[46]  R. Beavis,et al.  An Improved Model for Prediction of Retention Times of Tryptic Peptides in Ion Pair Reversed-phase HPLC , 2004, Molecular & Cellular Proteomics.

[47]  E. Chuangsuwanich,et al.  Uncovering Thousands of New Peptides with Sequence-Mask-Search Hybrid De Novo Peptide Sequencing Framework* , 2019, Molecular & Cellular Proteomics.

[48]  Yang Zhang,et al.  Locus-specific Retention Predictor (LsRP): A Peptide Retention Time Predictor Developed for Precision Proteomics , 2017, Scientific Reports.

[49]  V. Velculescu,et al.  High-Throughput Prediction of MHC Class I and II Neoantigens with MHCnuggets , 2019, Cancer Immunology Research.

[50]  Ole Winther,et al.  DeepLoc: prediction of protein subcellular localization using deep learning , 2017, Bioinform..

[51]  Fei Yang,et al.  OpenMS-Simulator: an open-source software for theoretical tandem mass spectrum prediction , 2015, BMC Bioinformatics.

[52]  David T Jones,et al.  Recent developments in deep learning applied to protein structure prediction , 2019, Proteins.

[53]  Matthew D. Zimmerman,et al.  The impact of structural genomics: the first quindecennial , 2016, Journal of Structural and Functional Genomics.

[54]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[55]  Ali Torkamani,et al.  Artificial intelligence in clinical and genomic diagnostics , 2019, Genome Medicine.

[56]  Rui Xu,et al.  A Comprehensive Evaluation of MS/MS Spectrum Prediction Tools for Shotgun Proteomics , 2020, Proteomics.

[57]  Huanming Yang,et al.  Improved Peptide Retention Time Prediction in Liquid Chromatography through Deep Learning. , 2018, Analytical chemistry.

[58]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[59]  Zhen Chen,et al.  Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites , 2018, Genom. Proteom. Bioinform..

[60]  Risto Miikkulainen,et al.  Designing neural networks through neuroevolution , 2019, Nat. Mach. Intell..

[61]  Yan Xu,et al.  A deep learning method to more accurately recall known lysine acetylation sites , 2019, BMC Bioinformatics.

[62]  Kaizhong Zhang,et al.  MHCherryPan. a novel model to predict the binding affinity of pan-specific class I HLA-peptide , 2019, 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[63]  Zhongqi Zhang Prediction of collision-induced-dissociation spectra of peptides with post-translational or process-induced modifications. , 2011, Analytical chemistry.

[64]  Cheng Chang,et al.  A Deep Learning‐Based Tumor Classifier Directly Using MS Raw Data , 2020, Proteomics.

[65]  Zachary Wu,et al.  Learned protein embeddings for machine learning , 2018, Bioinformatics.

[66]  Kentaro Tomii,et al.  DeepECA: an end-to-end learning framework for protein contact prediction from a multiple sequence alignment , 2020, BMC Bioinformatics.

[67]  K. Resing,et al.  Mapping protein post-translational modifications with mass spectrometry , 2007, Nature Methods.

[68]  John-William Sidhom,et al.  AI-MHC: an allele-integrated deep learning framework for improving Class I & Class II HLA-binding predictions , 2018, bioRxiv.

[69]  Zhongqi Zhang,et al.  Prediction of low-energy collision-induced dissociation spectra of peptides with three or more charges. , 2005, Analytical chemistry.

[70]  Fei-Fei Li,et al.  Deep visual-semantic alignments for generating image descriptions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[71]  Karl Mechtler,et al.  CharmeRT: Boosting Peptide Identifications by Chimeric Spectra Identification and Retention Time Prediction , 2018, Journal of proteome research.

[72]  Yanchun Liang,et al.  MusiteDeep: a deep‐learning framework for general and kinase‐specific phosphorylation site prediction , 2017, Bioinform..

[73]  Bing Zhang,et al.  Using phosphoproteomics data to understand cellular signaling: a comprehensive guide to bioinformatics resources , 2020, Clinical Proteomics.

[74]  Andrew F. Jarnuczak,et al.  The functional landscape of the human phosphoproteome , 2019, Nature Biotechnology.

[75]  Yizeng Liang,et al.  Prediction of peptide fragment ion mass spectra by data mining techniques. , 2014, Analytical chemistry.

[76]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[77]  N. Hacohen,et al.  A large peptidome dataset improves HLA class I epitope prediction across most of the human population , 2019, Nature Biotechnology.

[78]  Geoffrey I. Webb,et al.  Large-scale comparative assessment of computational predictors for lysine post-translational modification sites , 2018, Briefings Bioinform..

[79]  David E. Kim,et al.  Large-scale determination of previously unsolved protein structures using evolutionary information , 2015, eLife.

[80]  Hongyi Zhou,et al.  DESTINI: A deep-learning approach to contact-driven protein structure prediction , 2019, Scientific Reports.

[81]  Maxat Kulmanov,et al.  DeepGOPlus: improved protein function prediction from sequence , 2019, bioRxiv.

[82]  Yuehui Chen,et al.  K_net: Lysine Malonylation Sites Identification With Neural Network , 2020, IEEE Access.

[83]  Natapol Pornputtapong,et al.  MHCSeqNet: a deep neural network model for universal MHC binding prediction , 2018, BMC Bioinformatics.

[84]  David T. Jones,et al.  High precision in protein contact prediction using fully convolutional neural networks and minimal sequence features , 2018, Bioinform..

[85]  B. Ma Novor: Real-Time Peptide de Novo Sequencing Software , 2015, Journal of The American Society for Mass Spectrometry.

[86]  Alexander M. Franks,et al.  DART-ID increases single-cell proteome coverage , 2018, bioRxiv.

[87]  Zhongqi Zhang Prediction of low-energy collision-induced dissociation spectra of peptides. , 2004, Analytical chemistry.

[88]  C. Mant,et al.  Prediction of peptide retention times in reversed-phase high-performance liquid chromatography I. Determination of retention coefficients of amino acid residues of model synthetic peptides , 1986 .

[89]  Victor Spicer,et al.  Sequence-Specific Model for Peptide Retention Time Prediction in Strong Cation Exchange Chromatography. , 2017, Analytical chemistry.

[90]  M. Dong,et al.  pNovo+: de novo peptide sequencing using complementary HCD and ETD tandem mass spectra. , 2013, Journal of proteome research.

[91]  Lennart Martens,et al.  The Age of Data‐Driven Proteomics: How Machine Learning Enables Novel Workflows , 2020, Proteomics.

[92]  Yang Zhang,et al.  Deep‐learning contact‐map guided protein structure prediction in CASP13 , 2019, Proteins.

[93]  Chengjin Zhang,et al.  DeepGly: A Deep Learning Framework With Recurrent and Convolutional Neural Networks to Identify Protein Glycation Sites From Imbalanced Data , 2019, IEEE Access.

[94]  Kathryn E. Kirchoff,et al.  EMBER: Multi-label prediction of kinase-substrate phosphorylation events through deep learning , 2020, bioRxiv.

[95]  X. Gu,et al.  DeepHLApan: A Deep Learning Approach for Neoantigen Prediction Considering Both HLA-Peptide Binding and Immunogenicity , 2019, Front. Immunol..

[96]  Peter Cresswell,et al.  Pathways of antigen processing. , 2013, Annual review of immunology.

[97]  Ari Frank,et al.  Predicting intensity ranks of peptide fragment ions. , 2009, Journal of proteome research.

[98]  Yang Zhang,et al.  DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins , 2019, Bioinform..

[99]  Massimiliano Pontil,et al.  PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments , 2012, Bioinform..

[100]  Ruedi Aebersold,et al.  The SysteMHC Atlas: a Computational Pipeline, a Website, and a Data Repository for Immunopeptidomic Analyses. , 2020, Methods in molecular biology.

[101]  Yanbu Guo,et al.  Protein secondary structure prediction improved by recurrent neural networks integrated with two-dimensional convolutional neural networks , 2018, J. Bioinform. Comput. Biol..

[103]  Fabian J Theis,et al.  Deep learning: new computational modelling techniques for genomics , 2019, Nature Reviews Genetics.

[104]  Luhua Lai,et al.  Sequence-based prediction of protein protein interaction using a deep-learning algorithm , 2017, BMC Bioinformatics.

[105]  Jinbo Xu,et al.  Analysis of distance‐based protein structure prediction by deep learning in CASP13 , 2019, Proteins.

[106]  Jijun Tang,et al.  PhosPred-RF: A Novel Sequence-Based Predictor for Phosphorylation Sites Using Sequential Information Only , 2017, IEEE Transactions on NanoBioscience.

[107]  Ronghui Lou,et al.  Hybrid Spectral Library Combining DIA-MS Data and a Targeted Virtual Library Substantially Deepens the Proteome Coverage , 2020, iScience.

[108]  Kun Zhang,et al.  pValid: Validation Beyond the Target-Decoy Approach for Peptide Identification in Shotgun Proteomics. , 2019, Journal of proteome research.

[109]  Bo Liao,et al.  A Hybrid Deep Learning Model for Predicting Protein Hydroxylation Sites , 2018, International journal of molecular sciences.

[110]  Hao Yang,et al.  pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework , 2019, Bioinform..

[111]  Bernhard Y. Renard,et al.  Evaluating de novo sequencing in proteomics: already an accurate alternative to database‐driven peptide identification? , 2018, Briefings Bioinform..

[112]  David E. James,et al.  Illuminating the dark phosphoproteome , 2019, Science Signaling.

[113]  Chunjie Luo,et al.  pDeep: Predicting MS/MS Spectra of Peptides with Deep Learning. , 2017, Analytical chemistry.

[114]  C. Sahinalp,et al.  Proteogenomic Analysis Unveils the HLA Class I-Presented Immunopeptidome in Melanoma and EGFR-Mutant Lung Adenocarcinoma , 2020, bioRxiv.

[115]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[116]  N. Blom,et al.  Prediction of post‐translational glycosylation and phosphorylation of proteins from the amino acid sequence , 2004, Proteomics.

[117]  John C Whittaker,et al.  Review of factors that influence the abundance of ions produced in a tandem mass spectrometer and statistical methods for discovering these factors. , 2009, Mass spectrometry reviews.

[118]  Ha Young Kim,et al.  Prediction of mutation effects using a deep temporal convolutional network , 2019, Bioinform..

[119]  Jinyan Li,et al.  Prediction of 8-state protein secondary structures by a novel deep learning architecture , 2018, BMC Bioinformatics.

[120]  Mahmood A. Rashid,et al.  Protein secondary structure prediction using neural networks and deep learning: A review , 2019, Comput. Biol. Chem..

[121]  Huiqing Liu,et al.  DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction , 2019, BMC Bioinformatics.

[122]  Lennart Martens,et al.  MS2PIP prediction server: compute and visualize MS2 peak intensity predictions for CID and HCD fragmentation , 2015, Nucleic Acids Res..

[123]  Xiaoxia Wang,et al.  ACME: pan-specific peptide-MHC class I binding prediction through attention-based deep neural networks , 2019, Bioinform..

[124]  Predrag Radivojac,et al.  The structural and functional signatures of proteins that undergo multiple events of post‐translational modification , 2014, Protein science : a publication of the Protein Society.

[125]  Anne E Carpenter,et al.  Opportunities and obstacles for deep learning in biology and medicine , 2017, bioRxiv.

[126]  N. Hacohen,et al.  Thousands of novel unannotated proteins expand the MHC I immunopeptidome in cancer , 2020, bioRxiv.

[127]  Eric W. Deutsch,et al.  A repository of assays to quantify 10,000 human proteins by SWATH-MS , 2014, Scientific Data.

[128]  Feng Liu,et al.  Deep Learning and Its Applications in Biomedicine , 2018, Genom. Proteom. Bioinform..

[129]  Zhen Li,et al.  Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model , 2016, bioRxiv.

[130]  Lennart Martens,et al.  MS2PIP: a tool for MS/MS peak intensity prediction , 2013, Bioinform..

[131]  Ching-Tai Chen,et al.  MS2CNN: predicting MS/MS spectrum based on protein sequence using deep convolutional neural networks , 2019, BMC Genomics.

[132]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[133]  John Chilton,et al.  Using iRT, a normalized retention time for more targeted measurement of peptides , 2012, Proteomics.

[134]  Frank Hutter,et al.  Neural Architecture Search: A Survey , 2018, J. Mach. Learn. Res..

[135]  Dongsup Kim,et al.  Deep convolutional neural networks for pan-specific peptide-MHC class I binding prediction , 2017, BMC Bioinformatics.

[136]  P. Lasch,et al.  Isolation Window Optimization of Data-Independent Acquisition Using Predicted Libraries for Deep and Accurate Proteome Profiling. , 2020, Analytical chemistry.

[137]  Richard D. Smith,et al.  Application of peptide LC retention time information in a discriminant function for peptide identification by tandem mass spectrometry. , 2004, Journal of proteome research.

[138]  Alireza Nasiri,et al.  Attention mechanism-based deep learning pan-specific model for interpretable MHC-I peptide binding prediction , 2019, bioRxiv.

[139]  Joseph M. Foster,et al.  Chromatographic retention time prediction for posttranslationally modified peptides , 2012, Proteomics.

[140]  Dong Xu,et al.  Musite, a Tool for Global Prediction of General and Kinase-specific Phosphorylation Sites* , 2010, Molecular & Cellular Proteomics.

[141]  Ngoc Hieu Tran,et al.  Deep learning enables de novo peptide sequencing from data-independent-acquisition mass spectrometry , 2018, Nature Methods.

[142]  Mathias Wilhelm,et al.  Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning , 2019, Nature Methods.

[143]  Chao Fang,et al.  MUFOLD‐SS: New deep inception‐inside‐inception networks for protein secondary structure prediction , 2018, Proteins.

[144]  Minghao Yin,et al.  General and Species-Specific Lysine Acetylation Site Prediction Using a Bi-Modal Deep Architecture , 2018, IEEE Access.

[145]  Jianlin Cheng,et al.  CONFOLD: Residue‐residue contact‐guided ab initio protein folding , 2015, Proteins.

[146]  Davide Chicco,et al.  Siamese Neural Networks: An Overview , 2021, Artificial Neural Networks, 3rd Edition.

[147]  Renzhi Cao,et al.  Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13 , 2019, Proteins.

[148]  David Baker,et al.  Macromolecular modeling with rosetta. , 2008, Annual review of biochemistry.

[149]  Haoyang Zeng,et al.  Quantification of Uncertainty in Peptide-MHC Binding Prediction Improves High-Affinity Peptide Selection for Therapeutic Design. , 2019, Cell systems.

[150]  Matthew The,et al.  Uncertainty estimation of predictions of peptides' chromatographic retention times in shotgun proteomics , 2016, Bioinform..

[151]  Matteo Dal Peraro,et al.  A further leap of improvement in tertiary structure prediction in CASP13 prompts new routes for future assessments , 2019, Proteins.

[152]  Yan Xu,et al.  DeepUbi: a deep learning framework for prediction of ubiquitination sites in proteins , 2019, BMC Bioinformatics.

[153]  Jianfeng Feng,et al.  A machine learning approach to explore the spectra intensity pattern of peptides using tandem mass spectrometry data , 2008, BMC Bioinformatics.

[154]  P. Pevzner,et al.  PepNovo: de novo peptide sequencing via probabilistic network modeling. , 2005, Analytical chemistry.

[155]  C. Sander,et al.  Direct-coupling analysis of residue coevolution captures native contacts across many protein families , 2011, Proceedings of the National Academy of Sciences.

[156]  Mohammed AlQuraishi,et al.  AlphaFold at CASP13 , 2019, Bioinform..

[157]  Weilong Zhao,et al.  Systematically benchmarking peptide-MHC binding predictors: From synthetic to naturally processed epitopes , 2018, PLoS Comput. Biol..

[158]  Yu-Chieh Wang,et al.  Protein post-translational modifications and regulation of pluripotency in human stem cells , 2013, Cell Research.

[159]  DeepSuccinylSite: a deep learning based approach for protein succinylation site prediction , 2020, BMC Bioinformatics.

[160]  Yu Xue,et al.  DeepNitro: Prediction of Protein Nitration and Nitrosylation Sites by Deep Learning , 2018, Genom. Proteom. Bioinform..

[161]  J. Ahn,et al.  Predicting clinical benefit of immunotherapy by antigenic or functional mutations affecting tumour immunogenicity , 2020, Nature Communications.

[162]  Lukas Käll,et al.  Training, selection, and robust calibration of retention time models for targeted proteomics. , 2010, Journal of proteome research.

[163]  Thomas A. Hopf,et al.  Protein 3D Structure Computed from Evolutionary Sequence Variation , 2011, PloS one.

[164]  A. Brunger Version 1.2 of the Crystallography and NMR system , 2007, Nature Protocols.

[165]  Justin Bo-Kai Hsu,et al.  Characterization and Identification of Lysine Succinylation Sites based on Deep Learning Method , 2019, Scientific Reports.

[166]  Yanchun Liang,et al.  Capsule network for protein post-translational modification site prediction , 2018, Bioinform..

[167]  Andre Franke,et al.  Amino acid encoding for deep learning applications , 2020, BMC Bioinformatics.

[168]  Xing-Ming Zhao,et al.  DeepPhos: prediction of protein phosphorylation sites with deep learning , 2019, Bioinform..

[169]  Yuxin Cui,et al.  DeepSeqPanII: an interpretable recurrent neural network model with attention mechanism for peptide-HLA class II binding prediction , 2019, bioRxiv.

[170]  Brian Kuhlman,et al.  Advances in protein structure prediction and design , 2019, Nature Reviews Molecular Cell Biology.

[171]  P. Jensen Recent advances in antigen processing and presentation , 2007, Nature Immunology.

[172]  Lennart Martens,et al.  Machine learning applications in proteomics research: How the past can boost the future , 2014, Proteomics.

[173]  Ming Li,et al.  PEAKS: powerful software for peptide de novo sequencing by tandem mass spectrometry. , 2003, Rapid communications in mass spectrometry : RCM.

[174]  Gordon A Anderson,et al.  Use of artificial neural networks for the accurate prediction of peptide liquid chromatography elution times in proteome analyses. , 2003, Analytical chemistry.

[175]  George M. Church,et al.  Unified rational protein engineering with sequence-based deep representation learning , 2019, Nature Methods.

[176]  Ehsaneddin Asgari,et al.  Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics , 2015, PloS one.

[177]  Bing Zhang,et al.  Cancer neoantigen prioritization through sensitive and reliable proteogenomics analysis , 2020, Nature Communications.

[178]  Martin Eisenacher,et al.  The PRIDE database and related tools and resources in 2019: improving support for quantification data , 2018, Nucleic Acids Res..

[179]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[180]  Morten Nielsen,et al.  A generic deep convolutional neural network framework for prediction of receptor–ligand interactions—NetPhosPan: application to kinase phosphorylation prediction , 2018, Bioinform..

[181]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[182]  Jie Hou,et al.  DNCON2: improved protein contact prediction using two-level deep convolutional neural networks , 2017, bioRxiv.

[183]  Yingming Zhao,et al.  Modification‐specific proteomics: Strategies for characterization of post‐translational modifications using enrichment techniques , 2009, Proteomics.

[184]  Jianyi Yang,et al.  Improved protein structure prediction using predicted interresidue orientations , 2020, Proceedings of the National Academy of Sciences.

[185]  Xiaohui Liu,et al.  In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics , 2020, Nature Communications.

[186]  D. Baker,et al.  Assessing the utility of coevolution-based residue–residue contact predictions in a sequence- and structure-rich era , 2013, Proceedings of the National Academy of Sciences.

[187]  Predrag Radivojac,et al.  On the accuracy and limits of peptide fragmentation spectrum prediction. , 2011, Analytical chemistry.

[188]  Alex Rubinsteyn,et al.  MHCflurry: Open-Source Class I MHC Binding Affinity Prediction. , 2018, Cell systems.

[189]  Bin Ma,et al.  Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning* , 2019, Molecular & Cellular Proteomics.

[190]  Abdollah Dehzangi,et al.  SPRINT-Gly: predicting N- and O-linked glycosylation sites of human and mouse proteins by using sequence and predicted structural properties , 2019, Bioinform..

[191]  Demis Hassabis,et al.  Improved protein structure prediction using potentials from deep learning , 2020, Nature.

[192]  J. Meek Prediction of peptide retention times in high-pressure liquid chromatography on the basis of amino acid composition. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[193]  Volkan Atalay,et al.  DEEPred: Automated Protein Function Prediction with Multi-task Feed-forward Deep Neural Networks , 2019, Scientific Reports.

[194]  B Van Puyvelde,et al.  Removing the hidden data dependency of DIA with predicted spectral libraries , 2019, bioRxiv.

[195]  Michele A. Busby,et al.  Supplementary Materials for Deep learning using tumor HLA peptide mass spectrometry datasets improves neoantigen identification , 2018 .

[196]  Roman A. Zubarev,et al.  The SysteMHC Atlas project , 2017, Nucleic Acids Res..

[197]  M. Mann,et al.  Proteomic analysis of post-translational modifications , 2003, Nature Biotechnology.

[198]  Elizabeth Guruceaga,et al.  DeepMSPeptide: peptide detectability prediction using deep learning , 2019, Bioinform..

[199]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[200]  Yu Xue,et al.  GPS 2.0, a Tool to Predict Kinase-specific Phosphorylation Sites in Hierarchy *S , 2008, Molecular & Cellular Proteomics.

[201]  Wei Wang,et al.  Mutation effect estimation on protein–protein interactions using deep contextualized representation learning , 2020, NAR genomics and bioinformatics.

[202]  Dong Xu,et al.  Large-scale prediction of protein ubiquitination sites using a multimodal deep architecture , 2018, BMC Systems Biology.

[203]  Xiaohui Xie,et al.  HLA class I binding prediction via convolutional neural networks , 2017, bioRxiv.