Improved prediction of MHC II antigen presentation through integration and motif deconvolution of mass spectrometry MHC eluted ligand data

Major Histocompatibility Complex II (MHC II) molecules play a vital role in the onset and control of cellular immunity. In a highly selective process, MHC II presents peptides derived from exogenous antigens on the surface of antigen-presenting cells for T cell scrutiny. Understanding the rules defining this presentation provides critical insights into the regulation and potential manipulation of the cellular immune system. Here, we apply the NNAlign_MA machine learning framework to analyse and integrate large-scale eluted MHC II ligand mass spectrometry (MS) data sets to advance prediction of CD4+ epitopes. NNAlign_MA integrates mixed data types, handles ligands with multiple potential allele annotations, encodes ligand context, leverages information between data sets, and has pan-specific power, allowing accurate predictions for molecules outside the set included in the training data. Applying this framework, we identified accurate binding motifs for more than 50 MHC class II molecules described by MS data, particularly expanding coverage for DP and DQ beyond that obtained using current MS motif deconvolution techniques. Further, in large-scale benchmarking, the final model, termed NetMHCIIpan-4.0, demonstrated improved performance beyond current state-of-the-art predictors for ligand and CD4+ T cell epitope prediction. These results suggest that NNAlign_MA and NetMHCIIpan-4.0 are powerful tools for analysis of immunopeptidome MS data, prediction of T cell epitopes, and development of personalized immunotherapies.
