Predicting CD4 T-cell epitopes based on antigen cleavage, MHCII presentation, and TCR recognition

Accurate predictions of T-cell epitopes would be useful for designing vaccines, immunotherapies for cancer and autoimmune diseases, and improved protein therapies. The humoral immune response involves uptake of antigens by antigen presenting cells (APCs), APC processing and presentation of peptides on MHC class II (pMHCII), and T-cell receptor (TCR) recognition of pMHCII complexes. Most in silico methods predict only peptide-MHCII binding, resulting in significant over-prediction of CD4 T-cell epitopes. We present a method, ITCell, for prediction of T-cell epitopes within an input protein antigen sequence for given MHCII and TCR sequences. The method integrates information about three stages of the immune response pathway: antigen cleavage, MHCII presentation, and TCR recognition. First, antigen cleavage sites are predicted based on the cleavage profiles of cathepsins S, B, and H. Second, for each 12-mer peptide in the antigen sequence we predict whether it will bind to a given MHCII, based on the scores of modeled peptide-MHCII complexes. Third, we predict whether or not any of the top scoring peptide-MHCII complexes can bind to a given TCR, based on the scores of modeled ternary peptide-MHCII-TCR complexes and the distribution of predicted cleavage sites. Our benchmarks consist of epitope predictions generated by this algorithm, checked against 20 peptide-MHCII-TCR crystal structures, as well as epitope predictions for four peptide-MHCII-TCR complexes with known epitopes and TCR sequences but without crystal structures. ITCell successfully identified the correct epitopes as one of the 20 top scoring peptides for 22 of 24 benchmark cases. To validate the method using a clinically relevant application, we utilized five factor VIII-specific TCR sequences from hemophilia A subjects who developed an immune response to factor VIII replacement therapy. The known HLA-DR1-restricted factor VIII epitope was among the six top-scoring factor VIII peptides predicted by ITCall to bind HLA-DR1 and all five TCRs. Our integrative approach is more accurate than current single-stage epitope prediction algorithms applied to the same benchmarks. It is freely available as a web server (http://salilab.org/itcell). Author summary Knowledge of T-cell epitopes is useful for designing vaccines, improving cancer immunotherapy, studying autoimmune diseases, and engineering protein replacement therapies. Unfortunately, experimental methods for identification of T-cell epitopes are slow, expensive, and not always applicable. Thus, a more accurate computational method for prediction of T-cell epitopes needs to be developed. While the T-cell response to extracellular antigens proceeds through multiple stages, current computational methods rely only on the prediction of peptide binding affinity to an MHCII receptor on antigen presenting cells, resulting in a relatively high number of false-positive predictions of T-cell epitopes within protein antigens. We developed an integrative approach to predict T-cell epitopes that computationally combines information from three stages of the humoral immune response pathway: antigen cleavage, MHCII presentation, and TCR recognition, resulting in an increased accuracy of epitope predictions. This method was applied to predict epitopes within blood coagulation factor VIII (FVIII) that were recognized by TCRs from hemophilia A subjects who developed an anti-FVIII antibody response. The correct epitope was predicted after modeling all possible 12-mer FVIII peptides bound in ternary complexes with the relevant MHCII (HLA-DR1) and each of five experimentally determined FVIII-specific TCR sequences.

[1]  E. James,et al.  FVIII proteins with a modified immunodominant T-cell epitope exhibit reduced immunogenicity and normal FVIII activity. , 2018, Blood advances.

[2]  Purvesh Khatri,et al.  Antigen Identification for Orphan T Cell Receptors Expressed on Tumor-Infiltrating Lymphocytes , 2017, Cell.

[3]  Roman A. Zubarev,et al.  The SysteMHC Atlas project , 2017, Nucleic Acids Res..

[4]  P. Alam Results and Problems in Cell Differentiation , 2018 .

[5]  P. Paz,et al.  T cells from hemophilia A subjects recognize the same HLA-restricted FVIII epitope with a narrow TCR repertoire. , 2016, Blood.

[6]  S. Sadegh-Nasseri A step-by-step overview of the dynamic process of epitope selection by major histocompatibility complex class II for presentation to helper T cells , 2016, F1000Research.

[7]  James McCluskey,et al.  T cell receptor reversed polarity recognition of a self-antigen major histocompatibility complex , 2015, Nature Immunology.

[8]  Y. Doyon,et al.  In vivo genome editing of the albumin locus as a platform for protein replacement therapy. , 2015, Blood.

[9]  Morten Nielsen,et al.  Accurate pan-specific prediction of peptide-MHC class II binding affinity with improved binding core identification , 2015, Immunogenetics.

[10]  D. Ostrov,et al.  New approaches for predicting T cell-mediated drug reactions: A role for inducible and potentially preventable autoimmunity. , 2015, The Journal of allergy and clinical immunology.

[11]  J. Derisi,et al.  Destructin-1 is a collagen-degrading endopeptidase secreted by Pseudogymnoascus destructans, the causative agent of white-nose syndrome , 2015, Proceedings of the National Academy of Sciences.

[12]  Deborah Hix,et al.  The immune epitope database (IEDB) 3.0 , 2014, Nucleic Acids Res..

[13]  Maxim N. Artyomov,et al.  Checkpoint Blockade Cancer Immunotherapy Targets Tumour-Specific Mutant Antigens , 2014, Nature.

[14]  N. Friedman,et al.  T-cell receptor repertoires share a restricted set of public and abundant CDR3 sequences that are associated with self-related immunity , 2014, Genome research.

[15]  R. Cole,et al.  Divergent Paths for the Selection of Immunodominant Epitopes from Distinct Antigenic Sources , 2014, Nature Communications.

[16]  Morten Nielsen,et al.  Dataset size and composition impact the reliability of performance benchmarks for peptide-MHC binding predictions , 2014, BMC Bioinformatics.

[17]  K. Lewis,et al.  High-resolution mapping of epitopes on the C2 domain of factor VIII by analysis of point mutants using surface plasmon resonance. , 2014, Blood.

[18]  Evan W. Newell,et al.  Beyond model antigens: high-dimensional methods for the analysis of antigen-specific T cells , 2014, Nature Biotechnology.

[19]  Oriol Fornes,et al.  On the use of knowledge-based potentials for the evaluation of models of protein-protein, protein-DNA, and protein-RNA interactions. , 2014, Advances in protein chemistry and structural biology.

[20]  Andrej Sali,et al.  Optimized atomic statistical potentials: assessment of protein interfaces and loops , 2013, Bioinform..

[21]  Robert A Holt,et al.  Sequence analysis of T-cell repertoires in health and disease , 2013, Genome Medicine.

[22]  Wilfred Ndifon,et al.  CD4+ T Cell-Receptor Repertoire Diversity is Compromised in the Spleen but Not in the Bone Marrow of Aged Mice Due to Private and Sporadic Clonal Expansions , 2013, Front. Immunol..

[23]  B. Byrne,et al.  B-Cell depletion and immunomodulation before initiation of enzyme replacement therapy blocks the immune response to acid alpha-glucosidase in infantile-onset Pompe disease. , 2013, The Journal of pediatrics.

[24]  O. Lund,et al.  NetMHCIIpan-3.0, a common pan-specific MHC class II prediction method including all three human MHC class II isotypes, HLA-DR, HLA-DP and HLA-DQ , 2013, Immunogenetics.

[25]  Mark M Davis,et al.  Combinatorial tetramer staining and mass cytometry analysis facilitate T-cell epitope mapping and characterization , 2013, Nature Biotechnology.

[26]  Alma L Burlingame,et al.  Global identification of peptidase specificity by multiplex substrate profiling , 2012, Nature Methods.

[27]  Natalie I. Tasman,et al.  A Cross-platform Toolkit for Mass Spectrometry and Proteomics , 2012, Nature Biotechnology.

[28]  T. Beddoe,et al.  Killer cell immunoglobulin-like receptor 3DL1-mediated recognition of human leukocyte antigen B , 2011, Nature.

[29]  Richard A. Moore,et al.  Exhaustive T-cell repertoire sequencing of human peripheral blood samples reveals signatures of antigen selection and a directly measured repertoire size of at least 1 million clonotypes. , 2011, Genome research.

[30]  Roland Martin,et al.  Structure of a TCR with high affinity for self‐antigen reveals basis for escape from negative selection , 2011, The EMBO journal.

[31]  S. Ranganathan,et al.  Understanding TR Binding to pMHC Complexes: How Does a TR Scan Many pMHC Complexes yet Preferentially Bind to One , 2011, PloS one.

[32]  Geoffrey I. Webb,et al.  Bioinformatic Approaches for Predicting substrates of Proteases , 2011, J. Bioinform. Comput. Biol..

[33]  O. Lund,et al.  NetMHCIIpan-2.0 - Improved pan-specific HLA-DR predictions using a novel concurrent alignment and weight optimization training procedure , 2010, Immunome research.

[34]  R. Cole,et al.  A reductionist cell-free major histocompatibility complex class II antigen processing system identifies immunodominant epitopes , 2010, Nature Medicine.

[35]  R. Cole,et al.  A novel Minimalist Cell-Free MHC Class II Antigen Processing System Identifies Immunodominant Epitopes , 2010, Nature medicine.

[36]  D. Jewell,et al.  Comprehensive, Quantitative Mapping of T Cell Epitopes in Gluten in Celiac Disease , 2010, Science Translational Medicine.

[37]  Morten Nielsen,et al.  MHC Class II epitope predictive algorithms , 2010, Immunology.

[38]  Nir London,et al.  Sub‐angstrom modeling of complexes between flexible peptides and globular proteins , 2010, Proteins.

[39]  Ursula Pieper,et al.  Prediction of protease substrates using sequence and structure features , 2010, Bioinform..

[40]  Ying Xu,et al.  Limitations of Ab Initio Predictions of Peptide Binding to MHC Class II Molecules , 2010, PloS one.

[41]  E. Tolosa,et al.  Antigen processing and presentation in multiple sclerosis. , 2010, Results and problems in cell differentiation.

[42]  Roland L. Dunbrack,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4 , 2022 .

[43]  Abigail Wacher,et al.  Comprehensive assessment of T-cell receptor beta-chain diversity in alphabeta T cells. , 2009, Blood.

[44]  K. Gevaert,et al.  Improved visualization of protein consensus sequences by iceLogo , 2009, Nature Methods.

[45]  E. James,et al.  Lineages of human T-cell clones, including T helper 17/T helper 1 cells, isolated at different stages of anti-factor VIII immune responses. , 2009, Blood.

[46]  Yoram Louzoun,et al.  Viruses selectively mutate their CD8+ T-cell epitopes—a large-scale immunomic analysis , 2009, Bioinform..

[47]  Peter R Baker,et al.  In-depth Analysis of Tandem Mass Spectrometry Data from Disparate Instrument Types*S , 2008, Molecular & Cellular Proteomics.

[48]  Mark Johnson,et al.  NCBI BLAST: a better web interface , 2008, Nucleic Acids Res..

[49]  John Sidney,et al.  A Systematic Assessment of MHC Class II Peptide Binding Predictions and Evaluation of a Consensus Approach , 2008, PLoS Comput. Biol..

[50]  Morten Nielsen,et al.  Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method , 2007, BMC Bioinformatics.

[51]  Alex Bateman,et al.  An introduction to hidden Markov models. , 2007, Current protocols in bioinformatics.

[52]  K. P. Murphy,et al.  Janeway's immunobiology , 2007 .

[53]  James Theiler,et al.  Polyvalent vaccines for optimal coverage of potential T-cell epitopes in global HIV-1 variants , 2007, Nature Medicine.

[54]  Francesco Leonetti,et al.  Substrate Profiling of Cysteine Proteases Using a Combinatorial Peptide Library Identifies Functionally Unique Specificities* , 2006, Journal of Biological Chemistry.

[55]  Ruth Nussinov,et al.  PatchDock and SymmDock: servers for rigid and symmetric docking , 2005, Nucleic Acids Res..

[56]  P. Kloetzel,et al.  Modeling the MHC class I pathway by combining predictions of proteasomal cleavage,TAP transport and MHC class I binding , 2005, Cellular and Molecular Life Sciences CMLS.

[57]  Ruth Nussinov,et al.  A method for simultaneous alignment of multiple protein structures , 2004, Proteins.

[58]  Ruth Nussinov,et al.  Efficient Unbound Docking of Rigid Molecules , 2002, WABI.

[59]  A. Abbas,et al.  Basic Immunology : Functions and Disorders of the Immune System , 2001 .

[60]  Emmanuel Beaudoing,et al.  Size Estimate of the αβ TCR Repertoire of Naive Mouse Splenocytes1 , 2000, The Journal of Immunology.

[61]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[62]  J. Hansen,et al.  Rapid screening of T-cell receptor (TCR) variable gene usage by multiplex PCR: application for assessment of clonal composition. , 1999, Tissue antigens.

[63]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[64]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.