T-cell epitope prediction based on self-tolerance

T-cell epitopes, i.e., peptides capable of inducing a T-cell mediated immune response, represent suitable components for vaccines against infectious diseases and cancer. The development of accurate T-cell epitope prediction methods is thus of great interest to immunologists and the pharmaceutical industry. Whether a particular peptide is a T-cell epitope depends on the availability of (a) an MHC molecule capable of presenting the peptide on the cell surface and (b) a suitable T cell. In order to ensure self-tolerance of the immune system, T cells reactive to self-peptides are eliminated via negative selection processes. The composition of the T-cell repertoire thus depends on the host proteome. These complex dependencies along with a lack of data render T-cell epitope prediction a rather challenging problem. It is commonly reduced to the simpler MHC binding prediction problem. While state-of-the-art MHC binding prediction methods are highly accurate, the actual prediction of T-cell epitopes leaves room for improvement. Previously proposed approaches to T-cell epitope prediction do not take the dependencies on the host proteome into account but utilize peptide sequence information only. Their low prediction accuracies can be attributed to this limited view on T-cell reactivity and to a biased data basis. Moving from simple sequence-based predictors to predictors taking system-wide properties into account, we present a novel approach to T-cell epitope prediction combining sequence information and information on self-tolerance. In a thorough study on a small but unbiased data set, our method outperforms purely sequence-based predictors, indicating the validity of our approach.

[1]  B. Schölkopf,et al.  Accurate Splice Site Detection for Caenorhabditis elegans , 2004 .

[2]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[3]  S Brunak,et al.  Sensitive quantitative predictions of peptide-MHC binding by a 'Query by Committee' artificial neural network approach. , 2003, Tissue antigens.

[4]  Ibrahim Emam,et al.  ArrayExpress update—from an archive of functional genomics experiments to the atlas of gene expression , 2008, Nucleic Acids Res..

[5]  Dennis B. Troup,et al.  NCBI GEO: archive for high-throughput functional genomic data , 2008, Nucleic Acids Res..

[6]  H. Rammensee,et al.  SYFPEITHI: database for MHC ligands and peptide motifs , 1999, Immunogenetics.

[7]  C. Zheng,et al.  ; 0 ; , 1951 .

[8]  A. Vitiello,et al.  The relationship between class I binding affinity and immunogenicity of potential cytotoxic T cell epitopes. , 1994, Journal of immunology.

[9]  E. Birney,et al.  The International Protein Index: An integrated database for proteomics experiments , 2004, Proteomics.

[10]  James McCluskey,et al.  More than one reason to rethink the use of peptides in vaccine design , 2007, Nature Reviews Drug Discovery.

[11]  D. Mueller Mechanisms maintaining peripheral tolerance , 2010, Nature Immunology.

[12]  Shinn-Ying Ho,et al.  POPI: predicting immunogenicity of MHC class I binding peptides by mining informative physicochemical properties , 2007, Bioinform..

[13]  Gunnar Rätsch,et al.  The SHOGUN Machine Learning Toolbox , 2010, J. Mach. Learn. Res..

[14]  Gajendra P.S. Raghava,et al.  Prediction of CTL epitopes using QM, SVM and ANN techniques. , 2004, Vaccine.

[15]  O. Lund,et al.  novel sequence representations Reliable prediction of T-cell epitopes using neural networks with , 2003 .