The PickPocket method for predicting binding specificities for receptors based on receptor pocket similarities: application to MHC-peptide binding

MOTIVATION Receptor-ligand interactions play an important role in controlling many biological systems. One prominent example is the binding of peptides to the major histocompatibility complex (MHC) molecules controlling the onset of cellular immune responses. Thousands of MHC allelic versions exist, making determination of the binding specificity for each variant experimentally infeasible. Here, we present a method that can extrapolate from variants with known binding specificity to those where no experimental data are available. RESULTS For each position in the peptide ligand, we extracted the polymorphic pocket residues in MHC molecules that are in close proximity to the peptide residue. For MHC molecules with known specificities, we established a library of pocket-residues and corresponding binding specificities. The binding specificity for a novel MHC molecule is calculated as the average of the specificities of MHC molecules in this library weighted by the similarity of their pocket-residues to the query. This PickPocket method is demonstrated to accurately predict MHC-peptide binding for a broad range of MHC alleles, including human and non-human species. In contrast to neural network-based pan-specific methods, PickPocket was shown to be robust both when data is scarce and when the similarity to MHC molecules with characterized binding specificity is low. A consensus method combining the PickPocket and NetMHCpan methods was shown to achieve superior predictive performance. This study demonstrates how integration of diverse algorithmic approaches can lead to improved prediction. The method may also be used for making ligand-binding predictions for other types of receptors where many variants exist.

[1]  Alessandro Sette,et al.  Generating quantitative models describing the sequence specificity of biological processes with the stabilized matrix method , 2005, BMC Bioinformatics.

[2]  Partho Ghosh,et al.  The Structure and Stability of an HLA-A*0201/Octameric Tax Peptide Complex with an Empty Conserved Peptide-N-Terminal Binding Site1 , 2000, The Journal of Immunology.

[3]  Jean-Philippe Vert,et al.  Efficient peptide-MHC-I binding prediction for alleles with few known binders , 2008, Bioinform..

[4]  Ora Schueler-Furman,et al.  Learning MHC I - peptide binding , 2006, ISMB.

[5]  Vladimir Brusic,et al.  MULTIPRED: a computational system for prediction of promiscuous HLA binding peptides , 2005, Nucleic Acids Res..

[6]  Kun Yu,et al.  Methods for Prediction of Peptide Binding to MHC Molecules: A Comparative Study , 2002, Molecular medicine.

[7]  Morten Nielsen,et al.  A Community Resource Benchmarking Predictions of Peptide Binding to MHC-I Molecules , 2006, PLoS Comput. Biol..

[8]  Morten Nielsen,et al.  Pan-specific MHC class I predictors: a benchmark of HLA class I pan-specific prediction methods , 2009, Bioinform..

[9]  O. Lund,et al.  NetMHCpan, a method for MHC class I binding prediction beyond humans , 2008, Immunogenetics.

[10]  U. Şahin,et al.  Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices , 1999, Nature Biotechnology.

[11]  Bjoern Peters,et al.  A roadmap for the immunomics of category A-C pathogens. , 2005, Immunity.

[12]  S. Henikoff,et al.  Amino acid substitution matrices. , 2000, Advances in protein chemistry.

[13]  Søren Brunak,et al.  MHC Class I Epitope Binding Prediction Trained on Small Data Sets , 2004, ICARIS.

[14]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[15]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[16]  O. Lund,et al.  NetMHCpan, a Method for Quantitative Predictions of Peptide Binding to Any HLA-A and -B Locus Protein of Known Sequence , 2007, PloS one.

[17]  Vladimir Brusic,et al.  Prediction of promiscuous peptides that bind HLA class I molecules , 2002, Immunology and cell biology.

[18]  O. Lund,et al.  novel sequence representations Reliable prediction of T-cell epitopes using neural networks with , 2003 .

[19]  Yoshio Urano,et al.  Optimal alignments of biological sequences on a microcomputer , 1985, Comput. Appl. Biosci..

[20]  Morten Nielsen,et al.  Modeling the adaptive immune system: predictions and simulations , 2007, Bioinform..

[21]  James Robinson,et al.  The IMGT/HLA database , 2008, Nucleic Acids Res..

[22]  Morten Nielsen,et al.  The peptide-binding specificity of HLA-A*3001 demonstrates membership of the HLA-A3 supertype , 2008, Immunogenetics.

[23]  Bjoern Peters,et al.  Quantitative peptide binding motifs for 19 human and mouse MHC class I molecules derived using positional scanning combinatorial peptide libraries , 2008, Immunome research.

[24]  Khalid Hattaf,et al.  Modeling the adaptive immune response in HBV infection , 2011, Journal of mathematical biology.