Application of Genetic Search in Derivation of Matrix Models of Peptide Binding to MHC Molecules

T cells of the vertebrate immune system recognise peptides bound by major histocompatibility complex (MHC) molecules on the surface of host cells. Peptide binding to MHC molecules is necessary for immune recognition, but only a subset of peptides are capable of binding to a particular MHC molecule. Common amino acid patterns (binding motifs) have been observed in sets of peptides that bind to specific MHC molecules. Recently, matrix models for peptide/MHC interaction have been reported. These encode the rules of peptide/ MHC interactions for an individual MHC molecule as a 20 x 9 matrix where the contribution to binding of each amino acid at each position within a 9-mer peptide is quantified. The artificial intelligence techniques of genetic search and machine learning have proved to be very useful in the area of biological sequence analysis. The availability of peptide/MHC binding data can facilitate derivation of binding matrices using machine learning techniques. We performed a simulation study to determine the minimum number of peptide samples required to derive matrices, given the pre-defined accuracy of the matrix model. The matrices were derived using a genetic search. In addition, matrices for peptide binding to the human class I MHC molecules, HLA-B35 and -A24, were derived, validated by independent experimental data and compared to previously-reported matrices. The results indicate that at least 150 peptide samples are required to derive matrices of acceptable accuracy. This result is based on a maximum noise content of 5%, the availability of precise affinity measurements and that acceptable accuracy is determined by an area under the Relative Operating Characteristic curve (Aroc) of > 0.8. More than 600 peptide samples are required to derive matrices of excellent accuracy (Aroc > 0.9). Finally, we derived a human HLA-B27 binding matrix using a genetic search and 404 experimentally-tested peptides, and estimated its accuracy at Aroc > 0.88. The results of this study are expected to be of practical interest to immunologists for efficient identification of peptides as candidates for immunotherapy.

[1]  Z. Nagy,et al.  Precise prediction of major histocompatibility complex class II-peptide interaction based on peptide side chain scanning , 1994, The Journal of experimental medicine.

[2]  Vladimir Brusic,et al.  Prediction of MHC class II-binding peptides using an evolutionary algorithm and artificial neural network , 1998, Bioinform..

[3]  G. Chelvanayagam A roadmap for HLA-A, HLA-B, and HLA-C peptide binding specificities , 1996, Immunogenetics.

[4]  Mahesan Niranjan,et al.  On the Practical Applicability of VC Dimension Bounds , 1995, Neural Computation.

[5]  Vladimir Brusic,et al.  MHCPEP--a database of MHC-binding peptides: update 1995 , 1996, Nucleic Acids Res..

[6]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[7]  Umesh V. Vazirani,et al.  An Introduction to Computational Learning Theory , 1994 .

[8]  R. J. Stonier,et al.  Complex Systems: Mechanism of Adaptation , 1994 .

[9]  Vladimir Brusic,et al.  A Peptide-binding Motif for I-Ag7, the Class II Major Histocompatibility Complex (MHC) Molecule of NOD and Biozzi AB/H Mice , 1997, The Journal of experimental medicine.

[10]  Nicol N. Schraudolph,et al.  A User's Guide to GAucsd 1.4 , 1992 .

[11]  Vladimir Brusic,et al.  MHCPEP, a database of MHC-binding peptides: update 1996 , 1997, Nucleic Acids Res..

[12]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[13]  J A Koziol,et al.  Prediction of binding to MHC class I molecules. , 1995, Journal of immunological methods.

[14]  M. Torres,et al.  Nomenclature for factors of the HLA system. , 2011, Bulletin of the World Health Organization.

[15]  H. Rammensee,et al.  Peptides naturally presented by MHC class I molecules. , 1993, Annual review of immunology.

[16]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, CACM.

[17]  D. Wiley,et al.  Crystallographic analysis of endogenous peptides associated with HLA-DR1 suggests a common, polyproline II-like conformation for bound peptides. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[18]  M F del Guercio,et al.  Prominent roles of secondary anchor residues in peptide binding to HLA-A24 human class I molecules. , 1995, Journal of immunology.

[19]  Arne Svejgaard,et al.  Nomenclature for Factors of the HLA System, 1994 , 1994, Human immunology.

[20]  S Ferrone,et al.  HLA-B*3501-peptide interactions: role of anchor residues of peptides in their binding to HLA-B*3501 molecules. , 1994, International immunology.

[21]  Hidde L. Ploegh,et al.  Empty MHC class I molecules come out in the cold , 1990, Nature.

[22]  C. Schönbach,et al.  Fine tuning of peptide binding to HLA-B*3501 molecules by nonanchor residues. , 1995, Journal of immunology.

[23]  P. Cresswell,et al.  Assembly, transport, and function of MHC class II molecules. , 1994, Annual review of immunology.

[24]  Vladimir Brusic,et al.  MHCPEP, a database of MHC-binding peptides: update 1996 , 1997, Nucleic Acids Res..

[25]  W R Mayr,et al.  Nomenclature for factors of the HLA system, 1994. , 1994, Tissue antigens.

[26]  K. Parker,et al.  Scheme for ranking potential HLA-A2 binding peptides based on independent binding of individual peptide side-chains. , 1994, Journal of immunology.

[27]  Kiyoshi Miwa,et al.  Refined peptide HLA-B*3501 binding motif reveals differences in 9-mer to 11-mer peptide binding , 1996, Immunogenetics.

[28]  D. Zaller,et al.  Prediction of peptide affinity to HLA DRB1*0401. , 1994, International archives of allergy and immunology.