The value of short amino acid sequence matches for prediction of protein allergenicity.

Typically, genetically engineered crops contain traits encoded by one or a few newly expressed proteins. The allergenicity assessment of newly expressed proteins is an important component in the safety evaluation of genetically engineered plants. One aspect of this assessment involves sequence searches that compare the amino acid sequence of the protein to all known allergens. Analyses are performed to determine the potential for immunologically based cross-reactivity where IgE directed against a known allergen could bind to the protein and elicit a clinical reaction in sensitized individuals. Bioinformatic searches are designed to detect global sequence similarity and short contiguous amino acid sequence identity. It has been suggested that potential allergen cross-reactivity may be predicted by identifying matches as short as six to eight contiguous amino acids between the protein of interest and a known allergen. A series of analyses were performed, and match probabilities were calculated for different size peptides to determine if there was a scientifically justified search window size that identified allergen sequence characteristics. Four probability modeling methods were tested: (1) a mock protein and a mock allergen database, (2) a mock protein and genuine allergen database, (3) a genuine allergen and genuine protein database, and (4) a genuine allergen and genuine protein database combined with a correction for repeating peptides. These analyses indicated that searches for short amino acid sequence matches of eight amino acids or fewer to identify proteins as potential cross-reactive allergens is a product of chance and adds little value to allergy assessments for newly expressed proteins.

[1]  A. Burks,et al.  Molecular cloning and epitope analysis of the peanut allergen Ara h 3. , 1999, The Journal of clinical investigation.

[2]  W. Pearson Effective protein sequence comparison. , 1996, Methods in enzymology.

[3]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[4]  G. Reese,et al.  Tropomyosin: An Invertebrate Pan–Allergen , 1999, International Archives of Allergy and Immunology.

[5]  S. Taylor,et al.  Assessment of the allergenic potential of foods derived from genetically engineered crop plants. , 1996, Critical reviews in food science and nutrition.

[6]  • Codex Guideline for the Conduct of Food Safety Assessment of Foods Derived from Recombinant-DNA Plants , 2003 .

[7]  A. Burks,et al.  Biochemical and Structural Analysis of the IgE Binding Sites on Ara h1, an Abundant and Highly Allergenic Peanut Protein* , 1998, The Journal of Biological Chemistry.

[8]  H. Sampson,et al.  Identification of IgE- and IgG-binding epitopes on alpha(s1)-casein: differences in patients with persistent and transient cow's milk allergy. , 2001, The Journal of allergy and clinical immunology.

[9]  G. Kleter,et al.  Screening of transgenic proteins expressed in transgenic food crops for the presence of short amino acid sequences identical to potential, IgE – binding linear epitopes of allergens , 2002, BMC Structural Biology.

[10]  R. Simon,et al.  Food biotechnology and genetic engineering. , 1996 .

[11]  K. R. Woods,et al.  A computer program for predicting protein antigenic determinants. , 1983, Molecular immunology.

[12]  D. Beezhold,et al.  Human IgE-binding epitopes of the latex allergen Hev b 5. , 1999, The Journal of allergy and clinical immunology.

[13]  A. Silvanovich,et al.  Bioinformatic Methods for Allergenicity Assessment Using a Comprehensive Allergen Database , 2002, International Archives of Allergy and Immunology.

[14]  Michael B. Stadler,et al.  Allergenicity prediction by protein sequence , 2003, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[15]  R. Aalberse,et al.  Structural biology of allergens. , 2000, The Journal of allergy and clinical immunology.

[16]  Richard E. Goodman,et al.  Bioinformatic Methods for Identifying Known or Potential Allergens in the Safety Assessment of Genetically Modified Crops , 2002 .

[17]  V. Kurup,et al.  Conformational and Linear B-Cell Epitopes of Asp f 2, a Major Allergen of Aspergillus fumigatus, Bind Differently to Immunoglobulin E Antibody in the Sera of Allergic Bronchopulmonary Aspergillosis Patients , 1999, Infection and Immunity.