A curated gluten protein sequence database to support development of proteomics methods for determination of gluten in gluten-free foods

The unique physiochemical properties of wheat gluten enable a diverse range of food products to be manufactured. However, gluten triggers coeliac disease, a condition which is treated using a gluten-free diet. Analytical methods are required to confirm if foods are gluten-free, but current immunoassay-based methods can unreliable and proteomic methods offer an alternative but require comprehensive and well annotated sequence databases which are lacking for gluten. A manually a curated database (GluPro V1.0) of gluten proteins, comprising 630 discrete unique full length protein sequences has been compiled. It is representative of the different types of gliadin and glutenin components found in gluten. An in silico comparison of their coeliac toxicity was undertaken by analysing the distribution of coeliac toxic motifs. This demonstrated that whilst the α-gliadin proteins contained more toxic motifs, these were distributed across all gluten protein sub-types. Comparison of annotations observed using a discovery proteomics dataset acquired using ion mobility MS/MS showed that more reliable identifications were obtained using the GluPro V1.0 database compared to the complete reviewed Viridiplantae database. This highlights the value of a curated sequence database specifically designed to support the proteomic workflows and the development of methods to detect and quantify gluten. Significance We have constructed the first manually curated open-source wheat gluten protein sequence database (GluPro V1.0) in a FASTA format to support the application of proteomic methods for gluten protein detection and quantification. We have also analysed the manually verified sequences to give the first comprehensive overview of the distribution of sequences able to elicit a reaction in coeliac disease, the prevalent form of gluten intolerance. Provision of this database will improve the reliability of gluten protein identification by proteomic analysis, and aid the development of targeted mass spectrometry methods in line with Codex Alimentarius Commission requirements for foods designed to meet the needs of gluten intolerant individuals.

[1]  B. Popping,et al.  Gluten and gluten-free: issues and considerations of labeling regulations, detection methods, and assay validation. , 2012, Journal of AOAC International.

[2]  E. Nevo,et al.  Classification of wheat low-molecular-weight glutenin subunit genes and its chromosome assignment by developing LMW-GS group-specific primers , 2005, Theoretical and Applied Genetics.

[3]  Ismael Padioleau,et al.  Celiac disease T-cell epitopes from gamma-gliadins: immunoreactivity depends on the genome of origin, transcript frequency, and flanking protein variation , 2012, BMC Genomics.

[4]  H. Tlaskalova-Hogenova,et al.  Characterization of human, mouse and rabbit anti-gliadin antibodies by ELISA and western blotting , 2008, Folia Microbiologica.

[5]  L. Sollid,et al.  Prolyl Endopeptidase-Mediated Destruction of T Cell Epitopes in Whole Gluten: Chemical and Immunological Characterization , 2005, Journal of Pharmacology and Experimental Therapeutics.

[6]  Steven P Gygi,et al.  Comparative evaluation of mass spectrometry platforms used in large-scale proteomics investigations , 2005, Nature Methods.

[7]  Priyanka Garg,et al.  Manually curated database of rice proteins , 2013, Nucleic Acids Res..

[8]  A. Iwaniak,et al.  BIOPEP database and other programs for processing bioactive peptide sequences. , 2008, Journal of AOAC International.

[9]  Anuradha Balasundaram,et al.  Quantitative Proteomic Profiling of Peanut Allergens in Food Ingredients Used for Oral Food Challenges. , 2016, Analytical chemistry.

[10]  T. Mothes,et al.  Monoclonal antibody R5 for detection of putatively coeliac-toxic gliadin peptides , 2006 .

[11]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[12]  L. Valledor,et al.  Improving the quality of protein identification in non-model species. Characterization of Quercus ilex seed and Pinus radiata needle proteomes by using SEQUEST and custom databases. , 2014, Journal of proteomics.

[13]  Angéla Juhász,et al.  ProPepper: a curated database for identification and analysis of peptide and immune-responsive epitope composition of cereal grain protein families , 2015, Database J. Biol. Databases Curation.

[14]  P. Shewry,et al.  N-terminal amino acid sequences of ω-gliadins and ω-secalins: Implications for the evolution of prolamin genes , 1983 .

[15]  P. Shewry,et al.  Improving wheat to remove coeliac epitopes but retain functionality , 2016, Journal of cereal science.

[16]  F. Pineau,et al.  Identification of IgE‐binding epitopes on gliadins for patients with food allergy to wheat , 2005, Allergy.

[17]  Yuge Li,et al.  Molecular characterization of α-gliadin genes from common wheat cultivar Zhengmai 004 and their role in quality and celiac disease , 2014 .

[18]  Y. Gu,et al.  The wheat ω-gliadin genes: structure and EST analysis , 2009, Functional & Integrative Genomics.

[19]  P. I. Payne Genetics of Wheat Storage Proteins and the Effect of Allelic Variation on Bread-Making Quality , 1987 .

[20]  H. Wieser,et al.  Gluten and wheat sensitivities – An overview , 2016 .

[21]  Katharina Anne Scherf,et al.  Recent developments in analytical methods for tracing gluten , 2016 .

[22]  P. Ferranti,et al.  Proteomic analysis in allergy and intolerance to wheat products , 2011, Expert review of proteomics.

[23]  M. Mann,et al.  Unbiased quantitative proteomics of lipid rafts reveals high specificity for signaling factors , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[24]  V. Raia,et al.  Association between innate response to gliadin and activation of pathogenic T cells in coeliac disease , 2003, The Lancet.

[25]  Steven P Gygi,et al.  A proteomics approach to understanding protein ubiquitination , 2003, Nature Biotechnology.

[26]  J. Callahan,et al.  Characterization of grain-specific peptide markers for the detection of gluten by mass spectrometry. , 2014, Journal of agricultural and food chemistry.

[27]  E. Morita,et al.  Specific IgE Determination to Epitope Peptides of ω-5 Gliadin and High Molecular Weight Glutenin Subunit Is a Useful Tool for Diagnosis of Wheat-Dependent Exercise-Induced Anaphylaxis1 , 2005, The Journal of Immunology.

[28]  S. Denery-Papini,et al.  Allergy to deamidated gluten in patients tolerant to wheat: specific epitopes linked to deamidation , 2012, Allergy.

[29]  J. Drijfhout,et al.  Natural variation in toxicity of wheat: potential for selection of nontoxic varieties for celiac disease patients. , 2005, Gastroenterology.

[30]  Michel Schneider,et al.  The UniProtKB/Swiss-Prot knowledgebase and its Plant Proteome Annotation Program. , 2009, Journal of proteomics.

[31]  D. Edwards,et al.  WheatGenome.info: an integrated database and portal for wheat genome information. , 2012, Plant & cell physiology.

[32]  Rodrigo Lopez,et al.  Analysis Tool Web Services from the EMBL-EBI , 2013, Nucleic Acids Res..

[33]  F. Koning,et al.  Nomenclature and listing of celiac disease relevant gluten T-cell epitopes restricted by HLA-DQ molecules , 2012, Immunogenetics.

[34]  Qi Sun,et al.  PPDB, the Plant Proteomics Database at Cornell , 2008, Nucleic Acids Res..

[35]  G R Corazza,et al.  The immune recognition of gluten in coeliac disease , 2005, Clinical and experimental immunology.

[36]  M. Mitreva,et al.  Alpha-gliadin genes from the A, B, and D genomes of wheat contain different sets of celiac disease epitopes , 2006, BMC Genomics.

[37]  Zoltan Takats,et al.  Identification of the Species of Origin for Meat Products by Rapid Evaporative Ionization Mass Spectrometry. , 2016, Journal of agricultural and food chemistry.

[38]  J. Heazlewood The Green Proteome: Challenges in Plant Proteomics , 2011, Front. Plant Sci..

[39]  Geoffrey J. Barton,et al.  Jalview Version 2—a multiple sequence alignment editor and analysis workbench , 2009, Bioinform..

[40]  M. Rossignol,et al.  Plant proteome analysis: A 2004–2006 update , 2006, Proteomics.

[41]  F. Ulberth,et al.  Analytical methods for detection of gluten in food--method developments in support of food labeling legislation. , 2011, Journal of AOAC International.

[42]  Peter R. Shewry,et al.  Wheat: chemistry and technology. , 2009 .

[43]  D. Lafiandra,et al.  Characterization of a low-molecular-weight glutenin subunit gene from bread wheat and the corresponding protein that represents a major subunit of the glutenin polymer. , 1998, Plant physiology.

[44]  M. Barać,et al.  Characterization of Proteins from Grain of Different Bread and Durum Wheat Genotypes , 2011, International journal of molecular sciences.

[45]  Lennart Martens,et al.  DBToolkit: processing protein databases for peptide-centric proteomics , 2005, Bioinform..

[46]  P. Shewry,et al.  Comprehensive Proteomic Profiling of Wheat Gluten Using a Combination of Data-Independent and Data-Dependent Acquisition , 2017, Front. Plant Sci..

[47]  Aviaja Anna Hansen,et al.  MiDAS: the field guide to the microbes of activated sludge , 2015, Database J. Biol. Databases Curation.

[48]  Josep Rubert,et al.  Saffron authentication based on liquid chromatography high resolution tandem mass spectrometry and multivariate data analysis. , 2016, Food chemistry.

[49]  Conrad Bessant,et al.  Current perspectives and recommendations for the development of mass spectrometry methods for the determination of allergens in foods. , 2011, Journal of AOAC International.

[50]  S. Moose,et al.  A maize zinc-finger protein binds the prolamin box in zein gene promoters and interacts with the basic leucine zipper transcriptional activator Opaque2. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[51]  P. Shewry,et al.  The prolamin storage proteins of cereal seeds: structure and evolution. , 1990, The Biochemical journal.

[52]  B. Bakan,et al.  Immunoglobulin‐E‐binding epitopes of wheat allergens in patients with food allergy to wheat and in mice experimentally sensitized to wheat proteins , 2011, Clinical and experimental allergy : journal of the British Society for Allergy and Clinical Immunology.