Uses of Phage Display in Agriculture: Sequence Analysis and Comparative Modeling of Late Embryogenesis Abundant Client Proteins Suggest Protein-Nucleic Acid Binding Functionality

A group of intrinsically disordered, hydrophilic proteins—Late Embryogenesis Abundant (LEA) proteins—has been linked to survival in plants and animals in periods of stress, putatively through safeguarding enzymatic function and prevention of aggregation in times of dehydration/heat. Yet despite decades of effort, the molecular-level mechanisms defining this protective function remain unknown. A recent effort to understand LEA functionality began with the unique application of phage display, wherein phage display and biopanning over recombinant Seed Maturation Protein homologs from Arabidopsis thaliana and Glycine max were used to retrieve client proteins at two different temperatures, with one intended to represent heat stress. From this previous study, we identified 21 client proteins for which clones were recovered, sometimes repeatedly. Here, we use sequence analysis and homology modeling of the client proteins to ascertain common sequence and structural properties that may contribute to binding affinity with the protective LEA protein. Our methods uncover what appears to be a predilection for protein-nucleic acid interactions among LEA client proteins, which is suggestive of subcellular residence. The results from this initial computational study will guide future efforts to uncover the protein protective mechanisms during heat stress, potentially leading to phage-display-directed evolution of synthetic LEA molecules.

[1]  Johannes Söding,et al.  The HHpred interactive server for protein homology detection and structure prediction , 2005, Nucleic Acids Res..

[2]  Adel Golovin,et al.  MSDmotif: exploring protein sites and motifs , 2008, BMC Bioinformatics.

[3]  N. Ban,et al.  Crystal Structure of the Eukaryotic 60S Ribosomal Subunit in Complex with Initiation Factor 6 , 2011, Science.

[4]  M. A. Hemminga,et al.  Molecular mobility in the cytoplasm: an approach to describe and predict lifespan of dry germplasm. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[5]  M. Jaquinod,et al.  Structure and Function of a Mitochondrial Late Embryogenesis Abundant Protein Are Revealed by Desiccation[W] , 2007, The Plant Cell Online.

[6]  P. Gustafsson,et al.  Structural Investigation of Disordered Stress Proteins. Comparison of Full-Length Dehydrins with Isolated Peptides of Their Conserved Segments1 , 2006, Plant Physiology.

[7]  B. Séraphin,et al.  Structure of the Exon Junction Core Complex with a Trapped DEAD-Box ATPase Bound to RNA , 2006, Science.

[8]  W. Möller,et al.  Phosphate‐binding sequences in nucleotide‐binding proteins , 1985, FEBS letters.

[9]  Marco Biasini,et al.  Toward the estimation of the absolute quality of individual protein structure models , 2010, Bioinform..

[10]  Patrice Gouet,et al.  ESPript: analysis of multiple sequence alignments in PostScript , 1999, Bioinform..

[11]  A. Biegert,et al.  HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment , 2011, Nature Methods.

[12]  J. Cushman,et al.  Temperature-Induced Extended Helix/Random Coil Transitions in a Group 1 Late Embryogenesis-Abundant Protein from Soybean1 , 2002, Plant Physiology.

[13]  T. Steitz,et al.  The roles of ribosomal proteins in the structure assembly, and evolution of the large ribosomal subunit. , 2004, Journal of molecular biology.

[14]  B. Mikami,et al.  Conservation and divergence on plant seed 11S globulins based on crystal structures. , 2010, Biochimica et biophysica acta.

[15]  Torsten Schwede,et al.  BIOINFORMATICS Bioinformatics Advance Access published November 12, 2005 The SWISS-MODEL Workspace: A web-based environment for protein structure homology modelling , 2022 .

[16]  Sergey Melnikov,et al.  The Structure of the Eukaryotic Ribosome at 3.0 Å Resolution , 2011, Science.

[17]  J. Buitink,et al.  Metabolic dysfunction and unabated respiration precede the loss of membrane integrity during dehydration of germinating radicles. , 2000, Plant physiology.

[18]  A Sali,et al.  Comparative protein modeling by satisfaction of spatial restraints. , 1996, Molecular medicine today.

[19]  E V Koonin,et al.  A superfamily of ATPases with diverse functions containing either classical or deviant ATP-binding motif. , 1993, Journal of molecular biology.

[20]  J. Clegg,et al.  Cryptobiosis--a peculiar state of biological organization. , 2001, Comparative biochemistry and physiology. Part B, Biochemistry & molecular biology.

[21]  A. Tunnacliffe,et al.  LEA proteins prevent protein aggregation due to water stress. , 2005, The Biochemical journal.

[22]  M. Ginsberg,et al.  Arginyl-glycyl-aspartic acid (RGD): a cell adhesion motif. , 1991, Trends in biochemical sciences.

[23]  P. R. Sibbald,et al.  The P-loop--a common motif in ATP- and GTP-binding proteins. , 1990, Trends in biochemical sciences.

[24]  Wendell Q. Sun,et al.  CYTOPLASMIC VITRIFICATION AND SURVIVAL OF ANHYDROBIOTIC ORGANISMS , 1997 .

[25]  Daniel N. Wilson,et al.  Localization of eukaryote-specific ribosomal proteins in a 5.5-Å cryo-EM map of the 80S eukaryotic ribosome , 2010, Proceedings of the National Academy of Sciences.

[26]  D. Merkler,et al.  C-terminal amidated peptides: production by the in vitro enzymatic amidation of glycine-extended peptides and the importance of the amide to bioactivity. , 1994, Enzyme and microbial technology.

[27]  W. Vranken,et al.  Determination of the three-dimensional solution structure of Raphanus sativus antifungal protein 1 by 1H NMR. , 1998, Journal of molecular biology.

[28]  Johannes Söding,et al.  The MPI Bioinformatics Toolkit for protein sequence analysis , 2006, Nucleic Acids Res..

[29]  J. Boudet,et al.  MtPM25 is an atypical hydrophobic late embryogenesis-abundant protein that dissociates cold and desiccation-aggregated proteins. , 2010, Plant, cell & environment.

[30]  T. Close,et al.  Purification and partial characterization of a dehydrin involved in chilling tolerance during seedling emergence of cowpea. , 1999, Plant physiology.

[31]  SödingJohannes Protein homology detection by HMM--HMM comparison , 2005 .

[32]  S. Hitchcock-DeGregori,et al.  Dual requirement for flexibility and specificity for binding of the coiled-coil tropomyosin to its target, actin. , 2006, Structure.

[33]  S. Gilroy,et al.  Feeling green: mechanosensing in plants. , 2009, Trends in cell biology.

[34]  R D Appel,et al.  Protein identification and analysis tools in the ExPASy server. , 1999, Methods in molecular biology.

[35]  Torsten Schwede,et al.  The SWISS-MODEL Repository and associated resources , 2008, Nucleic Acids Res..

[36]  J. Carpenter,et al.  The role of vitrification in anhydrobiosis. , 1998, Annual review of physiology.

[37]  E. Wurtele,et al.  The embryo-specific EMB-1 protein of Daucus carota is flexible and unstructured in solution☆ , 1996 .

[38]  A. Covarrubias,et al.  Functional Analysis of the Group 4 Late Embryogenesis Abundant Proteins Reveals Their Relevance in the Adaptive Response during Water Deficit in Arabidopsis1[C][W][OA] , 2010, Plant Physiology.

[39]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[40]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[41]  H. Kalbitzer,et al.  The recombinant dehydrin-like desiccation stress protein from the resurrection plant Craterostigma plantagineum displays no defined three-dimensional structure in its native state. , 1996, Biological chemistry.

[42]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[43]  G. Galau,et al.  Developmental biochemistry of cottonseed embryogenesis and germination: changing messenger ribonucleic acid populations as shown by reciprocal heterologous complementary deoxyribonucleic acid--messenger ribonucleic acid hybridization. , 1981, Biochemistry.

[44]  Paul Horton,et al.  Nucleic Acids Research Advance Access published May 21, 2007 WoLF PSORT: protein localization predictor , 2007 .

[45]  G. Galau,et al.  Developmental biochemistry of cottonseed embryogenesis and germination: changing messenger ribonucleic acid populations as shown by in vitro and in vivo protein synthesis. , 1981, Biochemistry.

[46]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[47]  Narayanan Eswar,et al.  Structure of the mammalian 80S ribosome at 8.7 A resolution. , 2008, Structure.

[48]  M. Toner,et al.  LEA proteins during water stress: not just for plants anymore. , 2011, Annual review of physiology.

[49]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[50]  Santosh Kumar,et al.  Identification of Late Embryogenesis Abundant (LEA) Protein Putative Interactors Using Phage Display , 2012, International journal of molecular sciences.

[51]  T. Schwede,et al.  Protein structure homology modeling using SWISS-MODEL workspace , 2008, Nature Protocols.

[52]  Leucine Zipper , 2001 .

[53]  K. R. Woods,et al.  Prediction of protein antigenic determinants from amino acid sequences. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[54]  J. Cushman,et al.  Conformation of a Group 2 Late Embryogenesis Abundant Protein from Soybean. Evidence of Poly (l-Proline)-type II Structure1 , 2003, Plant Physiology.

[55]  D. W. Hughes,et al.  Abscisic acid induction of cloned cotton late embryogenesis-abundant (Lea) mRNAs , 1986, Plant Molecular Biology.

[56]  J. Buitink,et al.  Glass formation in plant anhydrobiotes: survival in the dry state. , 2004, Cryobiology.

[57]  Andrej ⩽ali,et al.  Comparative protein modeling by satisfaction of spatial restraints , 1995 .

[58]  C. Fierke,et al.  Mitochondrial ribonuclease P structure provides insight into the evolution of catalytic strategies for precursor-tRNA 5′ processing , 2012, Proceedings of the National Academy of Sciences.

[59]  G. Grant,et al.  Role of aromatic amino acids in protein-nucleic acid recognition. , 2007, Biopolymers.

[60]  J. Walker,et al.  Distantly related sequences in the alpha‐ and beta‐subunits of ATP synthase, myosin, kinases and other ATP‐requiring enzymes and a common nucleotide binding fold. , 1982, The EMBO journal.

[61]  S Subramani,et al.  Identification of Peroxisomal Targeting Signals Located at the Carboxy Terminus of Four Peroxisomal Proteins Materials and Methods Reagents , 1988 .

[62]  W. Merrick,et al.  GTP-binding domain: three consensus sequence elements with distinct spacing. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[63]  W. Wolkers,et al.  Isolation and characterization of a D-7 LEA protein from pollen that stabilizes glasses in vitro. , 2001, Biochimica et biophysica acta.

[64]  John L Markley,et al.  Solution structure of a late embryogenesis abundant protein (LEA14) from Arabidopsis thaliana, a cellular stress‐related protein , 2005, Protein science : a publication of the Protein Society.

[65]  Solution structure of the C‐terminal domain of multiprotein bridging factor 1 (MBF1) of Trichoderma reesei , 2009, Proteins.

[66]  J. Farrant,et al.  The most prevalent protein in a heat-treated extract of pea (Pisum sativum) embryos is an LEA group I protein; its conformation is not affected by exposure to high temperature , 1997, Seed Science Research.

[67]  John M. Walker,et al.  The Proteomics Protocols Handbook , 2005, Humana Press.

[68]  D. Higgins,et al.  Finding flexible patterns in unaligned protein sequences , 1995, Protein science : a publication of the Protein Society.

[69]  S. Sainsbury,et al.  The crystal structure of NGO0477 from Neisseria gonorrhoeae reveals a novel protein fold incorporating a helix‐turn‐helix motif , 2010, Proteins.

[70]  A. Mildvan,et al.  ATP-binding site of adenylate kinase: mechanistic implications of its homology with ras-encoded p21, F1-ATPase, and other nucleotide-binding proteins. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[71]  S. Clarke,et al.  Substrates of the Arabidopsis thaliana Protein Isoaspartyl Methyltransferase 1 Identified Using Phage Display and Biopanning* , 2010, The Journal of Biological Chemistry.

[72]  D. Merkler C‐Terminal Amidated Peptides: Production by the in vitro Enzymic Amidation of Glycine‐Extended Peptides and the Importance of the Amide to Bioactivity , 1994 .