Site‐Specific Isotopic Labeling (SSIL): Access to High‐Resolution Structural and Dynamic Information in Low‐Complexity Proteins

Remarkable technical progress in the area of structural biology has paved the way to study previously inaccessible targets. For example, large protein complexes can now be easily investigated by cryo‐electron microscopy, and modern high‐field NMR magnets have challenged the limits of high‐resolution characterization of proteins in solution. However, the structural and dynamic characteristics of certain proteins with important functions still cannot be probed by conventional methods. These proteins in question contain low‐complexity regions (LCRs), compositionally biased sequences where only a limited number of amino acids is repeated multiple times, which hamper their characterization. This Concept article describes a site‐specific isotopic labeling (SSIL) strategy, which combines nonsense suppression and cell‐free protein synthesis to overcome these limitations. An overview on how poly‐glutamine tracts were made amenable to high‐resolution structural studies is used to illustrate the usefulness of SSIL. Furthermore, we discuss the potential of this methodology to give further insights into the roles of LCRs in human pathologies and liquid–liquid phase separation, as well as the challenges that must be addressed in the future for the popularization of SSIL.

[1]  Farren J. Isaacs,et al.  Robust production of recombinant phosphoproteins using cell-free protein synthesis , 2015, Nature Communications.

[2]  P. Schultz,et al.  An archaebacteria-derived glutamyl-tRNA synthetase and tRNA pair for unnatural amino acid mutagenesis of proteins in Escherichia coli. , 2003, Nucleic acids research.

[3]  S. Westenhoff,et al.  Efficient Isotope Editing of Proteins for Site-Directed Vibrational Spectroscopy. , 2016, Journal of the American Chemical Society.

[4]  Christian Griesinger,et al.  Heteronuclear multidimensional NMR experiments for the structure determination of proteins in solution employing pulsed field gradients , 1999 .

[5]  G. Wagner,et al.  Emerging solution NMR methods to illuminate the structural and dynamic properties of proteins. , 2019, Current opinion in structural biology.

[6]  Douglas R. Walker,et al.  Glycine Substitutions in Collagen Heterotrimers Alter Triple Helical Assembly. , 2017, Biomacromolecules.

[7]  S. Luo,et al.  Conformation Polymorphism of Polyglutamine Proteins. , 2018, Trends in biochemical sciences.

[8]  P. Schultz,et al.  Die Erweiterung des genetischen Codes , 2005 .

[9]  Anna Zawadzka‐Kazimierczuk,et al.  High-dimensional NMR methods for intrinsically disordered proteins studies. , 2018, Methods.

[10]  Lisa D. Muiznieks,et al.  Direct observation of structure and dynamics during phase separation of an elastomeric protein , 2017, Proceedings of the National Academy of Sciences.

[11]  T. Katoh,et al.  Flexizymes for genetic code reprogramming , 2011, Nature Protocols.

[12]  Peter G. Schultz,et al.  Genomically Recoded Organisms Expand Biological Functions , 2013, Science.

[13]  Peter G Schultz,et al.  Evolution of multiple, mutually orthogonal prolyl-tRNA synthetase/tRNA pairs for unnatural amino acid mutagenesis in Escherichia coli , 2012, Proceedings of the National Academy of Sciences.

[14]  J. Hartgerink,et al.  Control of Collagen Triple Helix Stability by Phosphorylation. , 2017, Biomacromolecules.

[15]  John C. Wootton,et al.  Non-globular Domains in Protein Sequences: Automated Segmentation Using Complexity Measures , 1994, Comput. Chem..

[16]  D. Wemmer,et al.  Site-specific isotopic labeling of proteins for NMR studies , 1992 .

[17]  D. Söll,et al.  Expanding the Genetic Code of Escherichia coli with Phosphoserine , 2011, Science.

[18]  I. Felli,et al.  Novel methods based on (13)C detection to study intrinsically disordered proteins. , 2014, Journal of magnetic resonance.

[19]  H. Chan,et al.  Structural and hydrodynamic properties of an intrinsically disordered region of a germ cell-specific protein on phase separation , 2017, Proceedings of the National Academy of Sciences.

[20]  J. Hartgerink,et al.  Synthetic, Register-Specific, AAB Heterotrimers to Investigate Single Point Glycine Mutations in Osteogenesis Imperfecta. , 2016, Biomacromolecules.

[21]  Markus Zweckstetter,et al.  Automatic assignment of the intrinsically disordered protein Tau with 441-residues. , 2010, Journal of the American Chemical Society.

[22]  Peter G Schultz,et al.  Adding new chemistries to the genetic code. , 2010, Annual review of biochemistry.

[23]  E D Laue,et al.  Dual amino acid-selective and site-directed stable-isotope labeling of the human c-Ha-Ras protein by cell-free synthesis , 1998, Journal of biomolecular NMR.

[24]  H. Gamper,et al.  Amino acid–dependent stability of the acyl linkage in aminoacyl-tRNA , 2014, RNA.

[25]  O. Galzitskaya,et al.  Occurrence of disordered patterns and homorepeats in eukaryotic and bacterial proteomes. , 2012, Molecular bioSystems.

[26]  J. Wieruszeski,et al.  High-Resolution Magic Angle Spinning NMR Study of Resin-Bound Polyalanine Peptides , 2000 .

[27]  T. Huber,et al.  Mehrfache Markierung von Proteinen mit nichtnatürlichen Aminosäuren , 2012 .

[28]  J. Hartgerink,et al.  Comparative NMR analysis of collagen triple helix organization from N- to C-termini. , 2015, Biomacromolecules.

[29]  M. Diamond,et al.  Polyglutamine diseases: emerging concepts in pathogenesis and therapy. , 2007, Human molecular genetics.

[30]  Robert M Vernon,et al.  First-generation predictors of biological protein phase separation. , 2019, Current opinion in structural biology.

[31]  Nathalie Sibille,et al.  A General Strategy to Access Structural Information at Atomic Resolution in Polyglutamine Homorepeats , 2018, Angewandte Chemie.

[32]  P. Schultz,et al.  Progress toward the evolution of an organism with an expanded genetic code. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Andrey V Kajava,et al.  Protein homorepeats sequences, structures, evolution, and functions. , 2010, Advances in Protein Chemistry and Structural Biology.

[34]  Thomas Huber,et al.  Multiple-site labeling of proteins with unnatural amino acids. , 2012, Angewandte Chemie.

[35]  Gregorio Alanis-Lobato,et al.  Context characterization of amino acid homorepeats using evolution, position, and order , 2017, Proteins.

[36]  M. Blackledge,et al.  Exploring free-energy landscapes of intrinsically disordered proteins at atomic resolution using NMR spectroscopy. , 2014, Chemical reviews.

[37]  J. Wootton,et al.  Analysis of compositionally biased regions in sequence databases. , 1996, Methods in enzymology.

[38]  S. Hecht,et al.  T4 RNA ligase mediated preparation of novel "chemically misacylated" tRNAPheS. , 1984, Biochemistry.

[39]  P. Schultz,et al.  Expanding the genetic code. , 2002, Chemical communications.

[40]  Xavier Salvatella,et al.  Sequence Context Influences the Structure and Aggregation Behavior of a PolyQ Tract. , 2016, Biophysical journal.

[41]  R. Pappu,et al.  Tadpole-like Conformations of Huntingtin Exon 1 Are Characterized by Conformational Heterogeneity that Persists regardless of Polyglutamine Length. , 2018, Journal of molecular biology.

[42]  Lucio Frydman,et al.  Structure and Dynamics of the Huntingtin Exon-1 N-Terminus: A Solution NMR Perspective. , 2017, Journal of the American Chemical Society.

[43]  Martin H. Schaefer,et al.  Evolution and function of CAG/polyglutamine repeats in protein–protein interaction networks , 2012, Nucleic acids research.

[44]  Michael C Jewett,et al.  Cell-free protein synthesis from genomically recoded bacteria enables multisite incorporation of noncanonical amino acids , 2018, Nature Communications.

[45]  H. Dyson,et al.  Unfolded proteins and protein folding studied by NMR. , 2004, Chemical reviews.

[46]  Hue Sun Chan,et al.  Theories for Sequence-Dependent Phase Behaviors of Biomolecular Condensates. , 2018, Biochemistry.

[47]  M. Kainosho,et al.  Stable isotope labeling methods for protein NMR spectroscopy , 2008 .

[48]  H. Inokuchi,et al.  Co-expression of yeast amber suppressor tRNATyr and tyrosyl-tRNA synthetase in Escherichia coli: possibility to expand the genetic code. , 1998, Journal of biochemistry.

[49]  H. Paulson,et al.  Polyglutamine neurodegeneration: protein misfolding revisited , 2008, Trends in Neurosciences.

[50]  Michael C. Jewett,et al.  Cell-free Protein Synthesis from a Release Factor 1 Deficient Escherichia coli Activates Efficient and Multiple Site-specific Nonstandard Amino Acid Incorporation , 2013, ACS synthetic biology.

[51]  V. Sklenar,et al.  Toward optimal-resolution NMR of intrinsically disordered proteins. , 2014, Journal of magnetic resonance.

[52]  Vladimir N Uversky,et al.  Intrinsically disordered proteins in overcrowded milieu: Membrane-less organelles, phase separation, and intrinsic disorder. , 2017, Current opinion in structural biology.

[53]  P. Schultz,et al.  Adaptation of an orthogonal archaeal leucyl-tRNA and synthetase pair for four-base, amber, and opal suppression. , 2003, Biochemistry.

[54]  Nicholas E. Dixon,et al.  15N‐Labelled proteins by cell‐free protein synthesis , 2006 .

[55]  Stephen W. Michnick,et al.  Mechanisms and Consequences of Macromolecular Phase Separation , 2016, Cell.

[56]  A. Munnich,et al.  Polyalanine expansions in human. , 2004, Human molecular genetics.

[57]  C. Ponting,et al.  Protein repeats: structures, functions, and evolution. , 2001, Journal of structural biology.

[58]  A. Meola,et al.  Robust and low cost uniform (15)N-labeling of proteins expressed in Drosophila S2 cells and Spodoptera frugiperda Sf9 cells for NMR applications. , 2014, Journal of structural biology.

[59]  P G Schultz,et al.  A general method for site-specific incorporation of unnatural amino acids into proteins. , 1989, Science.

[60]  Nicolas L. Fawzi,et al.  Residue-by-Residue View of In Vitro FUS Granules that Bind the C-Terminal Domain of RNA Polymerase II. , 2015, Molecular cell.

[61]  V. Dötsch,et al.  Protein labeling strategies for liquid-state NMR spectroscopy using cell-free synthesis. , 2018, Progress in nuclear magnetic resonance spectroscopy.

[62]  F. Walker Huntington's disease , 2007, The Lancet.

[63]  Ling Peng,et al.  19F NMR: a valuable tool for studying biological events. , 2013, Chemical Society reviews.

[64]  Vladimir N. Uversky,et al.  Intrinsic Disorder in Proteins with Pathogenic Repeat Expansions , 2017, Molecules.

[65]  P. Schultz,et al.  A New Orthogonal Suppressor tRNA/Aminoacyl‐tRNA Synthetase Pair for Evolving an Organism with an Expanded Genetic Code , 2000 .

[66]  P. Schimmel,et al.  Breaking sieve for steric exclusion of a noncognate amino acid from active site of a tRNA synthetase. , 2005, Proceedings of the National Academy of Sciences of the United States of America.