Towards large-scale validation of protein flexibility using rigidity analysis

Proteins are dynamic molecules involved in virtually every chemical process in our bodies. Understanding how they flex and bend provides fundamental insights to their functions. At the atomic level, protein motion cannot be observed using existing experimental methods. To gain insights into these motions, simulation methods are used. However such simulations are computationally expensive. Rigidity analysis is a fast, alternative graph-based method to molecular simulations, that gives information about the flexibility properties of molecules modeled as mechanical structures. Due to the lack of convenient tools for curating protein data, the usefulness of rigidity analysis has been demonstrated on only a handful of proteins to infer several of their biophysical properties. Previous studies also relied on heuristics to determine which choice of modeling options of important stabilizing interactions allowed for extracting relevant biological observations from rigidity analysis results. Thus there is no agreed-upon choice of modeling of stabilizing interactions that is validated with experimental data. In this thesis we make progress towards large-scale validation of protein flexibility using rigidity analysis. We have developed the KINARI software to test the predictive power of using rigidity analysis to infer biophysical properties of proteins. We develop new tools for curating protein data files and for generating biological functional forms and crystal lattices of molecules. We show that rigidity analysis of these biological assemblies provides structural and functional information that would be missed if only the unprocessed data of protein structures were analyzed. To provide a proof-of-concept that rigidity analysis can be used to perform fast evaluation of in silico mutations that may not be easy to perform in vitro, we have developed KINARI-Mutagen. Finally, we perform a systematic study in which we vary how hydrogen bonds and hydrophobic interactions are modeled when constructing a mechanical framework of a protein. We propose a general method to evaluate how varying the modeling of these important inter-atomic interactions affects the degree to which rigidity parameters correlate with experimental stability data.

[1]  D. Garboczi,et al.  Structural basis of Fabry disease. , 2002, Molecular genetics and metabolism.

[2]  D. Jacobs,et al.  Protein flexibility predictions using graph theory , 2001, Proteins.

[3]  Mark Gerstein,et al.  RigidFinder: A fast and sensitive method to detect rigid blocks in large macromolecular complexes , 2010, Proteins.

[4]  Alok K. Mitra,et al.  Membrane Remodeling by the Double-Barrel Scaffolding Protein of Poxvirus , 2011, PLoS pathogens.

[5]  J. Mccammon,et al.  HIV‐1 protease molecular dynamics of a wild‐type and of the V82F/I84V mutant: Possible contributions to drug resistance and a potential new target site for drugs , 2004, Protein science : a publication of the Protein Society.

[6]  B. Alder,et al.  Studies in Molecular Dynamics. I. General Method , 1959 .

[7]  M. Karplus,et al.  Dynamics of folded proteins , 1977, Nature.

[8]  Christopher Jones Circular dichroism: Principles and applications , 1995 .

[9]  Michael R. Shirts,et al.  Atomistic protein folding simulations on the submillisecond time scale using worldwide distributed computing. , 2003, Biopolymers.

[10]  G. Laman On graphs and rigidity of plane skeletal structures , 1970 .

[11]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[12]  Brian W. Matthews,et al.  Hydrophobic stabilization in T4 lysozyme determined directly by multiple substitutions of Ile 3 , 1988, Nature.

[13]  D E Tronrud,et al.  Contributions of left-handed helical residues to the structure and stability of bacteriophage T4 lysozyme. , 1990, Journal of molecular biology.

[14]  J E Jimenez-Roldan,et al.  Comparative analysis of rigidity across protein families , 2009, Physical biology.

[15]  William C. Rose,et al.  FEEDING EXPERIMENTS WITH MIXTURES OF HIGHLY PURIFIED AMINO ACIDS VIII. ISOLATION AND IDENTIFICATION OF A NEW ESSENTIAL AMINO ACID , 1935 .

[16]  J Bello,et al.  The activity of crystalline ribonuclease A. , 1965, Biochimica et biophysica acta.

[17]  I. Bahar,et al.  Folding core predictions from network models of proteins , 2004 .

[18]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[19]  M. Senechal,et al.  Crystalline Symmetries, An informal mathematical introduction , 1990 .

[20]  Jacobs,et al.  Generic rigidity percolation: The pebble game. , 1995, Physical review letters.

[21]  Ileana Streinu,et al.  Using rigidity analysis to probe mutation-induced structural changes in proteins , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW).

[22]  A. H. Wang,et al.  The refined crystal structure of an eel pout type III antifreeze protein RD1 at 0.62-A resolution reveals structural microheterogeneity of protein and solvation. , 2003, Biophysical journal.

[23]  D. Jacobs,et al.  Protein flexibility and dynamics using constraint theory. , 2001, Journal of molecular graphics & modelling.

[24]  B. Matthews,et al.  Structure of bacteriophage T4 lysozyme refined at 1.7 A resolution. , 1987, Journal of molecular biology.

[25]  B. Matthews,et al.  Protein flexibility and adaptability seen in 25 crystal forms of T4 lysozyme. , 1995, Journal of molecular biology.

[26]  Akinori Sarai,et al.  ProTherm and ProNIT: thermodynamic databases for proteins and protein–nucleic acid interactions , 2005, Nucleic Acids Res..

[27]  Judith Klein-Seetharaman,et al.  Identification of core amino acids stabilizing rhodopsin. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[28]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[29]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[30]  M. Levitt,et al.  Accurate prediction of the stability and activity effects of site-directed mutagenesis on a protein core , 1991, Nature.

[31]  Thomas Walz,et al.  The Hexamer Structure of the Rift Valley Fever Virus Nucleoprotein Suggests a Mechanism for its Assembly into Ribonucleoprotein Complexes , 2011, PLoS pathogens.

[32]  J. Kendrew,et al.  A Three-Dimensional Model of the Myoglobin Molecule Obtained by X-Ray Analysis , 1958, Nature.

[33]  M. Teeter,et al.  Primary structure of the hydrophobic plant protein crambin. , 1981, Biochemistry.

[34]  Markus Eilers,et al.  Implications of threonine hydrogen bonding in the glycophorin A transmembrane helix dimer. , 2002, Biophysical journal.

[35]  N. Go,et al.  Studies on protein folding, unfolding and fluctuations by computer simulation. I. The effect of specific amino acid sequence represented by specific inter-unit interactions. , 2009 .

[36]  Audrey Lee-St. John,et al.  Pebble game algorithms and sparse graphs , 2007, Discret. Math..

[37]  H. Gohlke,et al.  Exploiting the Link between Protein Rigidity and Thermostability for Data‐Driven Protein Engineering , 2008 .

[38]  Mark Gerstein,et al.  MolMovDB: analysis and visualization of conformational change and structural flexibility , 2003, Nucleic Acids Res..

[39]  Ileana Streinu,et al.  Towards accurate modeling for protein rigidity analysis , 2012, 2012 IEEE 2nd International Conference on Computational Advances in Bio and medical Sciences (ICCABS).

[40]  Tiong-Seng Tay,et al.  Rigidity of multi-graphs. I. Linking rigid bodies in n-space , 1984, J. Comb. Theory, Ser. B.

[41]  T L Blundell,et al.  Prediction of the stability of protein mutants based on structural environment-dependent amino acid substitution and propensity tables. , 1997, Protein engineering.

[42]  J. Maxwell,et al.  The Scientific Papers of James Clerk Maxwell: On the Calculation of the Equilibrium and Stiffness of Frames , 1864 .

[43]  Shigeyuki Yokoyama,et al.  Crystal structure of uncharacterized protein TTHA1756 from Thermus thermophilus HB8: Structural variety in UPF0150 family proteins , 2008, Proteins.

[44]  S. L. Mayo,et al.  Automated design of the surface positions of protein helices , 1997, Protein science : a publication of the Protein Society.

[45]  Piero Fariselli,et al.  A neural-network-based method for predicting protein stability changes upon single point mutations , 2004, ISMB/ECCB.

[46]  J. Lakowicz Principles of fluorescence spectroscopy , 1983 .

[47]  Ileana Streinu,et al.  Towards biophysical validation of constraint modeling for rigidity analysis of proteins , 2012, BCB.

[48]  D. W. Noid Studies in Molecular Dynamics , 1976 .

[49]  Scott Brown,et al.  Coarse-grained sequences for protein folding and design , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[50]  W A Hendrickson,et al.  Highly ordered crystals of the plant seed protein crambin. , 1979, Journal of molecular biology.

[51]  Leslie A Kuhn,et al.  Protein unfolding: Rigidity lost , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[52]  D Gilis,et al.  Predicting protein stability changes upon mutation using database-derived potentials: solvent accessibility determines the importance of local versus non-local interactions along the sequence. , 1997, Journal of molecular biology.

[53]  Arlo Z. Randall,et al.  Prediction of protein stability changes for single‐site mutations using support vector machines , 2005, Proteins.

[54]  Michael Börsch,et al.  Proton-powered subunit rotation in single membrane-bound F0F1-ATP synthase , 2004, Nature Structural &Molecular Biology.

[55]  U. Sauer,et al.  Dissection of helix capping in T4 lysozyme by structural and thermodynamic analysis of six amino acid substitutions at Thr 59. , 1992, Biochemistry.

[56]  M. Gerstein,et al.  The morph server: a standardized system for analyzing and visualizing macromolecular motions in a database framework. , 2000, Nucleic acids research.

[57]  U Heinemann,et al.  RNase T1 mutant Glu46Gln binds the inhibitors 2'GMP and 2'AMP at the 3' subsite. , 1992, Journal of molecular biology.

[58]  Ing Ren Tsang,et al.  Cluster size diversity, percolation, and complex systems. , 1999, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[59]  Tom Alber,et al.  Contributions of hydrogen bonds of Thr 157 to the thermodynamic stability of phage T4 lysozyme , 1988, Nature.

[60]  J. Richardson,et al.  The penultimate rotamer library , 2000, Proteins.

[61]  Brian W Matthews,et al.  Contributions of all 20 amino acids at site 96 to the stability and structure of T4 lysozyme , 2009, Protein science : a publication of the Protein Society.

[62]  A. Palmer,et al.  Solution NMR spin relaxation methods for characterizing chemical exchange in high-molecular-weight systems. , 2005, Methods in enzymology.

[63]  B. Matthews,et al.  The response of T4 lysozyme to large‐to‐small substitutions within the core and its relation to the hydrophobic effect , 1998, Protein science : a publication of the Protein Society.

[64]  Maikun Teng,et al.  Purification, N-terminal sequencing, crystallization and preliminary structural determination of atratoxin-b, a short-chain alpha-neurotoxin from Naja atra venom. , 2003, Acta crystallographica. Section D, Biological crystallography.

[65]  N. Go,et al.  Studies on protein folding, unfolding, and fluctuations by computer simulation. II. A. Three‐dimensional lattice model of lysozyme , 1978 .

[66]  Oleg V. Tsodikov,et al.  Novel computer program for fast exact calculation of accessible and molecular surface areas and average surface curvature , 2002, J. Comput. Chem..

[67]  M Levitt,et al.  Stabilization of phage T4 lysozyme by engineered disulfide bonds. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[68]  B. Matthews,et al.  Temperature-sensitive mutations of bacteriophage T4 lysozyme occur at sites with low mobility and low solvent accessibility in the folded protein. , 1987, Biochemistry.

[69]  Yang Li,et al.  KINARI-Web: a server for protein rigidity analysis , 2011, Nucleic Acids Res..

[70]  A Wlodawer,et al.  Crystal structure of two covalent nucleoside derivatives of ribonuclease A. , 1991, Biochemistry.

[71]  S J Wodak,et al.  Contribution of the hydrophobic effect to protein stability: analysis based on simulations of the Ile-96----Ala mutation in barnase. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[72]  Ad Bax,et al.  Flexibility and function in HIV-1 protease , 1995, Nature Structural Biology.

[73]  B. Matthews,et al.  Response of a protein structure to cavity-creating mutations and its relation to the hydrophobic effect. , 1992, Science.

[74]  David I. Stuart,et al.  Insights into the Evolution of a Complex Virus from the Crystal Structure of Vaccinia Virus D13 , 2011, Structure.

[75]  Ileana Streinu,et al.  Periodic rigidity of protein crystal structures , 2012, 2012 IEEE 2nd International Conference on Computational Advances in Bio and medical Sciences (ICCABS).

[76]  J. Louis,et al.  Flap opening and dimer-interface flexibility in the free and inhibitor-bound HIV protease, and their implications for function. , 1999, Structure.

[77]  C. Hutchison,et al.  Mutagenesis at a specific position in a DNA sequence. , 1978, The Journal of biological chemistry.

[78]  Michael Thorpe,et al.  Flexibility in biomolecules , 2005 .

[79]  B. Rupp Biomolecular Crystallography: Principles, Practice, and Application to Structural Biology , 2009 .

[80]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[81]  S. Weiss,et al.  Single-molecule fluorescence studies of protein folding and conformational dynamics. , 2006, Chemical reviews.

[82]  Robert M. Clegg,et al.  Chapter 1 Förster resonance energy transfer—FRET what is it, why do it, and how it's done , 2009 .