From structure to function: the convergence of structure based models and co-evolutionary information.

Understanding protein folding and function is one of the most important problems in biological research. Energy landscape theory and the folding funnel concept have provided a framework to investigate the mechanisms associated to these processes. Since protein energy landscapes are in most cases minimally frustrated, structure based models (SMBs) have successfully determined the geometrical features associated with folding and functional transitions. However, structural information is limited, particularly with respect to different functional configurations. This is a major limitation for SBMs. Alternatively, statistical methods to study amino acid co-evolution provide information on residue-residue interactions useful for the study of structure and function. Here, we show how the combination of these two methods gives rise to a novel way to investigate the mechanisms associated with folding and function. We use this methodology to explore the mechanistic aspects of protein translocation in the integral membrane protease FtsH. Dual basin-SBM simulations using the open and closed state of this hexameric motor reveals a functionally important paddling motion in the catalytic cycle. We also find that Direct Coupling Analysis (DCA) predicts physical contacts between AAA and peptidase domains of the motor, which are crucial for the open to close transition. Our combined method, which uses structural information from the open state experimental structure and co-evolutionary couplings, suggests that this methodology can be used to explore the functional landscape of complex biological macromolecules previously inaccessible to methods dependent on experimental structural information. This efficient way to sample the conformational space of large systems creates a theoretical/computational framework capable of better characterizing the functional landscape in large biomolecular assemblies.

[1]  K. Dill,et al.  From Levinthal to pathways to funnels , 1997, Nature Structural Biology.

[2]  William R. Taylor,et al.  Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences , 2011, PloS one.

[3]  Thomas E. Creighton,et al.  Folding and binding editorial overviewCurrent Opinion in Structural Biology 1992, 2:15 , 1992 .

[4]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[5]  J. Onuchic,et al.  Protein folding funnels: a kinetic approach to the sequence-structure relationship. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[6]  P. Wolynes,et al.  The energy landscapes and motions of proteins. , 1991, Science.

[7]  Changbong Hyeon,et al.  The Origin of Minus-end Directionality and Mechanochemistry of Ncd Motors , 2012, PLoS Comput. Biol..

[8]  A. Lesk,et al.  Correlation of co-ordinated amino acid substitutions with function in viruses related to tobacco mosaic virus. , 1987, Journal of molecular biology.

[9]  R. Ranganathan,et al.  Evolutionarily conserved pathways of energetic connectivity in protein families. , 1999, Science.

[10]  A. Valencia,et al.  Emerging methods in protein co-evolution , 2013, Nature Reviews Genetics.

[11]  David Baker,et al.  Role of the Biomolecular Energy Gap in Protein Design, Structure, and Evolution , 2012, Cell.

[12]  Thomas A. Hopf,et al.  Protein 3D Structure Computed from Evolutionary Sequence Variation , 2011, PloS one.

[13]  D. Baker,et al.  Assessing the utility of coevolution-based residue–residue contact predictions in a sequence- and structure-rich era , 2013, Proceedings of the National Academy of Sciences.

[14]  D. Baker,et al.  A surprising simplicity to protein folding , 2000, Nature.

[15]  Dill,et al.  Folding and binding. , 1996, Current opinion in structural biology.

[16]  D. Thirumalai,et al.  Protein folding kinetics: timescales, pathways and energy landscapes in terms of sequence-dependent properties. , 1996, Folding & design.

[17]  K. Dill,et al.  The Protein-Folding Problem, 50 Years On , 2012, Science.

[18]  Osamu Miyashita,et al.  Conformational transitions of adenylate kinase: switching by cracking. , 2007, Journal of molecular biology.

[19]  Massimiliano Pontil,et al.  PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments , 2012, Bioinform..

[20]  Changbong Hyeon,et al.  Mechanical control of the directional stepping dynamics of the kinesin motor , 2007, Proceedings of the National Academy of Sciences.

[21]  T. Ogura,et al.  Conserved Pore Residues in the AAA Protease FtsH Are Important for Proteolysis and Its Coupling to ATP Hydrolysis* , 2003, Journal of Biological Chemistry.

[22]  J. Onuchic,et al.  Multiple-basin energy landscapes for large-amplitude conformational motions of proteins: Structure-based molecular dynamics simulations , 2006, Proceedings of the National Academy of Sciences.

[23]  Markus Meister,et al.  The molecular architecture of the metalloprotease FtsH. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Jeffrey K. Noel,et al.  SMOG@ctbp: simplified deployment of structure-based models in GROMACS , 2010, Nucleic Acids Res..

[25]  J. Onuchic,et al.  Theory of Protein Folding This Review Comes from a Themed Issue on Folding and Binding Edited Basic Concepts Perfect Funnel Landscapes and Common Features of Folding Mechanisms , 2022 .

[26]  P. Wolynes,et al.  Spin glasses and the statistical mechanics of protein folding. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[27]  C. Sander,et al.  Correlated mutations and residue contacts in proteins , 1994, Proteins.

[28]  Wolfgang Wenzel,et al.  Mirror images as naturally competing conformations in protein folding. , 2012, The journal of physical chemistry. B.

[29]  Peter G Wolynes,et al.  Protein topology determines binding mechanism. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[30]  E. Aurell,et al.  Improved contact prediction in proteins: using pseudolikelihoods to infer Potts models. , 2012, Physical review. E, Statistical, nonlinear, and soft matter physics.

[31]  Shoji Takada,et al.  Paddling mechanism for the substrate translocation by AAA+ motor revealed by multiscale molecular simulations , 2009, Proceedings of the National Academy of Sciences.

[32]  Changbong Hyeon,et al.  Internal strain regulates the nucleotide binding site of the kinesin leading head , 2007, Proceedings of the National Academy of Sciences.

[33]  U. Baumann,et al.  Structure and function of the bacterial AAA protease FtsH. , 2012, Biochimica et biophysica acta.

[34]  J. Onuchic,et al.  Funnels, pathways, and the energy landscape of protein folding: A synthesis , 1994, Proteins.

[35]  R. Zwanzig,et al.  Levinthal's paradox. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Cecilia Clementi,et al.  Quantifying the roughness on the free energy landscape: entropic bottlenecks and protein folding rates. , 2004, Journal of the American Chemical Society.

[37]  Robert D. Finn,et al.  The Pfam protein families database , 2004, Nucleic Acids Res..

[38]  Peter G Wolynes,et al.  A survey of flexible protein binding mechanisms and their transition states using native topology based energy landscapes. , 2005, Journal of molecular biology.

[39]  José N Onuchic,et al.  Slipknotting upon native-like loop formation in a trefoil knot protein , 2010, Proceedings of the National Academy of Sciences.

[40]  J. Onuchic,et al.  An all‐atom structure‐based potential for proteins: Bridging minimal models with all‐atom empirical forcefields , 2009, Proteins.

[41]  Christoph Bieniossek,et al.  The crystal structure of apo-FtsH reveals domain movements necessary for substrate unfolding and translocation , 2009, Proceedings of the National Academy of Sciences.

[42]  Thomas A. Hopf,et al.  Protein structure prediction from sequence variation , 2012, Nature Biotechnology.

[43]  José N Onuchic,et al.  Conformational Transitions in Adenylate Kinase , 2008, Journal of Biological Chemistry.

[44]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[45]  A. Fersht,et al.  Characterizing transition states in protein folding: an essential step in the puzzle. , 1995, Current opinion in structural biology.

[46]  F. Morcos,et al.  Genomics-aided structure prediction , 2012, Proceedings of the National Academy of Sciences.

[47]  J. Onuchic,et al.  Topological and energetic factors: what determines the structural details of the transition state ensemble and "en-route" intermediates for protein folding? An investigation for small globular proteins. , 2000, Journal of molecular biology.

[48]  Terence Hwa,et al.  Coevolutionary signals across protein lineages help capture multiple protein conformations , 2013, Proceedings of the National Academy of Sciences.

[49]  Peter G Wolynes,et al.  Symmetry and frustration in protein energy landscapes: a near degeneracy resolves the Rop dimer-folding mystery. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Nicholas P. Schafer,et al.  AWSEM-MD: protein structure prediction using coarse-grained physical potentials and bioinformatically based local structure biasing. , 2012, The journal of physical chemistry. B.

[51]  D Thirumalai,et al.  Exploring the energy landscape in proteins. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[52]  J. Onuchic,et al.  DIFFUSIVE DYNAMICS OF THE REACTION COORDINATE FOR PROTEIN FOLDING FUNNELS , 1996, cond-mat/9601091.

[53]  J. Onuchic,et al.  Ligand-induced global transitions in the catalytic domain of protein kinase A , 2009, Proceedings of the National Academy of Sciences.

[54]  C. Sander,et al.  Direct-coupling analysis of residue coevolution captures native contacts across many protein families , 2011, Proceedings of the National Academy of Sciences.

[55]  C. Levinthal Are there pathways for protein folding , 1968 .