From mutations to mechanisms and dysfunction via computation and mining of protein energy landscapes

Background: The protein energy landscape underscores the inherent nature of proteins as dynamic molecules interconverting between structures with varying energies. Reconstructing a protein’s energy landscape holds the key to characterizing the protein’s equilibrium conformational dynamics and its relationship to function. Many pathogenic mutations in protein sequences alter the equilibrium dynamics that regulate molecular interactions and thus protein function. In principle, reconstructing the energy landscapes of a protein’s healthy and diseased variants is a central step toward understanding how mutations impact dynamics, biological mechanisms, and function.

Results: Recent computational advances are yielding detailed, sample-based representations of protein energy landscapes. In this paper, we propose and describe two novel methods that use these computed, sample-based representations to reconstruct landscapes and to extract from them informative local structures that reveal the underlying organization of an energy landscape. Such structures constitute landscape features that, as we demonstrate here, can be used to detect alterations of landscapes upon mutation.

Conclusions: The proposed methods detect altered protein energy landscape features in response to sequence mutations. By doing so, the methods allow formulating hypotheses on the impact of mutations on specific biological activities of a protein. This work demonstrates that the availability of energy landscapes of healthy and diseased variants of a protein opens up new avenues to harness the quantitative information embedded in landscapes and to summarize the mechanisms by which mutations alter protein dynamics and ultimately lead to dysfunction.
