Reconstructing and Decomposing Protein Energy Landscapes to Organize Structure Spaces and Reveal Biologically-active States

The concept of energy landscape has become a useful construction in protein modeling due to its ability to relate structures and structural dynamics to function. While great progress is being made in probing energy landscapes, it remains unclear how to reconstruct the landscape from computed structures. Recently, our laboratories have made headway in this direction via concepts from topological and statistical analysis of spatial data. In this paper, we propose a novel approach to reconstruct the underlying energy landscape populated by computed/sampled energy-evaluated structures of a molecule and decompose it into basins of attraction. We demonstrate that such a construction not only allows deep analysis of the efficacy of a structure computation algorithm and the energy function it employs in the first place, but, more importantly, makes important steps toward addressing the open decoy selection problem in template-free protein structure prediction.

[1]  Valerio Pascucci,et al.  Computing Morse-Smale Complexes with Accurate Geometry , 2012, IEEE Transactions on Visualization and Computer Graphics.

[2]  Beatriz Pateiro López Set estimation under convexity type restrictions , 2011 .

[3]  B. McConkey,et al.  Discrimination of native protein structures using atom–atom contact scoring , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Ruth Nussinov,et al.  A second molecular biology revolution? The energy landscapes of biomolecular function. , 2014, Physical chemistry chemical physics : PCCP.

[5]  Ruth Nussinov,et al.  Principles and Overview of Sampling Methods for Modeling Macromolecular Structure and Dynamics , 2016, PLoS Comput. Biol..

[6]  Amarda Shehu,et al.  A Principled Comparative Analysis of Dimensionality Reduction Techniques on Protein Structure Decoy Data , 2016 .

[7]  Ruqian Lu,et al.  Sorting protein decoys by machine-learning-to-rank , 2016, Scientific Reports.

[8]  Anna Tramontano,et al.  Assessment of the assessment: Evaluation of the model quality estimates in CASP10 , 2014, Proteins.

[9]  Ruth Nussinov,et al.  Mapping the Conformation Space of Wildtype and Mutant H-Ras with a Memetic, Cellular, and Multiscale Evolutionary Algorithm , 2015, PLoS Comput. Biol..

[10]  Bernd Hamann,et al.  A Practical Approach to Morse-Smale Complex Computation: Scalability and Generality , 2008, IEEE Transactions on Visualization and Computer Graphics.

[11]  R. Nussinov,et al.  The role of dynamic conformational ensembles in biomolecular recognition. , 2009, Nature chemical biology.

[12]  Trilce Estrada,et al.  Automatic selection of near-native protein-ligand conformations using a hierarchical clustering and volunteer computing , 2010, BCB '10.

[13]  Chen Keasar,et al.  Purely Structural Protein Scoring Functions Using Support Vector Machine and Ensemble Learning , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[14]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[15]  Frédéric Cazals,et al.  The structural bioinformatics library: modeling in biomolecular science and beyond , 2017, Bioinform..

[16]  J. Skolnick,et al.  GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. , 2011, Biophysical journal.

[17]  Jens Meiler,et al.  ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. , 2011, Methods in enzymology.

[18]  Nasrin Akhter,et al.  From Extraction of Local Structures of Protein Energy Landscapes to Improved Decoy Selection in Template-Free Protein Structure Prediction , 2018, Molecules.

[19]  Anna Tramontano,et al.  Critical assessment of methods of protein structure prediction (CASP) — round x , 2014, Proteins.