Clustering of Protein Conformations Using Parallelized Dimensionality Reduction

—Ascertaining the conformational landscape of a macromolecule, like protein is indispensable to understanding its characteristics and functions. In this work, an amassment of these techniques is presented, that would be an aid in sampling of these conformations better and faster. The datasets that represent these conformational dynamics of proteins are complex and high dimensional. Therefore, there arises a need for dimensionality reduction methods that best conserve the variance and further the analysis of the data. We present a parallelized version of a well-known dimensionality reduction method, Isomap. Isomap has been shown to produce better results than linear dimensionality reduction in approximating the complex landscape of protein folding. However, the algorithm is compute-intensive for large proteins or a large number of samples, used to model a path that a protein undergoes. We present an algorithm, parallelized using OpenMP, with a speed-up of approximately twice. The results are in agreement with the ones obtained using sequential Isomap.

[1]  Zbigniew J. Czech,et al.  Introduction to Parallel Computing , 2017 .

[2]  Nurit Haspel,et al.  Detecting intermediate protein conformations using algebraic topology , 2016, BMC Bioinformatics.

[3]  Rafael Najmanovich,et al.  ENCoM server: exploring protein conformational space and the effect of mutations on protein function and stability , 2015, Nucleic Acids Res..

[4]  T. Siméon,et al.  Modeling protein conformational transitions by a combination of coarse-grained normal mode analysis and robotics-inspired methods , 2013, BMC Structural Biology.

[5]  Zhiyuan Yan,et al.  An Implementation of Parallel Floyd-Warshall Algorithm Based on Hybrid MPI and OpenMP , 2012 .

[6]  Amarda Shehu,et al.  Guiding the Search for Native-like Protein Conformations with an Ab-initio Tree-based Exploration , 2010, Int. J. Robotics Res..

[7]  L. Kavraki,et al.  Tracing conformational changes in proteins , 2009, 2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop.

[8]  Guang Song,et al.  Protein elastic network models and the ranges of cooperativity , 2009, Proceedings of the National Academy of Sciences.

[9]  Ora Schueler-Furman,et al.  Rapid Sampling of Molecular Motions with Prior Information Constraints , 2009, PLoS Comput. Biol..

[10]  Dahlia R. Weiss,et al.  Can morphing methods predict intermediate structures? , 2009, Journal of molecular biology.

[11]  Ameet Talwalkar,et al.  Large-scale manifold learning , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Michael Levitt,et al.  Combining efficient conformational sampling with a deformable elastic network model facilitates structure refinement at low resolution. , 2007, Structure.

[13]  Lydia E Kavraki,et al.  Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction , 2006, Proc. Natl. Acad. Sci. USA.

[14]  Holger Gohlke,et al.  The Amber biomolecular simulation programs , 2005, J. Comput. Chem..

[15]  Wenjun Zheng,et al.  Identification of dynamical correlations within the myosin motor domain by the normal mode analysis of an elastic network model. , 2005, Journal of molecular biology.

[16]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[17]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[18]  P. Gács,et al.  Algorithms , 1992 .

[19]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[20]  Amir Vajdi,et al.  A new DP algorithm for comparing gene expression data using geometric similarity , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[21]  W. Marsden I and J , 2012 .

[22]  Eric O. Postma,et al.  Dimensionality Reduction: A Comparative Review , 2008 .

[23]  Joshua B. Tenenbaum,et al.  Global Versus Local Methods in Nonlinear Dimensionality Reduction , 2002, NIPS.