Constraint Network Analysis (CNA): A Python Software Package for Efficiently Linking Biomacromolecular Structure, Flexibility, (Thermo-)Stability, and Function

To derive maximal advantage from information on biomacromolecular flexibility and rigidity, results from rigidity analyses must be linked to biologically relevant characteristics of a structure. Here, we describe the Python-based software package Constraint Network Analysis (CNA), developed for this task. CNA functions as a front end and back end to the graph-based rigidity analysis software FIRST. CNA goes beyond the mere identification of flexible and rigid regions in a biomacromolecule in that it (I) provides a refined modeling of thermal unfolding simulations that also considers the temperature dependence of hydrophobic tethers, (II) allows rigidity analyses to be performed on ensembles of network topologies, generated either from structural ensembles or by using the concept of fuzzy noncovalent constraints, and (III) computes a set of global and local indices for quantifying biomacromolecular stability. This leads to more robust results from rigidity analyses and extends their application domain, in that phase transition points ("melting points") and unfolding nuclei ("structural weak spots") are determined automatically. Furthermore, CNA robustly handles small-molecule ligands in general. Such advancements are important for applying rigidity analysis to data-driven protein engineering and for estimating the influence of ligand molecules on biomacromolecular stability. CNA maintains the efficiency of FIRST: the analysis of a single protein structure of several hundred residues takes a few seconds on a single core. These features make CNA well suited for linking biomacromolecular structure, flexibility, (thermo-)stability, and function. CNA is available from http://cpclab.uni-duesseldorf.de/software for nonprofit organizations.
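The idea of locating a phase transition point along a thermal unfolding simulation can be illustrated with a minimal sketch. It is not CNA's or FIRST's implementation or API: the function names `constraints_at_cutoff` and `largest_step`, the hydrogen-bond energies, and the index values are hypothetical, and taking the largest step in a global index as the transition is a simplifying assumption made only for illustration. The sketch assumes that unfolding is mimicked by discarding hydrogen bonds weaker than a progressively stricter energy cutoff, and that a global index has already been computed for each resulting network.

```python
from typing import List, Sequence, Tuple


def constraints_at_cutoff(hbond_energies: Sequence[float], e_cut: float) -> List[float]:
    """Keep hydrogen bonds at least as strong as e_cut (more negative = stronger).

    Hypothetical helper: lowering e_cut step by step removes ever stronger
    bonds from the constraint network, which mimics heating.
    """
    return [e for e in hbond_energies if e <= e_cut]


def largest_step(e_cuts: Sequence[float], index_values: Sequence[float]) -> Tuple[float, float]:
    """Return (cutoff reached after the largest jump, size of that jump).

    Assumption for illustration: the transition is taken to be where a
    precomputed global index changes most between consecutive cutoffs.
    """
    if len(e_cuts) != len(index_values) or len(e_cuts) < 2:
        raise ValueError("need two or more cutoffs with one index value each")
    steps = [
        abs(index_values[i + 1] - index_values[i])
        for i in range(len(index_values) - 1)
    ]
    i = max(range(len(steps)), key=steps.__getitem__)
    return e_cuts[i + 1], steps[i]


# Made-up data: hydrogen-bond energies and a global index per cutoff.
hbonds = [-0.3, -1.2, -2.7, -4.0]
kept = constraints_at_cutoff(hbonds, -1.0)

cutoffs = [-0.5, -1.0, -1.5, -2.0, -2.5]
index = [0.10, 0.15, 0.20, 0.90, 1.00]
transition_cutoff, jump = largest_step(cutoffs, index)
```

In a real analysis the index values would come from a rigidity analysis of each diluted network rather than being supplied by hand.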
