On the relation between residue flexibility and local solvent accessibility in proteins

We investigate the relationship between the flexibility, expressed with B‐factor, and the relative solvent accessibility (RSA) in the context of local, with respect to the sequence, neighborhood and related concepts such as residue depth. We observe that the flexibility of a given residue is strongly influenced by the solvent accessibility of the adjacent neighbors. The mean normalized B‐factor of the exposed residues with two buried neighbors is smaller than that of the buried residues with two exposed neighbors. Inclusion of RSA of the neighboring residues (local RSA) significantly increases correlation with the B‐factor. Correlation between the local RSA and B‐factor is shown to be stronger than the correlation that considers local distance‐ or volume‐based residue depth. We also found that the correlation coefficients between B‐factor and RSA for the 20 amino acids, called flexibility‐exposure correlation index, are strongly correlated with the stability scale that characterizes the average contributions of each amino acid to the folding stability. Our results reveal that the predicted RSA could be used to distinguish between the disordered and ordered residues and that the inclusion of local predicted RSA values helps providing a better contrast between these two types of residues. Prediction models developed based on local actual RSA and local predicted RSA show similar or better results in the context of B‐factor and disorder predictions when compared with several existing approaches. We validate our models using three case studies, which show that this work provides useful clues for deciphering the structure–flexibility–function relation. Proteins 2009. © 2009 Wiley‐Liss, Inc.

[1]  F M Poulsen,et al.  A nuclear magnetic resonance study of the hydrogen-exchange behaviour of lysozyme in crystals and solution. , 1991, Journal of molecular biology.

[2]  Jaroslav Koca,et al.  Functional flexibility of human cyclin‐dependent kinase‐2 and its evolutionary conservation , 2007, Protein science : a publication of the Protein Society.

[3]  G. Phillips,et al.  Dynamics of proteins in crystals: comparison of experiment with simple models. , 2002, Biophysical journal.

[4]  P Argos,et al.  Accessibility to internal cavities and ligand binding sites monitored by protein crystallographic thermal factors , 1998, Proteins.

[5]  Yaoqi Zhou,et al.  Real‐SPINE: An integrated system of neural networks for real‐value prediction of protein structural properties , 2007, Proteins.

[6]  Song Liu,et al.  Fold recognition by concurrent use of solvent accessibility and residue depth , 2007, Proteins.

[7]  Shaomeng Wang,et al.  M-score: a knowledge-based potential scoring function accounting for protein atom mobility. , 2006, Journal of medicinal chemistry.

[8]  M Karplus,et al.  The internal dynamics of globular proteins. , 1981, CRC critical reviews in biochemistry.

[9]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[10]  Lorna J. Smith,et al.  Long-Range Interactions Within a Nonnative Protein , 2002, Science.

[11]  L. Kay,et al.  Intrinsic dynamics of an enzyme underlies catalysis , 2005, Nature.

[12]  R. Stolarski,et al.  Stacking efficiency and flexibility analysis of aromatic amino acids in cap‐binding proteins , 2008, Proteins.

[13]  Zheng Rong Yang,et al.  RONN: the bio-basis function neural network technique applied to the detection of natively disordered regions in proteins , 2005, Bioinform..

[14]  P. Radivojac,et al.  Protein flexibility and intrinsic disorder , 2004, Protein science : a publication of the Protein Society.

[15]  P. Radivojac,et al.  Improved amino acid flexibility parameters , 2003, Protein science : a publication of the Protein Society.

[16]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[17]  Hiroyuki Ogata,et al.  AAindex: Amino Acid Index Database , 1999, Nucleic Acids Res..

[18]  Gail J. Bartlett,et al.  Using a neural network and spatial clustering to predict the location of active sites in enzymes. , 2003, Journal of molecular biology.

[19]  P Argos,et al.  Correlation between side chain mobility and conformation in protein structures. , 1997, Protein engineering.

[20]  M. L. Connolly Solvent-accessible surfaces of proteins and nucleic acids. , 1983, Science.

[21]  M. Gromiha,et al.  Real value prediction of solvent accessibility from amino acid sequence , 2003, Proteins.

[22]  Zheng Yuan,et al.  Flexibility analysis of enzyme active sites by crystallographic temperature factors. , 2003, Protein engineering.

[23]  Lukasz Kurgan,et al.  Sequence-Based Methods for Real Value Predictions of Protein Structure , 2008 .

[24]  Andrea Bernini,et al.  Three-dimensional computation of atom depth in complex molecular structures , 2005, Bioinform..

[25]  Christopher J. Oldfield,et al.  Intrinsic disorder and functional proteomics. , 2007, Biophysical journal.

[26]  Tamotsu Noguchi,et al.  PDB-REPRDB: a database of representative protein chains from the Protein Data Bank (PDB) in 2003 , 2003, Nucleic Acids Res..

[27]  Pierre Baldi,et al.  A machine learning information retrieval approach to protein fold recognition. , 2006, Bioinformatics.

[28]  A. Atilgan,et al.  Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential. , 1997, Folding & design.

[29]  Avner Schlessinger,et al.  PROFbval: predict flexible and rigid residues in proteins , 2006, Bioinform..

[30]  M. Sanner,et al.  Reduced surface: an efficient way to compute molecular surfaces. , 1996, Biopolymers.

[31]  S. Parthasarathy,et al.  Analysis of temperature factor distribution in high‐resolution protein structures , 1997, Protein science : a publication of the Protein Society.

[32]  Christine A. Orengo,et al.  Inferring Function Using Patterns of Native Disorder in Proteins , 2007, PLoS Comput. Biol..

[33]  A. Atilgan,et al.  Small-world communication of residues and significance for protein dynamics. , 2003, Biophysical journal.

[34]  Hongyi Zhou,et al.  Quantifying the effect of burial of amino acid residues on protein stability , 2003, Proteins.

[35]  B. Rost,et al.  Protein flexibility and rigidity predicted from sequence , 2005, Proteins.

[36]  Shao-Wei Huang,et al.  Deriving protein dynamical properties from weighted protein contact number , 2008, Proteins.

[37]  Guang Song,et al.  How well can we understand large-scale protein motions using normal modes of elastic network models? , 2007, Biophysical journal.

[38]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[39]  I. Bahar,et al.  Coupling between catalytic site and collective dynamics: a requirement for mechanochemical activity of enzymes. , 2005, Structure.

[40]  Zheng Yuan,et al.  Prediction of protein accessible surface areas by support vector regression , 2004, Proteins.

[41]  Yaoqi Zhou,et al.  Protein flexibility prediction by an all‐atom mean‐field statistical theory , 2005, Protein science : a publication of the Protein Society.

[42]  Peter M Fischer,et al.  Differential binding of inhibitors to active and inactive CDK2 provides insights for drug design. , 2006, Chemistry & biology.

[43]  S. Teague Implications of protein flexibility for drug discovery , 2003, Nature Reviews Drug Discovery.

[44]  I. Bahar,et al.  Structural changes involved in protein binding correlate with intrinsic motions of proteins in the unbound state. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[45]  Andrew F Neuwald,et al.  Evolutionary constraints associated with functional specificity of the CMGC protein kinases MAPK, CDK, GSK, SRPK, DYRK, and CK2α , 2004, Protein science : a publication of the Protein Society.

[46]  Zoran Obradovic,et al.  The protein trinity—linking function and disorder , 2001, Nature Biotechnology.

[47]  F. Veronese,et al.  Site-specific modification and PEGylation of pharmaceutical proteins mediated by transglutaminase. , 2008, Advanced drug delivery reviews.

[48]  Zheng Yuan,et al.  Quantifying the relationship of protein burying depth and sequence , 2007, Proteins.

[49]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[50]  Debasis Dash,et al.  Conformational flexibility may explain multiple cellular roles of PEST motifs , 2006, Proteins.

[51]  Vladimir N Uversky,et al.  Local Flexibility in Molecular Function Paradigm* , 2006, Molecular & Cellular Proteomics.

[52]  H. Dyson,et al.  Intrinsically unstructured proteins and their functions , 2005, Nature Reviews Molecular Cell Biology.

[53]  M. Gromiha,et al.  Role of structural and sequence information in the prediction of protein stability changes: comparison between buried and partially buried mutations. , 1999, Protein engineering.

[54]  Volker A. Eyrich,et al.  EVA: Large‐scale analysis of secondary structure prediction , 2001, Proteins.

[55]  R. Raz,et al.  ProMate: a structure based prediction program to identify the location of protein-protein binding sites. , 2004, Journal of molecular biology.

[56]  Jung-Ying Wang,et al.  SVM‐Cabins: Prediction of solvent accessibility using accumulation cutoff set and support vector machine , 2007, Proteins.

[57]  Oliviero Carugo,et al.  Atom depth as a descriptor of the protein interior. , 2003, Biophysical journal.

[58]  Roland L. Dunbrack,et al.  Assessment of disorder predictions in CASP6 , 2005, Proteins.

[59]  M Vihinen,et al.  Relationship of protein flexibility to thermostability. , 1987, Protein engineering.

[60]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[61]  Jenn-Kang Hwang,et al.  On the relationship between the protein structure and protein dynamics , 2008, Proteins.

[62]  Adam Godzik,et al.  Between order and disorder in protein structures: analysis of "dual personality" fragments in proteins. , 2007, Structure.

[63]  R. Varadarajan,et al.  Residue depth: a novel parameter for the analysis of protein structure and stability. , 1999, Structure.

[64]  Modesto Orozco,et al.  A consensus view of protein dynamics , 2007, Proceedings of the National Academy of Sciences.

[65]  András Kocsor,et al.  ROC analysis: applications to the classification of biological sequences and 3D structures , 2008, Briefings Bioinform..

[66]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[67]  G. Dodson,et al.  Protein flexibility: its role in structure and mechanism revealed by molecular simulations , 2006, Cellular and Molecular Life Sciences CMLS.

[68]  Ruhong Zhou,et al.  Destruction of long-range interactions by a single mutation in lysozyme , 2007, Proceedings of the National Academy of Sciences.

[69]  Lukasz A. Kurgan,et al.  Sequence based residue depth prediction using evolutionary information and predicted secondary structure , 2008, BMC Bioinformatics.

[70]  M. Vihinen,et al.  Accuracy of protein flexibility predictions , 1994, Proteins.

[71]  B. Rost,et al.  Conservation and prediction of solvent accessibility in protein families , 1994, Proteins.

[72]  B. Halle,et al.  Flexibility and packing in proteins , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[73]  A. D. McLachlan,et al.  Solvation energy in protein folding and binding , 1986, Nature.

[74]  W A Hendrickson,et al.  Influence of solvent accessibility and intermolecular contacts on atomic mobilities in hemerythrins. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[75]  P. Karplus,et al.  Prediction of chain flexibility in proteins , 1985, Naturwissenschaften.

[76]  J M Berger,et al.  Divalent metal cofactor binding in the kinetic folding trajectory of Escherichia coli ribonuclease HI , 2000, Protein science : a publication of the Protein Society.

[77]  Torsten Schwede,et al.  Assessment of disorder predictions in CASP7 , 2007, Proteins.

[78]  Zheng Yuan,et al.  Prediction of protein B‐factor profiles , 2005, Proteins.

[79]  Sonia Longhi,et al.  A practical overview of protein disorder prediction methods , 2006, Proteins.