Resolving the ambiguity: Making sense of intrinsic disorder when PDB structures disagree

Missing regions in X‐ray crystal structures in the Protein Data Bank (PDB) have played a foundational role in the study of intrinsically disordered protein regions (IDPRs), especially in the development of in silico predictors of intrinsic disorder. However, a missing region is only a weak indication of intrinsic disorder, and this uncertainty is compounded by the presence of ambiguous regions, where more than one structure of the same protein sequence “disagrees” in terms of the presence or absence of missing residues. The question is this: are these ambiguous regions intrinsically disordered, or are they the result of static disorder that arises from experimental conditions, ensembles of structures, or domain wobbling? A novel way of looking at ambiguous regions in terms of the pattern between multiple PDB structures has been demonstrated. It was found that the propensity for intrinsic disorder increases as the level of ambiguity decreases. However, it is also shown that ambiguity is more likely to occur as the protein region is placed within different environmental conditions, and even the most ambiguous regions as a set display compositional bias that suggests flexibility. The results suggested that ambiguity is a natural result for many IDPRs crystallized under different conditions and that static disorder and wobbling domains are relatively rare. Instead, it is more likely that ambiguity arises because many of these regions were conditionally or partially disordered.

[1]  Giuseppe Biagini,et al.  Glia–neuron interactions: Neurosteroids , 2010 .

[2]  A Keith Dunker,et al.  Characterization of molecular recognition features, MoRFs, and their binding partners. , 2007, Journal of proteome research.

[3]  Gert Vriend,et al.  A series of PDB related databases for everyday needs , 2010, Nucleic Acids Res..

[4]  Zsuzsanna Dosztányi,et al.  IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content , 2005, Bioinform..

[5]  U. Jakob,et al.  Conditional disorder in chaperone action. , 2012, Trends in biochemical sciences.

[6]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[7]  M. Vihinen,et al.  Accuracy of protein flexibility predictions , 1994, Proteins.

[8]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.

[9]  Vladimir N. Uversky,et al.  Conditionally and transiently disordered proteins: awakening cryptic disorder to regulate protein function. , 2014, Chemical reviews.

[10]  A. Dunker,et al.  Predicting intrinsic disorder in proteins: an overview , 2009, Cell Research.

[11]  Domenico Cozzetto,et al.  The contribution of intrinsic disorder prediction to the elucidation of protein function. , 2013, Current opinion in structural biology.

[12]  P. Tompa,et al.  The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins. , 2005, Journal of molecular biology.

[13]  Tom Lenaerts,et al.  The DynaMine webserver: predicting protein dynamics from sequence , 2014, Nucleic Acids Res..

[14]  H. Dyson,et al.  Intrinsically disordered proteins in cellular signalling and regulation , 2014, Nature Reviews Molecular Cell Biology.

[15]  Adam Godzik,et al.  Between order and disorder in protein structures: analysis of "dual personality" fragments in proteins. , 2007, Structure.

[16]  Christopher J. Oldfield,et al.  Classification of Intrinsically Disordered Regions and Proteins , 2014, Chemical reviews.

[17]  W. Bennett,et al.  Structural and functional aspects of domain motions in proteins. , 1984, CRC critical reviews in biochemistry.

[18]  Lukasz Kurgan,et al.  High-throughput prediction of RNA, DNA and protein binding regions mediated by intrinsic disorder , 2015, Nucleic acids research.

[19]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[20]  Obradovic,et al.  Predicting Disordered Regions from Amino Acid Sequence: Common Themes Despite Differing Structural Characterization. , 1998, Genome informatics. Workshop on Genome Informatics.

[21]  S. Vucetic,et al.  Flavors of protein disorder , 2003, Proteins.

[22]  A Keith Dunker,et al.  Molecular recognition features (MoRFs) in three domains of life. , 2016, Molecular bioSystems.

[23]  Zoran Obradovic,et al.  DisProt: the Database of Disordered Proteins , 2006, Nucleic Acids Res..

[24]  Jinku Bao,et al.  An Overview of Predictors for Intrinsically Disordered Proteins over 2010–2014 , 2015, International journal of molecular sciences.

[25]  Zsuzsanna Dosztányi,et al.  ANCHOR: web server for predicting protein binding regions in disordered proteins , 2009, Bioinform..

[26]  A Keith Dunker,et al.  Utilization of protein intrinsic disorder knowledge in structural proteomics. , 2013, Biochimica et biophysica acta.

[27]  María Martín,et al.  Activities at the Universal Protein Resource (UniProt) , 2013, Nucleic Acids Res..

[28]  Yaoqi Zhou,et al.  Intrinsically Semi-disordered State and Its Role in Induced Folding and Protein Aggregation , 2013, Cell Biochemistry and Biophysics.

[29]  Zsuzsanna Dosztányi,et al.  Prediction of Protein Binding Regions in Disordered Proteins , 2009, PLoS Comput. Biol..

[30]  Tom Lenaerts,et al.  From protein sequence to dynamics and disorder with DynaMine , 2013, Nature Communications.

[31]  Maria Jesus Martin,et al.  SIFTS: Structure Integration with Function, Taxonomy and Sequences resource , 2012, Nucleic Acids Res..

[32]  M. Y. Lobanov,et al.  Disordered Patterns in Clustered Protein Data Bank and in Eukaryotic and Bacterial Proteomes , 2011, PloS one.

[33]  A. Keith Dunker,et al.  Intrinsic Disorder in the Protein Data Bank , 2007, Journal of biomolecular structure & dynamics.

[34]  Christopher J. Oldfield,et al.  Showing your ID: intrinsic disorder as an ID for recognition, regulation and cell signaling , 2005, Journal of molecular recognition : JMR.

[35]  Silvio C. E. Tosatto,et al.  Comprehensive large-scale assessment of intrinsic protein disorder , 2015, Bioinform..

[36]  Vladimir Vacic,et al.  Composition Profiler: a tool for discovery and visualization of amino acid composition differences , 2007, BMC Bioinformatics.

[37]  Silvio C. E. Tosatto,et al.  ESpritz: accurate and fast prediction of protein disorder , 2012, Bioinform..

[38]  J. Richardson,et al.  A high resolution structure of an inhibitor complex of the extracellular nuclease of Staphylococcus aureus. I. Experimental procedures and chain tracing. , 1971, The Journal of biological chemistry.

[39]  Anna Tramontano,et al.  Assessment of protein disorder region predictions in CASP10 , 2014, Proteins.