Library of Disordered Patterns in 3D Protein Structures

Intrinsically disordered regions serve as molecular recognition elements and play an important role in the control of many cellular processes and signaling pathways, so it is useful to be able to predict their positions in protein chains. We performed a statistical analysis of disordered residues in 34,464 unique protein chains taken from the PDB. In this set, 4.95% of residues are disordered (i.e., invisible in X-ray structures). The statistics were obtained separately for the N- and C-termini and for the central part of the protein chain. The frequencies with which each of the 20 residue types occurs as disordered at the termini differ from those in the middle part of the chain. Our systematic analysis of disordered regions in the PDB revealed 109 disordered patterns of different lengths. Each pattern has disordered occurrences in at least five protein chains with less than 20% identity to one another, and the vast majority of all occurrences of each pattern are disordered. The library of disordered patterns can therefore be used to predict whether a given residue of a protein is ordered or disordered. We also analyzed the occurrence of the selected patterns in three eukaryotic and three bacterial proteomes.
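The pattern-based prediction described above can be illustrated with a minimal sketch. It assumes that each library entry is a plain amino acid string and that every residue covered by an exact occurrence of a pattern is predicted to be disordered; the abstract does not specify the matching rules, so both points are assumptions. The function name `predict_disorder` and the entries in `TOY_PATTERNS` are hypothetical placeholders, not the 109 patterns of the library.

```python
def predict_disorder(sequence, patterns):
    """Return one flag per residue: 'D' (predicted disordered) or 'O' (ordered).

    Assumption: a residue is flagged 'D' if it lies inside at least one
    exact occurrence of any pattern; overlapping occurrences are allowed.
    """
    flags = ["O"] * len(sequence)
    for pattern in patterns:
        if not pattern:
            continue  # skip empty patterns
        start = sequence.find(pattern)
        while start != -1:
            for i in range(start, start + len(pattern)):
                flags[i] = "D"
            # Step by one residue so overlapping occurrences are also found.
            start = sequence.find(pattern, start + 1)
    return "".join(flags)


# Hypothetical placeholder patterns, for illustration only.
TOY_PATTERNS = ["HHHHHH", "GSGS"]

if __name__ == "__main__":
    seq = "MKHHHHHHHAVLGSGSK"
    print(seq)
    print(predict_disorder(seq, TOY_PATTERNS))  # OODDDDDDDOOODDDDO
```

Exact matching keeps the sketch short; patterns with variable positions would need a different representation, such as regular expressions.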
