Between order and disorder in protein structures: analysis of "dual personality" fragments in proteins.

In their natural environment, three-dimensional structures of proteins undergo significant fluctuations and are often partially or completely disordered. This phenomenon recently became the focus of much attention, as many proteins, especially from higher organisms, were shown to contain large intrinsically disordered regions. Such disordered regions may become ordered only under very specific circumstances, if at all, and can be recognized by specific amino acid composition and sequence signatures. Here, we suggest that the balance between order and disorder is much more subtle in that many regions are very close to the order/disorder boundary. Specifically, analysis of redundant sets of experimental models of protein structures, where emphasis is put on comparison of structures of identical proteins solved in different conditions and functional states, shows hundreds of fragments captured in two states: ordered and disordered. We show that such fragments, which we call here "dual personality" (DP) fragments, have distinctive features that differentiate them from both regularly folded and intrinsically disordered fragments. We hypothesize, and show on several examples, that such fragments are often targets of regulation, either by allostery or posttranslational modifications.

[1]  P. Tompa Intrinsically unstructured proteins. , 2002, Trends in biochemical sciences.

[2]  István Simon,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm035 Structural bioinformatics Local structural disorder imparts plasticity on linear motifs , 2022 .

[3]  H. Dyson,et al.  Intrinsically unstructured proteins and their functions , 2005, Nature Reviews Molecular Cell Biology.

[4]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[5]  Chun-Houh Chen GENERALIZED ASSOCIATION PLOTS: INFORMATION VISUALIZATION VIA ITERATIVELY GENERATED CORRELATION MATRICES , 2002 .

[6]  R. Huber,et al.  Flexibility and rigidity, requirements for the function of proteins and protein pigment complexes. Eleventh Keilin memorial lecture. , 1987, Biochemical Society transactions.

[7]  Sonia Longhi,et al.  A practical overview of protein disorder prediction methods , 2006, Proteins.

[8]  Ruth Nussinov,et al.  Analysis of ordered and disordered protein complexes reveals structural features discriminating between stable and unstable monomers. , 2004, Journal of molecular biology.

[9]  M. Vihinen,et al.  Accuracy of protein flexibility predictions , 1994, Proteins.

[10]  V. Uversky Intrinsically Disordered Proteins , 2000 .

[11]  D. Baker,et al.  Prediction of local structure in proteins using a library of sequence-structure motifs. , 1998, Journal of molecular biology.

[12]  A. Keith Dunker,et al.  Intrinsic Disorder in the Protein Data Bank , 2007, Journal of biomolecular structure & dynamics.

[13]  Thomas L. Madden,et al.  BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. , 1999, FEMS microbiology letters.

[14]  Adam Godzik,et al.  Clustering of highly homologous sequences to reduce the size of large protein databases , 2001, Bioinform..

[15]  A Keith Dunker,et al.  Characterization of molecular recognition features, MoRFs, and their binding partners. , 2007, Journal of proteome research.

[16]  J. Porath,et al.  Metal chelate affinity chromatography, a new approach to protein fractionation , 1975, Nature.

[17]  Amos Bairoch,et al.  ScanProsite: a reference implementation of a PROSITE scanning tool. , 2002, Applied bioinformatics.

[18]  M. Conrad Bootstrapping on the adaptive landscape. , 1979, Bio Systems.

[19]  Roland L. Dunbrack,et al.  Assessment of disorder predictions in CASP6 , 2005, Proteins.

[20]  Zoran Obradovic,et al.  DisProt: a database of protein disorder , 2005, Bioinform..

[21]  Zoran Obradovic,et al.  DisProt: the Database of Disordered Proteins , 2006, Nucleic Acids Res..

[22]  P Argos,et al.  Analysis of sequence-similar pentapeptides in unrelated protein tertiary structures. Strategies for protein folding and a guide for site-directed mutagenesis. , 1987, Journal of molecular biology.

[23]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[24]  Jun-tao Guo,et al.  Analysis of chameleon sequences and their implications in biological processes , 2007, Proteins.