论文信息 - Prediction of disordered regions in proteins from position specific score matrices

Prediction of disordered regions in proteins from position specific score matrices

We describe here the results of using a neural network based method (DISOPRED) for predicting disordered regions in 55 proteins in the 5th CASP experiment. A set of 715 highly resolved proteins with regions of disorder was used to train the network. The inputs to the network were derived from sequence profiles generated by PSI‐BLAST. A post‐filter was applied to the output of the network to prevent regions being predicted as disordered in regions of confidently predicted alpha helix or beta sheet structure. The overall two‐state prediction accuracy for the method is very high (90%) but this is highly skewed by the fact that most residues are observed to be ordered. The overall Matthews' correlation coefficient for the submitted predictions is 0.34, which gives a more realistic impression of the overall accuracy of the method, though still indicates significant predictive power. Proteins 2003;53:573–578. © 2003 Wiley‐Liss, Inc.

David T. Jones | David C. Jones | J. J. Ward

[1] B. Matthews. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[2] Kevin Struhl,et al. Folding transition in the DMA-binding domain of GCN4 on specific binding to DNA , 1990, Nature.

[3] B. Rost,et al. Prediction of protein secondary structure at better than 70% accuracy. , 1993, Journal of molecular biology.

[4] Thomas L. Madden,et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[5] A.K. Dunker,et al. Identifying disordered regions in proteins from amino acid sequence , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[6] Obradovic,et al. Predicting Protein Disorder for N-, C-, and Internal Regions. , 1999, Genome informatics. Workshop on Genome Informatics.

[7] H. Dyson,et al. Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. , 1999, Journal of molecular biology.

[8] D T Jones,et al. Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[9] T. N. Bhat,et al. The Protein Data Bank , 2000, Nucleic Acids Res..

[10] Christopher J. Oldfield,et al. Evolutionary Rate Heterogeneity in Proteins with Long Disordered Regions , 2002, Journal of Molecular Evolution.