Prediction of disulfide connectivity in proteins

MOTIVATION A major problem in protein structure prediction is the correct location of disulfide bridges in cysteine-rich proteins. In protein-folding prediction, the location of disulfide bridges can strongly reduce the search in the conformational space. Therefore the correct prediction of the disulfide connectivity starting from the protein residue sequence may also help in predicting its 3D structure. RESULTS In this paper we equate the problem of predicting the disulfide connectivity in proteins to a problem of finding the graph matching with the maximum weight. The graph vertices are the residues of cysteine-forming disulfide bridges, and the weight edges are contact potentials. In order to solve this problem we develop and test different residue contact potentials. The best performing one, based on the Edmonds-Gabow algorithm and Monte-Carlo simulated annealing reaches an accuracy significantly higher than that obtained with a general mean force contact potential. Significantly, in the case of proteins with four disulfide bonds in the structure, the accuracy is 17 times higher than that of a random predictor. The method presented here can be used to locate putative disulfide bridges in protein-folding. AVAILABILITY The program is available upon request from the authors. CONTACT Casadio@alma.unibo.it; Piero@biocomp.unibo.it.

[1]  E. Shakhnovich,et al.  What can disulfide bonds tell us about protein energetics, function and folding: simulations and bioninformatics analysis. , 2000, Journal of molecular biology.

[2]  Steven M. Muskal,et al.  Prediction of the disulfide-bonding state of cysteine in proteins. , 1990, Protein engineering.

[3]  H. Scheraga,et al.  Disulfide bonds and protein folding. , 2000, Biochemistry.

[4]  J. Skolnick,et al.  MONSSTER: a method for folding globular proteins with a small number of distance restraints. , 1997, Journal of molecular biology.

[5]  William J. Cook,et al.  Combinatorial optimization , 1997 .

[6]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL , 1997, Nucleic Acids Res..

[7]  M. Sternberg,et al.  Analysis and classification of disulphide connectivity in proteins. The entropic effect of cross-linkage. , 1994, Journal of molecular biology.

[8]  Kenneth Steiglitz,et al.  Combinatorial Optimization: Algorithms and Complexity , 1981 .

[9]  E. Huang,et al.  Ab initio fold prediction of small helical proteins using distance geometry and knowledge-based scoring functions. , 1999, Journal of molecular biology.

[10]  A. Fiser,et al.  Different sequence environments of cysteines and half cystines in proteins Application to predict disulfide forming residues , 1992, FEBS letters.

[11]  Harold N Gabow,et al.  An Efficient Implementation of Edmonds' Algorithm for Maximum Weight Matching on Graphs ; CU-CS-075-75 , 1975 .

[12]  András Fiser,et al.  Predicting the oxidation state of cysteines by multiple sequence alignment , 2000, Bioinform..

[13]  Jack Edmonds,et al.  Maximum matching and a polyhedron with 0,1-vertices , 1965 .

[14]  P Fariselli,et al.  Role of evolutionary information in predicting the disulfide‐bonding state of cysteine in proteins , 1999, Proteins.

[15]  T. Creighton Proteins: Structures and Molecular Properties , 1986 .

[16]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[17]  L A Mirny,et al.  How to derive a protein folding potential? A new approach to an old problem. , 1996, Journal of molecular biology.

[18]  R. Durbin,et al.  Biological sequence analysis: Background on probability , 1998 .