Predicting disulfide bond connectivity in proteins by correlated mutations analysis

MOTIVATION Prediction of disulfide bond connectivity facilitates structural and functional annotation of proteins. Previous studies suggest that cysteines of a disulfide bond mutate in a correlated manner. RESULTS We developed a method that analyzes correlated mutation patterns in multiple sequence alignments in order to predict disulfide bond connectivity. Proteins with known experimental structures and varying numbers of disulfide bonds, and that spanned various evolutionary distances, were aligned. We observed frequent variation of disulfide bond connectivity within members of the same protein families, and it was also observed that in 99% of the cases, cysteine pairs forming non-conserved disulfide bonds mutated in concert. Our data support the notion that substitution of a cysteine in a disulfide bond prompts the substitution of its cysteine partner and that oxidized cysteines appear in pairs. The method we developed predicts disulfide bond connectivity patterns with accuracies of 73, 69 and 61% for proteins with two, three and four disulfide bonds, respectively.

[1]  András Fiser,et al.  MMM: a sequence-to-structure alignment protocol , 2006, Bioinform..

[2]  Cheng-Yan Kao,et al.  Improving disulfide connectivity prediction with sequential distance between oxidized cysteines , 2005, Bioinform..

[3]  P Tufféry,et al.  Predicting the disulfide bonding state of cysteines using protein descriptors , 2002, Proteins.

[4]  V. Buchner,et al.  Paired natural cysteine mutation mapping: Aid to constraining models of protein tertiary structure , 1995, Protein science : a publication of the Protein Society.

[5]  Cheng-Yan Kao,et al.  Cysteine separations profiles on protein sequences infer disulfide connectivity , 2005, Bioinform..

[6]  Jon Beckwith,et al.  Protein disulfide bond formation in prokaryotes. , 2003, Annual review of biochemistry.

[7]  Piero Fariselli,et al.  Prediction of disulfide connectivity in proteins , 2001, Bioinform..

[8]  Pierre Baldi,et al.  Large‐scale prediction of disulphide bridges using kernel methods, two‐dimensional recursive neural networks, and weighted graph matching , 2005, Proteins.

[9]  Jenn-Kang Hwang,et al.  Prediction of disulfide connectivity from protein sequences , 2005, Proteins.

[10]  P. Lyu,et al.  Relationship between protein structures and disulfide‐bonding patterns , 2003, Proteins.

[11]  H. Scheraga,et al.  Disulfide bonds and protein folding. , 2000, Biochemistry.

[12]  J. Thornton Disulphide bridges in globular proteins. , 1981, Journal of molecular biology.

[13]  András Fiser,et al.  Predicting redox state of cysteines in proteins. , 2002, Methods in enzymology.

[14]  Stefan M. Larson,et al.  Analysis of covariation in an SH3 domain sequence alignment: applications in tertiary contact prediction and the design of compensating hydrophobic core substitutions. , 2000, Journal of molecular biology.

[15]  C. Sander,et al.  Correlated mutations and residue contacts in proteins , 1994, Proteins.

[16]  C. Sander,et al.  Correlated Mutations and Residue Contacts , 1994 .

[17]  Cheng-Yan Kao,et al.  Disulfide connectivity prediction with 70% accuracy using two‐level models , 2006, Proteins.

[18]  Jenn-Kang Hwang,et al.  Predicting disulfide connectivity patterns , 2007, Proteins.

[19]  Harold N. Gabow,et al.  An Efficient Implementation of Edmonds' Algorithm for Maximum Matching on Graphs , 1976, JACM.

[20]  Paolo Frasconi,et al.  Disulfide connectivity prediction using recursive neural networks and evolutionary information , 2004, Bioinform..

[21]  K. Burrage,et al.  Protein contact prediction using patterns of correlation , 2004, Proteins.

[22]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[23]  Harold A. Scheraga,et al.  Statistical mechanics of noncovalent bonds in polyamino acids. I–V , 1965 .

[24]  P. Hogg,et al.  Disulfide bonds as switches for protein function. , 2003, Trends in biochemical sciences.

[25]  M. Sternberg,et al.  Analysis and classification of disulphide connectivity in proteins. The entropic effect of cross-linkage. , 1994, Journal of molecular biology.

[26]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[27]  Peter Clote,et al.  Disulfide connectivity prediction using secondary structure information and diresidue frequencies , 2005, Bioinform..

[28]  Richard W. Aldrich,et al.  A perturbation-based method for calculating explicit likelihood of evolutionary co-variance in multiple sequence alignments , 2004, Bioinform..

[29]  David Eisenberg,et al.  Genomic evidence that the intracellular proteins of archaeal microbes contain disulfide bonds , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[30]  N. Grishin,et al.  Structural classification of small, disulfide-rich protein domains. , 2006, Journal of molecular biology.

[31]  Jenn-Kang Hwang,et al.  Prediction of the bonding states of cysteines Using the support vector machines based on multiple feature vectors and cysteine state sequences , 2004, Proteins.

[32]  E. Neher How frequent are correlated changes in families of protein sequences? , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Steven C Almo,et al.  T cell immunoglobulin mucin-3 crystal structure reveals a galectin-9-independent ligand-binding surface. , 2007, Immunity.

[34]  Steven M. Muskal,et al.  Prediction of the disulfide-bonding state of cysteine in proteins. , 1990, Protein engineering.

[35]  Juswinder Singh,et al.  A classification of disulfide patterns and its relationship to protein structure and function , 2004, Protein science : a publication of the Protein Society.

[36]  H. Scheraga,et al.  Statistical mechanics of noncovalent bonds in polyamino acids. VIII. Covalent loops in proteins , 1965 .

[37]  A. Fiser,et al.  Different sequence environments of cysteines and half cystines in proteins Application to predict disulfide forming residues , 1992, FEBS letters.

[38]  C. Sander,et al.  Can three-dimensional contacts in protein structures be predicted by analysis of correlated mutations? , 1994, Protein engineering.

[39]  M. Martí-Renom,et al.  Protein similarities beyond disulphide bridge topology. , 1998, Journal of molecular biology.

[40]  A. Konagurthu,et al.  MUSTANG: A multiple structural alignment algorithm , 2006, Proteins.

[41]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..

[42]  András Fiser,et al.  Predicting the oxidation state of cysteines by multiple sequence alignment , 2000, Bioinform..