CRASP: a program for analysis of coordinated substitutions in multiple alignments of protein sequences

Recent results suggest that, during evolution, certain substitutions at protein sites may occur in a coordinated manner because of interactions between amino acid residues. Information on these coordinated substitutions may be useful for the analysis of protein structure and function. CRASP is a web-accessible software tool for detecting and analyzing coordinated substitutions in multiple alignments of protein sequences. The approach is based on estimating the correlation coefficient between the values of a physicochemical parameter at a pair of alignment positions. The program enables the user to detect and analyze pairwise relationships between amino acid substitutions at protein sequence positions, and to estimate the contribution of coordinated substitutions to the evolutionary invariance or variability of integral physicochemical characteristics of a protein, such as the net charge of its residues and the hydrophobic core volume. The CRASP program is available at http://wwwmgs.bionet.nsc.ru/mgs/programs/crasp/.
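The core idea, a correlation coefficient between physicochemical values at two alignment positions, can be illustrated with a minimal sketch. This is not the CRASP implementation: the simplified charge scale, the toy alignment, and the function names `column_values` and `pearson` are assumptions made for illustration, and any refinements the program itself may apply are left out.

```python
from math import sqrt

# Assumed, simplified side-chain charge scale at neutral pH (illustrative only).
# Residues not listed, and gap characters, are treated as neutral (0.0).
CHARGE = {"K": 1.0, "R": 1.0, "D": -1.0, "E": -1.0}


def column_values(alignment, pos, scale=CHARGE):
    """Map the residues in one alignment column to parameter values."""
    return [scale.get(seq[pos], 0.0) for seq in alignment]


def pearson(x, y):
    """Plain Pearson correlation; returns None if either column is invariant."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return None  # correlation is undefined for a constant column
    return sxy / sqrt(sxx * syy)


# Toy alignment (hypothetical data): columns 1 and 3 swap opposite charges.
alignment = ["AKLD", "ADLK", "AKLD", "AELR", "ARLE", "ADLK"]

col1 = column_values(alignment, 1)
col3 = column_values(alignment, 3)
r = pearson(col1, col3)

# Net charge over the two positions, per sequence.
pair_charge = [a + b for a, b in zip(col1, col3)]
```

In this toy case the two columns are perfectly anticorrelated, so the summed charge of the pair is the same in every sequence. That is the kind of compensation that would keep an integral characteristic such as net charge invariant even though each position varies.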
