On the relationship between sequence and structure similarities in proteomics

MOTIVATION The underlying assumption of many sequence-based comparative studies in proteomics is that different aspects of protein structure and therefore functionality may be linked to particular sequence motifs. This holds true if sequence similarity is sufficiently high, but in general the relationship between protein sequence and structure appears complex and is not well understood. RESULTS Statistical analysis of multiple and pairwise structural alignments of protein structures within SCOP folds is performed. The results indicate that multiple conservation of residue identity is not common and that relationship between sequence and structure may be explained by a model based on the assumption that protein structure is tolerant to residue substitutions preserving hydropathic profile of the sequence. This model also explains the origin and specific value of the sequence similarity threshold, noticed in many previous studies, below which structural resemblance is not statistically expected.

[1]  Kim Henrick,et al.  Multiple Alignment of Protein Structures in Three Dimensions , 2005, CompLife.

[2]  T. Sixma,et al.  Crystal Structure of Acetylcholine-binding Protein from Bulinus truncatus Reveals the Conserved Structural Scaffold and Sites of Variation in Nicotinic Acetylcholine Receptors* , 2005, Journal of Biological Chemistry.

[3]  Rachel Kolodny,et al.  Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures. , 2005, Journal of molecular biology.

[4]  K Henrick,et al.  Electronic Reprint Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions , 2022 .

[5]  Rolf Apweiler,et al.  UniProt archive , 2004, Bioinform..

[6]  Akira R. Kinjo,et al.  Eigenvalue analysis of amino acid substitution matrices reveals a sharp transition of the mode of sequence conservation in proteins , 2004, Bioinform..

[7]  Philip E. Bourne,et al.  CE-MC: a multiple protein structure alignment server , 2004, Nucleic Acids Res..

[8]  Xiu-fen Lei,et al.  Measurement of DNA mismatch repair activity in live cells. , 2004, Nucleic acids research.

[9]  H. Wolfson,et al.  Multiple structural alignment by secondary structures: Algorithm and applications , 2003, Protein science : a publication of the Protein Society.

[10]  James E. Bray,et al.  The CATH database: an extended protein family resource for structural and functional genomics , 2003, Nucleic Acids Res..

[11]  T. Sixma,et al.  Crystal structure of an ACh-binding protein reveals the ligand-binding domain of nicotinic receptors , 2001, Nature.

[12]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[13]  B. Rost Twilight zone of protein sequence alignments. , 1999, Protein engineering.

[14]  Tim J. P. Hubbard,et al.  SCOP: a structural classification of proteins database , 1998, Nucleic Acids Res..

[15]  Dan Gusfield,et al.  Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[16]  C Sander,et al.  Mapping the Protein Universe , 1996, Science.

[17]  H. Wolfson,et al.  Amino acid pair interchanges at spatially conserved locations. , 1996, Journal of molecular biology.

[18]  C. Pace,et al.  Forces contributing to the conformational stability of proteins , 1996, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[19]  C. Chothia,et al.  Understanding protein structure: using scop for fold interpretation. , 1996, Methods in enzymology.

[20]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[21]  R. Sauer,et al.  Are buried salt bridges important for protein stability and conformational specificity? , 1995, Nature Structural Biology.

[22]  S. Betz Disulfide bonds and the stability of globular proteins , 1993, Protein science : a publication of the Protein Society.

[23]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[24]  C. Chothia One thousand families for the molecular biologist , 1992, Nature.

[25]  C. Sander,et al.  GTPase domains of ras p21 oncogene protein and elongation factor Tu: analysis of three-dimensional structures, sequence families, and functional sites. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[26]  L Serrano,et al.  Aromatic-aromatic interactions and protein stability. Investigation by double-mutant cycles. , 1991, Journal of molecular biology.

[27]  T L Blundell,et al.  Comparison of solvent-inaccessible cores of homologous proteins: definitions useful for protein modelling. , 1987, Protein engineering.

[28]  A. Lesk,et al.  The relation between the divergence of sequence and structure in proteins. , 1986, The EMBO journal.

[29]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.