SMS 2.0: An Updated Database to Study the Structural Plasticity of Short Peptide Fragments in Non-redundant Proteins

The function of a protein molecule is greatly influenced by its three-dimensional (3D) structure and therefore structure prediction will help identify its biological function. We have updated Sequence, Motif and Structure (SMS), the database of structurally rigid peptide fragments, by combining amino acid sequences and the corresponding 3D atomic coordinates of non-redundant (25%) and redundant (90%) protein chains available in the Protein Data Bank (PDB). SMS 2.0 provides information pertaining to the peptide fragments of length 5-14 residues. The entire dataset is divided into three categories, namely, same sequence motifs having similar, intermediate or dissimilar 3D structures. Further, options are provided to facilitate structural superposition using the program structural alignment of multiple proteins (STAMP) and the popular JAVA plug-in (Jmol) is deployed for visualization. In addition, functionalities are provided to search for the occurrences of the sequence motifs in other structural and sequence databases like PDB, Genome Database (GDB), Protein Information Resource (PIR) and Swiss-Prot. The updated database along with the search engine is available over the World Wide Web through the following URL http://cluster.physics.iisc.ernet.in/sms/.

[1]  A. Lesk,et al.  The relation between the divergence of sequence and structure in proteins. , 1986, The EMBO journal.

[2]  J M Thornton,et al.  Rebuilding flavodoxin from C alpha coordinates: a test study. , 1989, Proteins.

[3]  C. Deane,et al.  A novel exhaustive search algorithm for predicting the conformation of polypeptide segments in proteins , 2000, Proteins.

[4]  S. Wodak,et al.  Modelling the polypeptide backbone with 'spare parts' from known protein structures. , 1989, Protein engineering.

[5]  C. Sander,et al.  Database algorithm for generating protein backbone and side-chain co-ordinates from a C alpha trace application to model building and detection of co-ordinate errors. , 1991, Journal of molecular biology.

[6]  Hanah Margalit,et al.  Persistently conserved positions in structurally similar, sequence dissimilar proteins: Roles in preserving protein fold and function , 2002, Protein science : a publication of the Protein Society.

[7]  G. Barton,et al.  Multiple protein sequence alignment from tertiary structure comparison: Assignment of global and residue confidence levels , 1992, Proteins.

[8]  M. Levitt Accurate modeling of protein conformation by automatic segment matching. , 1992, Journal of molecular biology.

[9]  Philip E. Bourne,et al.  CKAAPs DB: a conserved key amino acid positions database , 2001, Nucleic Acids Res..

[10]  K. Sekar,et al.  An algorithm to find all identical internal sequence repeats , 2008 .

[11]  Paul E. Correa,et al.  The building of protein structures form α‐carbon coordinates , 1990 .

[12]  An algorithm to find similar internal sequence repeats , 2009 .

[13]  R. Lerner,et al.  Identical short peptide sequences in unrelated proteins can have different conformations: a testing ground for theories of immune recognition. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[14]  G. Verdine,et al.  Nucleotide-dependent Domain Movement in the ATPase Domain of a Human Type IIA DNA Topoisomerase* , 2005, Journal of Biological Chemistry.

[15]  P E Bourne,et al.  Conserved key amino acid positions (CKAAPs) derived from the analysis of common substructures in proteins , 2001, Proteins.

[16]  C Sander,et al.  On the use of sequence homologies to predict protein structure: identical pentapeptides can have completely different conformations. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Janet M. Thornton,et al.  Rebuilding flavodoxin from Cα coordinates: A test study , 1989 .

[18]  A. Lesk,et al.  How different amino acid sequences determine similar protein structures: the structure and evolutionary dynamics of the globins. , 1980, Journal of molecular biology.

[19]  Philip E. Bourne,et al.  CKAAPs DB: a Conserved Key Amino Acid Positions DataBase , 2002, Nucleic Acids Res..

[20]  A. Sali,et al.  Evolution and physics in comparative protein structure modeling. , 2002, Accounts of chemical research.

[21]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[22]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[23]  M Karplus,et al.  Modeling of globular proteins. A distance-based data search procedure for the construction of insertion/deletion regions and Pro----non-Pro mutations. , 1990, Journal of molecular biology.

[24]  Kanagaraj Sekar,et al.  SMS: Sequence, Motif and Structure - A Database on the Structural Rigidity of Peptide Fragments in Non-Redundant Proteins , 2006, Silico Biol..

[25]  Ronald M Levy,et al.  Have we seen all structures corresponding to short protein fragments in the Protein Data Bank? An update. , 2003, Protein engineering.

[26]  P E Correa,et al.  The building of protein structures from alpha-carbon coordinates. , 1990, Proteins.

[27]  T. A. Jones,et al.  Using known substructures in protein model building and crystallography. , 1986, The EMBO journal.