seeMotif: exploring and visualizing sequence motifs in 3D structures

Sequence motifs are important in the study of molecular biology. Motif discovery tools efficiently deliver many function related signatures of proteins and largely facilitate sequence annotation. As increasing numbers of motifs are detected experimentally or predicted computationally, characterizing the functional roles of motifs and identifying the potential synergetic relationships between them are important next steps. A good way to investigate novel motifs is to utilize the abundant 3D structures that have also been accumulated at an astounding rate in recent years. This article reports the development of the web service seeMotif, which provides users with an interactive interface for visualizing sequence motifs on protein structures from the Protein Data Bank (PDB). Researchers can quickly see the locations and conformation of multiple motifs among a number of related structures simultaneously. Considering the fact that PDB sequences are usually shorter than those in sequence databases and/or may have missing residues, seeMotif has two complementary approaches for selecting structures and mapping motifs to protein chains in structures. As more and more structures belonging to previously uncharacterized protein families become available, combining sequence and structure information gives good opportunities to facilitate understanding of protein functions in large-scale genome projects. Available at: http://seemotif.csie.ntu.edu.tw,http://seemotif.ee.ncku.edu.tw or http://seemotif.csbb.ntu.edu.tw.

[1]  Richard J. Edwards,et al.  CompariMotif: quick and easy comparisons of sequence motifs , 2008, Bioinform..

[2]  T. Gibson,et al.  Systematic Discovery of New Recognition Peptides Mediating Protein Interaction Networks , 2005, PLoS biology.

[3]  Lin Lu,et al.  3MATRIX and 3MOTIF: a protein structure visualization system for conserved sequence motifs , 2003, Nucleic Acids Res..

[4]  Amos Bairoch,et al.  The PROSITE database , 2005, Nucleic Acids Res..

[5]  Richard J. Edwards,et al.  SLiMFinder: A Probabilistic Method for Identifying Over-Represented, Convergently Evolved, Short Linear Motifs in Proteins , 2007, PloS one.

[6]  Baw-Jhiune Liu,et al.  MAGIIC-PRO: detecting functional signatures by efficient discovery of long patterns in protein sequences , 2006, Nucleic Acids Res..

[7]  Rodrigo Lopez,et al.  A tree-based conservation scoring method for short linear motifs in multiple alignments of protein sequences , 2008, BMC Bioinformatics.

[8]  Adel Golovin,et al.  MSDmotif: exploring protein sites and motifs , 2008, BMC Bioinformatics.

[9]  The UniProt Consortium,et al.  The Universal Protein Resource (UniProt) 2009 , 2008, Nucleic Acids Res..

[10]  Richard J. Edwards,et al.  The SLiMDisc server: short, linear motif discovery in proteins , 2007, Nucleic Acids Res..

[11]  Terri K. Attwood,et al.  Motif3D: relating protein sequence motifs to 3D structure , 2003, Nucleic Acids Res..

[12]  Michael B. Yaffe,et al.  Scansite 2.0: proteome-wide prediction of cell signaling interactions using short sequence motifs , 2003, Nucleic Acids Res..

[13]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[14]  Thierry Langer,et al.  The Protein Data Bank (PDB), its related services and software tools as key components for in silico guided drug discovery. , 2008, Journal of medicinal chemistry.

[15]  Richard J. Edwards,et al.  Masking residues using context-specific evolutionary conservation significantly improves short linear motif discovery , 2009, Bioinform..

[16]  Charles Elkan,et al.  The Value of Prior Knowledge in Discovering Motifs with MEME , 1995, ISMB.

[17]  Terri K. Attwood,et al.  PRINTS and its automatic supplement, prePRINTS , 2003, Nucleic Acids Res..

[18]  Shmuel Pietrokovski,et al.  Increased coverage of protein families with the Blocks Database servers , 2000, Nucleic Acids Res..

[19]  Martin C. Frith,et al.  Discovering Sequence Motifs with Arbitrary Insertions and Deletions , 2008, PLoS Comput. Biol..

[20]  Richard J. Edwards,et al.  SLiMDisc: short, linear motif discovery, correcting for common evolutionary descent , 2006, Nucleic acids research.

[21]  Inge Jonassen,et al.  Efficient discovery of conserved patterns using a pattern graph , 1997, Comput. Appl. Biosci..

[22]  Aris Floratos,et al.  Combinatorial pattern discovery in biological sequences: The TEIRESIAS algorithm [published erratum appears in Bioinformatics 1998;14(2): 229] , 1998, Bioinform..

[23]  R. Russell,et al.  Linear motifs: Evolutionary interaction switches , 2005, FEBS letters.

[24]  Nir Ben-Tal,et al.  QuasiMotiFinder: protein annotation by searching for evolutionarily conserved motif-like patterns , 2005, Nucleic Acids Res..

[25]  Leszek Rychlewski,et al.  ELM server: a new resource for investigating short functional sites in modular eukaryotic proteins , 2003, Nucleic Acids Res..

[26]  G. K. Sandve,et al.  A survey of motif discovery methods in an integrated framework , 2006, Biology Direct.

[27]  D. Higgins,et al.  Finding flexible patterns in unaligned protein sequences , 1995, Protein science : a publication of the Protein Society.

[28]  Darby Tien-Hao Chang,et al.  E1DS: catalytic site prediction based on 1D signatures of concurrent conservation , 2008, Nucleic Acids Res..

[29]  Richard J. Edwards,et al.  Bioinformatic discovery of novel bioactive peptides. , 2007, Nature chemical biology.

[30]  Baw-Jhiune Liu,et al.  Identification of hot regions in protein-protein interactions by sequential pattern mining , 2007, BMC Bioinformatics.

[31]  Alejandro Panjkovich,et al.  3did Update: domain–domain and peptide-mediated interactions of known 3D structure , 2008, Nucleic Acids Res..

[32]  M. Kimmel,et al.  Conflict of interest statement. None declared. , 2010 .

[33]  Heinrich Sticht,et al.  A computational strategy for the prediction of functional linear peptide motifs in proteins , 2007, Bioinform..

[34]  Sanguthevar Rajasekaran,et al.  Minimotif miner 2nd release: a database and web system for motif search , 2008, Nucleic Acids Res..

[35]  Patrick Aloy,et al.  Contextual Specificity in Peptide-Mediated Protein Interactions , 2008, PloS one.