SynDB: a Synapse protein DataBase based on synapse ontology

A synapse is the junction across which a nerve impulse passes from an axon terminal to a neuron, muscle cell or gland cell. The functions and constituent molecules of the synapse are essential to almost all neurobiological processes. To describe synaptic structures and functions, we have developed the Synapse Ontology (SynO), a hierarchical representation that includes 177 terms with hundreds of synonyms and branches up to eight levels deep. We associated 125 additional protein keywords and 109 InterPro domains with these SynO terms. Using a combination of automated keyword searches, domain searches and manual curation, we collected 14 000 non-redundant synapse-related proteins, including 3000 in human. We extensively annotated the proteins with information about sequence, structure, function, expression, pathways, interactions and disease associations, and with hyperlinks to external databases. The data are stored and presented in the Synapse protein DataBase (SynDB). SynDB can be browsed interactively by SynO, Gene Ontology (GO), domain families, species, chromosomal locations or Tribe-MCL clusters. It can also be searched by text (including Boolean operators) or by sequence similarity. SynDB is the most comprehensive database to date for synaptic proteins.
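The automated part of the collection step (keyword search plus domain search, followed by manual curation) can be illustrated with a minimal sketch. It is not the SynDB implementation: the `Protein` record, the `find_candidates` function, the accessions, and the `IPR_EX_*` domain identifiers are all hypothetical placeholders, and the keyword match is a plain case-insensitive substring test chosen for brevity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Protein:
    """Hypothetical minimal protein record used only for this illustration."""
    accession: str
    description: str
    domains: frozenset = frozenset()  # domain identifiers annotated on the protein


def find_candidates(proteins, keywords, domains):
    """Return {accession: evidence list} for proteins hit by a keyword or a domain.

    Proteins with no hit are left out. Every hit is only a candidate that
    would still need manual curation before being accepted.
    """
    candidates = {}
    for protein in proteins:
        text = protein.description.lower()
        # Keyword search: case-insensitive substring match on the description.
        evidence = sorted(f"keyword:{k}" for k in keywords if k.lower() in text)
        # Domain search: overlap between annotated domains and the target set.
        evidence += sorted(f"domain:{d}" for d in protein.domains & domains)
        if evidence:
            candidates[protein.accession] = evidence
    return candidates


# Placeholder inputs (assumed for illustration, not taken from SynDB).
example_proteins = [
    Protein("P_EX1", "Synaptic vesicle glycoprotein", frozenset({"IPR_EX_A"})),
    Protein("P_EX2", "Uncharacterized protein", frozenset({"IPR_EX_B"})),
    Protein("P_EX3", "Ribosomal protein"),
]
candidates = find_candidates(
    example_proteins,
    keywords={"synaptic vesicle"},
    domains={"IPR_EX_B"},
)
for accession, evidence in candidates.items():
    print(accession, evidence)
```

Keeping the evidence for each hit, rather than a bare yes/no flag, is a design choice made here so that a curator could see why a protein was nominated.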
