Collecting and harvesting biological data: the GPCRDB and NucleaRDB information systems

The amount of genomic and proteomic data that is entered each day into databases and the experimental literature is outstripping the ability of experimental scientists to keep pace. While generic databases derived from automated curation efforts are useful, most biological scientists tend to focus on a class or family of molecules and their biological impact. Consequently, there is a need for molecular class-specific or other specialized databases. Such databases collect and organize data around a single topic or class of molecules. If curated well, such systems are extremely useful as they allow experimental scientists to obtain a large portion of the available data most relevant to their needs from a single source. We are involved in the development of two such databases with substantial pharmacological relevance. These are the GPCRDB and NucleaRDB information systems, which collect and disseminate data related to G protein-coupled receptors and intra-nuclear hormone receptors, respectively. The GPCRDB was a pilot project aimed at building a generic molecular class-specific database capable of dealing with highly heterogeneous data. A first version of the GPCRDB project has been completed and it is routinely used by thousands of scientists. The NucleaRDB was started recently as an application of the concept for the generalization of this technology. The GPCRDB is available via the WWW at http://www.gpcr.org/7tm/ and the NucleaRDB at http://www.receptors.org/NR/.

[1]  Fabien Campagne,et al.  Visualisation and integration of G protein-coupled receptor related information help the modelling: Description and applications of the Viseur program , 1999, J. Comput. Aided Mol. Des..

[2]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL , 1997, Nucleic Acids Res..

[3]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[4]  A. IJzerman,et al.  TinyGRAP database: a bioinformatics tool to mine G-protein-coupled receptor mutant data. , 1999, Trends in pharmacological sciences.

[5]  K. Umesono,et al.  The nuclear receptor superfamily: The second decade , 1995, Cell.

[6]  Gert Vriend,et al.  A common motif in G-protein-coupled seven transmembrane helix receptors , 1993, J. Comput. Aided Mol. Des..

[7]  H. Gronemeyer,et al.  Transcription factors 3: nuclear receptors. , 1995, Protein profile.

[8]  J. Baldwin,et al.  An alpha-carbon template for the transmembrane helices in the rhodopsin family of G-protein-coupled receptors. , 1997, Journal of molecular biology.

[9]  V. Laudet,et al.  Evolution of the nuclear receptor superfamily: early diversification from an ancestral orphan receptor. , 1997, Journal of molecular endocrinology.

[10]  Gert Vriend,et al.  GPCRDB information system for G protein-coupled receptors , 2003, Nucleic Acids Res..

[11]  Sandor Suhai Genomics and Proteomics: Functional and Computational Aspects , 2000 .

[12]  Manuel C. Peitsch Membrane protein models , 1997 .

[13]  C. Sander,et al.  Positioning hydrogen atoms by optimizing hydrogen‐bond networks in protein structures , 1996, Proteins.

[14]  M. L. Jones,et al.  PDBsum: a Web-based database of summaries and analyses of all PDB structures. , 1997, Trends in biochemical sciences.

[15]  K. Umesono,et al.  A Unified Nomenclature System for the Nuclear Receptor Superfamily , 1999, Cell.

[16]  Rodrigo Lopez,et al.  The EMBL Nucleotide Sequence Database , 1999, Nucleic Acids Res..

[17]  David E. Gloriam,et al.  GPCRdb: an information system for G protein-coupled receptors , 2015, Nucleic Acids Res..

[18]  F. Cohen,et al.  An evolutionary trace method defines binding surfaces common to protein families. , 1996, Journal of molecular biology.

[19]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[20]  K. Palczewski,et al.  Crystal Structure of Rhodopsin: A G‐Protein‐Coupled Receptor , 2000, Science.

[21]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[22]  R A Sayle,et al.  RASMOL: biomolecular graphics for all. , 1995, Trends in biochemical sciences.

[23]  David Pearce,et al.  The Nuclear Receptor Resource: a growing family , 1998, Nucleic Acids Res..

[24]  D Walther,et al.  WebMol--a Java-based PDB viewer. , 1997, Trends in biochemical sciences.

[25]  L Pinsky,et al.  The androgen receptor gene mutations database. , 1994, Nucleic acids research.

[26]  Mustapha Mokrane,et al.  G Protein-Coupled Receptors, or The Power of Data , 2002 .

[27]  P. Argos,et al.  SRS: information retrieval system for molecular biology data banks. , 1996, Methods in enzymology.