LIFEdb: a database for functional genomics experiments integrating information from external sources, and serving as a sample tracking system

We have implemented LIFEdb (http://www.dkfz.de/LIFEdb) to link information on novel human full-length cDNAs, generated and sequenced by the German cDNA Consortium, with functional information on the encoded proteins obtained in functional genomics and proteomics approaches. The database also serves as a sample-tracking system that manages the process from cDNA to experimental read-out and data interpretation. A web interface enables the scientific community to explore and visualize features of the annotated cDNAs and ORFs together with experimental results, and thus helps to uncover new features of proteins whose functions are as yet unknown.
