RefSeq and LocusLink: NCBI gene-centered resources

Thousands of genes have been painstakingly identified and characterized a few genes at a time. Many thousands more are being predicted by large scale cDNA and genomic sequencing projects, with levels of evidence ranging from supporting mRNA sequence and comparative genomics to computing ab initio models. This, coupled with the burgeoning scientific literature, makes it critical to have a comprehensive directory for genes and reference sequences for key genomes. The NCBI provides two resources, LocusLink and RefSeq, to meet these needs. LocusLink organizes information around genes to generate a central hub for accessing gene-specific information for fruit fly, human, mouse, rat and zebrafish. RefSeq provides reference sequence standards for genomes, transcripts and proteins; human, mouse and rat mRNA RefSeqs, and their corresponding proteins, are discussed here. Together, RefSeq and LocusLink provide a non-redundant view of genes and other loci to support research on genes and gene families, variation, gene expression and genome annotation. Additional information about LocusLink and RefSeq is available at http://www.ncbi.nlm.nih.gov/LocusLink/.

[1]  K. Sirotkin,et al.  dbSNP-database for single nucleotide polymorphisms and other classes of minor genetic variation. , 1999, Genome research.

[2]  Gregory D Schuler,et al.  Sequence mapping by electronic PCR , 1997, Genome research.

[3]  W. Gelbart The FlyBase database of the Drosophila Genome Projects and community literature. , 1999, Nucleic acids research.

[4]  M. Westerfield,et al.  Zebrafish informatics and the ZFIN database. , 1999, Methods in cell biology.

[5]  S Povey,et al.  Guidelines for human gene nomenclature (1997). HUGO Nomenclature Committee. , 1997, Genomics.

[6]  D. Valle,et al.  Online Mendelian Inheritance In Man (OMIM) , 2000, Human mutation.

[7]  G. Schuler,et al.  Entrez: molecular biology database and retrieval system. , 1996, Methods in enzymology.

[8]  G. Schuler Pieces of the puzzle: expressed sequence tags and the catalog of human genes , 1997, Journal of Molecular Medicine.

[9]  Mathew W. Wright,et al.  Guidelines for human gene nomenclature. , 2002, Genomics.

[10]  K. Katz,et al.  Introducing RefSeq and LocusLink: curated human genome resources at the NCBI. , 2000, Trends in genetics : TIG.

[11]  G. Schuler,et al.  Making effective use of human genomic sequence data. , 1999, Trends in genetics : TIG.

[12]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[13]  Judith A. Blake,et al.  The Mouse Genome Database (MGD): expanding genetic and genomic resources for the laboratory mouse , 2000, Nucleic Acids Res..