Using the DFCI Gene Index Databases for Biological Discovery

The DFCI Gene Index Web pages provide access to analyses of ESTs and gene sequences for nearly 114 species, as well as a number of resources derived from these analyses. Each species‐specific database is presented in a common format with a home page. Users can search each species‐specific database in several ways. The methods currently implemented include nucleotide or protein sequence queries using WU‐BLAST; text‐based searches using various sequence identifiers; searches by gene, tissue, and library name; and searches by functional class through Gene Ontology assignments. This protocol provides guidance for using the Gene Index Databases to extract information. Curr. Protoc. Bioinform. 29:1.6.1‐1.6.36. © 2010 by John Wiley & Sons, Inc.
