Correct identification of genes from serial analysis of gene expression tag sequences.

SAGE (serial analysis of gene expression) is a remarkable technique for genome-wide analysis of gene expression. It is crucial to understand the extent to which SAGE can accurately indicate a gene or expressed sequence tag (EST) with a single tag. We analyzed the effect of the size of SAGE tag on gene identification. Our observation indicates that SAGE tags are in general not long enough to achieve the degree of uniqueness of identification originally envisaged. Our observations also indicate that the limitation of using SAGE tag to identify a gene can be overcome by converting SAGE tags into longer 3' EST sequences with the generation of longer cDNA fragments from SAGE tages for gene identification (GLGI) method.

[1]  Jianjun Chen,et al.  High‐throughput GLGI procedure for converting a large number of serial analysis of gene expression tag sequences into 3′ complementary DNAs , 2002, Genes, chromosomes & cancer.

[2]  K. Matsushima,et al.  Serial analysis of gene expression in human monocytes and macrophages. , 1999, Blood.

[3]  S. Altschul,et al.  A public database for gene expression in human cancers. , 1999, Cancer research.

[4]  K. Matsushima,et al.  Serial analysis of gene expression in human monocyte-derived dendritic cells. , 1999, Blood.

[5]  A. Ryo,et al.  Serial analysis of gene expression in a microglial cell line , 1999, Glia.

[6]  G. Landes,et al.  Combining serial analysis of gene expression and array technologies to identify genes differentially expressed in breast cancer. , 1999, Cancer research.

[7]  K. Matsushima,et al.  Comprehensive gene expression profile of a normal human liver. , 2000, Biochemical and biophysical research communications.

[8]  K. Matsushima,et al.  WISP-2 as a novel estrogen-responsive gene in human breast cancer cells. , 2000, Biochemical and biophysical research communications.

[9]  Sanggyu Lee,et al.  Computational Analysis of Gene Identification with SAGE , 2003, J. Comput. Biol..

[10]  K. Matsushima,et al.  Comprehensive gene expression profile of LPS-stimulated human monocytes by SAGE. , 2000, Blood.

[11]  G. Landes,et al.  Analysis of human transcriptomes , 1999, Nature Genetics.

[12]  A. Seth,et al.  Coordinate Expression of Novel Genes During Osteoblast Differentiation , 2000, Journal of bone and mineral research : the official journal of the American Society for Bone and Mineral Research.

[13]  S. Welle,et al.  Inventory of high-abundance mRNAs in skeletal muscle of normal men. , 1999, Genome research.

[14]  S. Altschul,et al.  SAGEmap: a public gene expression resource. , 2000, Genome research.

[15]  A. Parle‐McDermott,et al.  Serial analysis of gene expression identifies putative metastasis-associated transcripts in colon tumour cellines , 2000, British Journal of Cancer.

[16]  A. Ryo,et al.  Serial analysis of gene expression in HIV‐1‐infected T cell lines , 1999, FEBS letters.

[17]  K. Matsushima,et al.  Identification of genes specifically expressed in human activated and mature dendritic cells through serial analysis of gene expression. , 2000, Blood.

[18]  J. L. Stanton,et al.  Molecular phenotype of the human oocyte by PCR-SAGE. , 2000, Genomics.

[19]  J. Rowley,et al.  The pattern of gene expression in human CD15+ myeloid progenitor cells , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[20]  J. Rowley,et al.  Generation of longer cDNA fragments from serial analysis of gene expression tags for gene identification. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[21]  P M Bossuyt,et al.  Genes differentially expressed in medulloblastoma and fetal brain. , 1999, Physiological genomics.