A large quantity of novel human antisense transcripts detected by LongSAGE

MOTIVATION Taking advantage of the high sensitivity and specificity of LongSAGE tag for transcript detection and genome mapping, we analyzed the 632 813 unique human LongSAGE tags deposited in public databases to identify novel human antisense transcripts. RESULTS Our study identified 45 321 tags that match the antisense strand of 9804 known mRNA sequences, 6606 of which contain antisense ESTs and 3198 are mapped only by SAGE tags. Quantitative analysis showed that the detected antisense transcripts are present at levels lower than their counterpart sense transcripts. Experimental results confirmed the presence of antisense transcripts detected by the antisense tags. We also constructed an antisense tag database that can be used to identify the antisense SAGE tags originated from the antisense strand of known mRNA sequences included in the RefSeq database. CONCLUSIONS Our study highlights the benefits of exploring SAGE data for comprehensive identification of human antisense transcripts and demonstrates the prevalence of antisense transcripts in the human genome.

[1]  Jeannie T. Lee,et al.  Tsix, a gene antisense to Xist at the X-inactivation centre , 1999, Nature Genetics.

[2]  M. Holland,et al.  Transcript Abundance in Yeast Varies over Six Orders of Magnitude* , 2002, The Journal of Biological Chemistry.

[3]  J. Lupski Structural variation in the human genome. , 2007, The New England journal of medicine.

[4]  Viatcheslav R. Akmaev,et al.  Correction of sequence-based artifacts in serial analysis of gene expression , 2004, Bioinform..

[5]  Ji Huang,et al.  [Serial analysis of gene expression]. , 2002, Yi chuan = Hereditas.

[6]  J. Rowley,et al.  Identifying novel transcripts and novel genes in the human genome by using novel SAGE tags , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Erez Y. Levanon,et al.  Widespread occurrence of antisense transcription in the human genome , 2003, Nature Biotechnology.

[8]  M. Rosbash,et al.  Three abundance classes in HeLa cell messenger RNA , 1974, Nature.

[9]  A. Sparks,et al.  Using the transcriptome to annotate the genome , 2002, Nature Biotechnology.

[10]  S. Seal,et al.  Localization of a breast cancer susceptibility gene, BRCA2, to chromosome 13q12-13. , 1994, Science.

[11]  Xiaoqiu Huang,et al.  Over 20% of human transcripts might form sense-antisense pairs. , 2004, Nucleic acids research.

[12]  R. Simons,et al.  Biological regulation by antisense RNA in prokaryotes. , 1988, Annual review of genetics.

[13]  Jay Shendure,et al.  Computational discovery of sense-antisense transcription in the human and mouse genomes , 2002, Genome Biology.

[14]  Ulrich Heinzmann,et al.  LongSAGE analysis revealed the presence of a large number of novel antisense genes in the mouse genome , 2005, Bioinform..

[15]  Ben Lehner,et al.  Antisense transcripts in the human genome. , 2002, Trends in genetics : TIG.

[16]  Sarah Barber,et al.  A mouse atlas of gene expression: large-scale digital gene-expression profiles from precisely defined developing C57BL/6J mouse tissues and cells. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Jian Wang,et al.  Detecting novel low-abundant transcripts in Drosophila. , 2005, RNA.

[18]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[19]  S. Batalov,et al.  Antisense Transcription in the Mammalian Transcriptome , 2005, Science.

[20]  D. Higgins,et al.  Overlapping Antisense Transcription in the Human Genome , 2002, Comparative and functional genomics.

[21]  Tatiana A. Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[22]  S Rozen,et al.  Primer3 on the WWW for general users and for biologist programmers. , 2000, Methods in molecular biology.

[23]  S. Altschul,et al.  A public database for gene expression in human cancers. , 1999, Cancer research.

[24]  Phillip A. Sharp,et al.  The RNAi revolution , 2004, Nature.

[25]  Thérèse Commes,et al.  Mining SAGE data allows large-scale, sensitive screening of antisense transcript expression. , 2004, Nucleic acids research.

[26]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..