LongTarget: a tool to predict lncRNA DNA-binding motifs and binding sites via Hoogsteen base-pairing analysis

MOTIVATION: In mammalian cells, many genes are silenced by genome methylation. DNA methyltransferases and polycomb repressive complexes, which both lack sequence-specific DNA-binding motifs, are recruited by long non-coding RNAs (lncRNAs) to specific genomic sites to methylate DNA and chromatin. Increasing evidence indicates that many lncRNAs contain DNA-binding motifs that bind DNA by forming RNA:DNA triplexes. Identifying lncRNA DNA-binding motifs and binding sites is essential for deciphering lncRNA functions and for understanding both correct and erroneous genome methylation; however, such identification is challenging because lncRNAs may contain thousands of nucleotides. No computational analysis of typical lncRNAs has been reported. Here, we report a computational method and program (LongTarget) to predict lncRNA DNA-binding motifs and binding sites. We used this program to analyse multiple antisense lncRNAs, including those that control well-known imprinting clusters, and obtained results that agree with experimental observations and epigenetic marks. These results suggest that it is feasible to predict many lncRNA DNA-binding motifs and binding sites genome-wide.

AVAILABILITY AND IMPLEMENTATION: Website of LongTarget: lncrna.smu.edu.cn, or contact: hao.zhu@ymail.com.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
