LncVar: a database of genetic variation associated with long non-coding genes

MOTIVATION Long non-coding RNAs (lncRNAs) are essential in many molecular pathways, and are frequently associated with disease but the mechanisms of most lncRNAs have not yet been characterized. Genetic variations, including single nucleotide polymorphisms (SNPs) and structural variations, are widely distributed in the genome, including lncRNA gene regions. As the number of studies on lncRNAs grows rapidly, it is necessary to evaluate the effects of genetic variations on lncRNAs. RESULTS Here, we present LncVar, a database of genetic variation associated with long non-coding genes in six species. We collected lncRNAs from the NONCODE database, and evaluated their conservation. We systematically integrated transcription factor binding sites and m6A modification sites of lncRNAs and provided comprehensive effects of SNPs on transcription and modification of lncRNAs. We collected putatively translated open reading frames (ORFs) in lncRNAs, and identified both synonymous and non-synonymous SNPs in ORFs. We also collected expression quantitative trait loci of lncRNAs from the literature. Furthermore, we identified lncRNAs in CNV regions as prognostic biomarker candidates of cancers and predicted lncRNA gene fusion events from RNA-seq data from cell lines. The LncVar database can be used as a resource to evaluate the effects of the variations on the biological function of lncRNAs. AVAILABILITY AND IMPLEMENTATION LncVar is available at http://bioinfo.ibp.ac.cn/LncVar CONTACT: rschen@ibp.ac.cnSupplementary information: Supplementary materials are available at Bioinformatics online.

[1]  Mai Har Sham,et al.  The role of SOX10 during enteric nervous system development. , 2013, Developmental biology.

[2]  Krishna R. Kalari,et al.  Gene Expression, Single Nucleotide Variant and Fusion Transcript Discovery in Archival Material from Breast Tumors , 2013, PloS one.

[3]  Wei Li,et al.  The polymorphism rs944289 predisposes to papillary thyroid carcinoma through a large intergenic noncoding RNA gene of tumor suppressor type , 2012, Proceedings of the National Academy of Sciences.

[4]  Gaurav Kumar Pandey,et al.  The risk-associated long noncoding RNA NBAT-1 controls neuroblastoma progression by regulating cell proliferation and neuronal differentiation. , 2014, Cancer cell.

[5]  D. Bartel,et al.  lincRNAs: Genomics, Evolution, and Mechanisms , 2013, Cell.

[6]  Brian T. Lee,et al.  The UCSC Genome Browser database: 2015 update , 2014, Nucleic Acids Res..

[7]  Sheng Li,et al.  The pivotal regulatory landscape of RNA modifications. , 2014, Annual review of genomics and human genetics.

[8]  Jean-Michel Claverie,et al.  The statistical significance of nucleotide position-weight matrix matches , 1996, Comput. Appl. Biosci..

[9]  William Stafford Noble,et al.  Fine-scale chromatin interaction maps reveal the cis-regulatory landscape of human lincRNA genes , 2014, Nature Methods.

[10]  Minoru Yoshida,et al.  RNA-Methylation-Dependent RNA Processing Controls the Speed of the Circadian Clock , 2013, Cell.

[11]  John M. Shelton,et al.  A Micropeptide Encoded by a Putative Long Noncoding RNA Regulates Muscle Performance , 2015, Cell.

[12]  E. Rankin,et al.  Hypoxic control of metastasis , 2016, Science.

[13]  Jingyuan Fu,et al.  Human Disease-Associated Genetic Variation Impacts Large Intergenic Non-Coding RNA Expression , 2013, PLoS genetics.

[14]  An-Yuan Guo,et al.  lncRNASNP: a database of SNPs in lncRNAs and their potential functions in human and mouse , 2014, Nucleic Acids Res..

[15]  Michael F. Lin,et al.  Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals , 2009, Nature.

[16]  Bing Ren,et al.  N6-methyladenosine-dependent regulation of messenger RNA stability , 2013 .

[17]  M. Albà,et al.  Long non-coding RNAs as a source of new peptides , 2014, eLife.

[18]  Yan Li,et al.  A high-resolution map of three-dimensional chromatin interactome in human cells , 2013, Nature.

[19]  Fang Fang,et al.  FusionMap: detecting fusion genes from next-generation sequencing data at base-pair resolution , 2011, Bioinform..

[20]  R. Guigó,et al.  Transcriptome genetics using second generation sequencing in a Caucasian population , 2010, Nature.

[21]  Yi Xing,et al.  m(6)A RNA modification controls cell fate transition in mammalian embryonic stem cells. , 2014, Cell stem cell.

[22]  David J. Arenillas,et al.  JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles , 2015, Nucleic Acids Res..

[23]  Hongzhe Li,et al.  A functional genomic approach identifies FAL1 as an oncogenic long noncoding RNA that associates with BMI1 and represses p21 expression in cancer. , 2014, Cancer cell.

[24]  Nikolaus Rajewsky,et al.  Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation , 2014, The EMBO journal.

[25]  Nicholas T. Ingolia,et al.  Ribosome Profiling of Mouse Embryonic Stem Cells Reveals the Complexity and Dynamics of Mammalian Proteomes , 2011, Cell.

[26]  M. Kupiec,et al.  Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq , 2012, Nature.

[27]  O. Elemento,et al.  Comprehensive Analysis of mRNA Methylation Reveals Enrichment in 3′ UTRs and near Stop Codons , 2012, Cell.

[28]  B. Moss,et al.  Nucleotide sequences at the N6-methyladenosine sites of HeLa cell messenger ribonucleic acid. , 1977, Biochemistry.

[29]  Nicholas T. Ingolia,et al.  Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling , 2009, Science.

[30]  James B. Brown,et al.  Long noncoding RNAs are rarely translated in two human cell lines , 2012, Genome research.

[31]  Simon Hess,et al.  The fat mass and obesity associated gene (Fto) regulates activity of the dopaminergic midbrain circuitry , 2013, Nature Neuroscience.

[32]  Yusuke Nakamura,et al.  Genome-Wide Association Study of Pancreatic Cancer in Japanese Population , 2010, PloS one.

[33]  J. Dekker,et al.  The long-range interaction landscape of gene promoters , 2012, Nature.

[34]  Bo Wang,et al.  A genome-wide association study combining pathway analysis for typical sporadic amyotrophic lateral sclerosis in Chinese Han populations , 2014, Neurobiology of Aging.

[35]  Lior Pachter,et al.  Sequence Analysis , 2020, Definitions.

[36]  R J Roberts,et al.  Sequence specificity of the human mRNA N6-adenosine methylase in vitro. , 1990, Nucleic acids research.

[37]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[38]  Yang Wang,et al.  N6-methyladenosine modification destabilizes developmental regulators in embryonic stem cells , 2014, Nature Cell Biology.

[39]  Zhike Lu,et al.  Unique Features of the m6A Methylome in Arabidopsis thaliana , 2014, Nature Communications.

[40]  Süleyman Cenk Sahinalp,et al.  deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data , 2011, PLoS Comput. Biol..

[41]  Wei Wu,et al.  NONCODEv4: exploring the world of long non-coding RNA genes , 2013, Nucleic Acids Res..

[42]  Peng Wang,et al.  LincSNP: a database of linking disease-associated SNPs to human large intergenic non-coding RNAs , 2014, BMC Bioinformatics.

[43]  S. Dhanasekaran,et al.  The landscape of long noncoding RNAs in the human transcriptome , 2015, Nature Genetics.

[44]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[45]  B. Kuster,et al.  Mass-spectrometry-based draft of the human proteome , 2014, Nature.

[46]  L. Mirny,et al.  Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data , 2013, Nature Reviews Genetics.

[47]  Naoki Takahashi,et al.  The GAS5 (growth arrest-specific transcript 5) gene fuses to BCL6 as a result of t(1;3)(q25;q27) in a patient with B-cell lymphoma. , 2008, Cancer genetics and cytogenetics.

[48]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[49]  J. Rinn,et al.  Peptidomic discovery of short open reading frame-encoded peptides in human cells , 2012, Nature chemical biology.

[50]  Chuan He,et al.  FTO-dependent demethylation of N6-methyladenosine regulates mRNA splicing and is required for adipogenesis , 2014, Cell Research.

[51]  Zhike Lu,et al.  m6A-dependent regulation of messenger RNA stability , 2013, Nature.

[52]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .