TRlnc: a comprehensive database for human transcriptional regulatory information of lncRNAs

Long noncoding RNAs (lncRNAs) have been shown to play important roles in transcriptional processes and biological functions. As studies of human diseases and biological processes multiply, human H3K27ac ChIP-seq, ATAC-seq and DNase-seq datasets are accumulating rapidly, creating an urgent need to collect and process these data to identify the transcriptional regulatory regions of lncRNAs. We therefore developed a comprehensive database for human transcriptional regulatory information of lncRNAs (TRlnc, http://bio.licpathway.net/TRlnc), which aims to collect available resources on the transcriptional regulatory regions of lncRNAs and to annotate and illustrate their potential roles in regulating lncRNAs in a cell type-specific manner. The current version of TRlnc contains 8 683 028 typical enhancers/super-enhancers and 32 348 244 chromatin accessibility regions associated with 91 906 human lncRNAs. These regions were identified from over 900 human H3K27ac ChIP-seq, ATAC-seq and DNase-seq samples. Furthermore, TRlnc provides detailed genetic and epigenetic annotations within the transcriptional regulatory regions (promoter, enhancer/super-enhancer and chromatin accessibility regions) of lncRNAs, including common SNPs, risk SNPs, eQTLs, linkage disequilibrium SNPs, transcription factors, methylation sites, histone modifications and 3D chromatin interactions. We anticipate that TRlnc will help users gain in-depth insights into the transcriptional regulatory mechanisms of lncRNAs.
