Transcription factor enrichment analysis (TFEA): Quantifying the activity of hundreds of transcription factors from a single experiment

Detecting differential activation of transcription factors (TFs) in response to perturbation provides insight into cellular processes. Transcription Factor Enrichment Analysis (TFEA) is a robust and reliable computational method that detects differential activity of hundreds of TFs given any set of perturbation data. TFEA draws inspiration from GSEA and detects positional motif enrichment within a list of ranked regions of interest (ROIs). As ROIs are typically inferred from the data, we also introduce muMerge, a statistically principled method of generating a consensus list of ROIs from multiple replicates and conditions. TFEA is broadly applicable to data that informs on transcriptional regulation including nascent (eg. PRO-Seq), CAGE, ChIP-Seq, and accessibility (e.g. ATAC-Seq). TFEA not only identifies the key regulators responding to a perturbation, but also temporally unravels regulatory networks with time series data. Consequently, TFEA serves as a hypothesis-generating tool that provides an easy, rigorous, and cost-effective means to broadly assess TF activity yielding new biological insights.

[1]  Purnima Bholowalia,et al.  EBK-Means: A Clustering Technique based on Elbow Method and K-Means in WSN , 2014 .

[2]  Jesse J. Lipp,et al.  SLAM-seq defines direct gene-regulatory functions of the BRD4-MYC axis , 2018, Science.

[3]  T. Read,et al.  Enhancer RNA profiling predicts transcription factor activity , 2018, Genome research.

[4]  Z. Weng,et al.  Functional analysis of transcription factor binding sites in human promoters , 2012, Genome Biology.

[5]  Timothy L. Bailey,et al.  Motif Enrichment Analysis: a unified framework and an evaluation on ChIP data , 2010, BMC Bioinformatics.

[6]  David J. Arenillas,et al.  JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles , 2015, Nucleic Acids Res..

[7]  C. Glass,et al.  Enhancer RNAs and regulated transcriptional programs. , 2014, Trends in biochemical sciences.

[8]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[9]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[10]  M. Goldman,et al.  Critical Role of the IFN-Stimulated Gene Factor 3 Complex in TLR-Mediated IL-27p28 Gene Expression Revealing a Two-Step Activation Process , 2010, The Journal of Immunology.

[11]  Valentina Boeva,et al.  Analysis of Genomic Sequence Motifs for Deciphering Transcription Factor Binding and Transcriptional Regulation in Eukaryotic Cells , 2016, Front. Genet..

[12]  F. A. Kolpakov,et al.  HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis , 2017, Nucleic Acids Res..

[13]  Morgan A. Sammons,et al.  Control of p53-dependent transcription and enhancer activity by the p53 family member p63 , 2019, The Journal of Biological Chemistry.

[14]  William Stafford Noble,et al.  FIMO: scanning for occurrences of a given motif , 2011, Bioinform..

[15]  T. E. Wilson,et al.  Use of Bru-Seq and BruChase-Seq for genome-wide assessment of the synthesis and stability of RNA. , 2014, Methods.

[16]  F. Gao,et al.  Helicobacter pylori stimulates host cyclooxygenase‐2 gene transcription: critical importance of MEK/ERK‐dependent activation of USF1/‐2 and CREB transcription factors , 2003, Cellular microbiology.

[17]  R. Dowell,et al.  Global analysis of p53-regulated transcription identifies its direct targets and unexpected regulatory mechanisms , 2014, eLife.

[18]  Erik Aronesty,et al.  Comparison of Sequencing Utility Programs , 2013 .

[19]  D. Baltimore,et al.  The stability of mRNA influences the temporal order of the induction of genes encoding inflammatory molecules , 2009, Nature Immunology.

[20]  Ian C. McDowell,et al.  Glucocorticoid receptor recruits to enhancers and drives activation by motif-directed binding , 2018, Genome research.

[21]  Jonathan K. Pritchard,et al.  The Functional Consequences of Variation in Transcription Factor Binding , 2013, PLoS genetics.

[22]  T. Hughes,et al.  The Human Transcription Factors , 2018, Cell.

[23]  Cesare Furlanello,et al.  A promoter-level mammalian expression atlas , 2015 .

[24]  T. Meehan,et al.  An atlas of active enhancers across human cell types and tissues , 2014, Nature.

[25]  A. Mortazavi,et al.  Integrating ChIP-seq with other functional genomics data , 2018, Briefings in functional genomics.

[26]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[27]  G. Natoli,et al.  High constitutive activity of a broad panel of housekeeping and tissue-specific cis-regulatory elements depends on a subset of ETS proteins. , 2017, Genes & development.

[28]  R. Haché,et al.  Attenuation of glucocorticoid signaling through targeted degradation of p300 via the 26S proteasome pathway. , 2002, Molecular endocrinology.

[29]  W. Leonard,et al.  The transcription factor GABP is a critical regulator of B lymphocyte development. , 2007, Immunity.

[30]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[31]  P. Moynagh,et al.  Nuclear Factor κB Subunits RelB and cRel Negatively Regulate Toll-like Receptor 3-mediated β-Interferon Production via Induction of Transcriptional Repressor Protein YY1* , 2011, The Journal of Biological Chemistry.

[32]  Geoffrey J Faulkner,et al.  Analysis of the human monocyte-derived macrophage transcriptome and response to lipopolysaccharide provides new insights into genetic aetiology of inflammatory bowel disease , 2017, PLoS genetics.

[33]  T. Bailey,et al.  Differential motif enrichment analysis of paired ChIP-seq experiments , 2014, BMC Genomics.

[34]  C. Glass,et al.  Rev-Erbs repress macrophage gene expression by inhibiting enhancer-directed transcription , 2013, Nature.

[35]  Vladimir B. Bajic,et al.  HOCOMOCO: expansion and enhancement of the collection of transcription factor binding sites models , 2015, Nucleic Acids Res..

[36]  André L. Martins,et al.  Analysis of nascent RNA identifies a unified architecture of initiation regions at mammalian promoters and enhancers , 2014, Nature Genetics.

[37]  C. Danko,et al.  Dynamic reorganization of the AC16 cardiomyocyte transcriptome in response to TNFα signaling revealed by integrated genomic analyses , 2014, BMC Genomics.

[38]  J. Kawai,et al.  Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[39]  Li-Rong Yu,et al.  Distinct roles of GCN5/PCAF‐mediated H3K9ac and CBP/p300‐mediated H3K18/27ac in nuclear receptor transactivation , 2011, The EMBO journal.

[40]  G. Stark,et al.  IRF9 and unphosphorylated STAT2 cooperate with NF-κB to drive IL6 expression , 2018, Proceedings of the National Academy of Sciences.

[41]  Leighton J. Core,et al.  Nuclear stability and transcriptional directionality separate functionally distinct RNA species , 2014, Nature Communications.

[42]  Leighton J. Core,et al.  Hojoong Kwak Initiation and Pausing Precise Maps of RNA Polymerase Reveal How Promoters Direct , 2013 .

[43]  Y. Gilad,et al.  RNA-seq: impact of RNA degradation on transcript quantification , 2014, BMC Biology.

[44]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[45]  Piotr J. Balwierz,et al.  ISMARA: automated modeling of genomic signals as a democracy of regulatory motifs , 2014, Genome research.

[46]  Judith B. Zaugg,et al.  Quantification of differential transcription factor activity and multiomics-based classification into activators and repressors: diffTF , 2018, bioRxiv.

[47]  J. Lis,et al.  Single-molecule nascent RNA sequencing reveals regulatory domain architecture at promoters and enhancers , 2018, Nature Genetics.

[48]  J. Michael Cherry,et al.  The Encyclopedia of DNA elements (ENCODE): data portal update , 2017, Nucleic Acids Res..

[49]  C. Danko,et al.  Enhancer transcripts mark active estrogen receptor binding sites , 2013, Genome research.

[50]  Andrew J. Bannister,et al.  Regulation of chromatin by histone modifications , 2011, Cell Research.

[51]  Paolo Di Tommaso,et al.  Nextflow enables reproducible computational workflows , 2017, Nature Biotechnology.

[52]  Jun Cai,et al.  YY1 promotes IL-6 expression in LPS-stimulated BV2 microglial cells by interacting with p65 to promote transcriptional activation of IL-6. , 2018, Biochemical and biophysical research communications.

[53]  J. Brickman,et al.  Time-Resolved Analysis Reveals Rapid Dynamics and Broad Scope of the CBP/p300 Acetylome , 2018, Cell.

[54]  A. Wakamatsu,et al.  Genome-wide determination of RNA stability reveals hundreds of short-lived noncoding transcripts in mammals , 2012, Genome research.

[55]  T. Bailey,et al.  Inferring direct DNA binding from ChIP-seq , 2012, Nucleic acids research.

[56]  J. Christman,et al.  Yin Yang 1 enhances cyclooxygenase-2 gene expression in macrophages. , 2007, American journal of physiology. Lung cellular and molecular physiology.

[57]  Robin D. Dowell,et al.  A generative model for the behavior of RNA polymerase , 2016, Bioinform..

[58]  Karen E Gascoigne,et al.  Enhancer Activity Requires CBP/P300 Bromodomain-Dependent Histone H3K27 Acetylation. , 2018, Cell reports.

[59]  A. Gerber,et al.  Nascent transcript analysis of glucocorticoid crosstalk with TNF defines primary and cooperative inflammatory repression , 2019, Genome Research.

[60]  Mikhail Spivakov Spurious transcription factor binding: Non-functional or genetically redundant? , 2014, BioEssays : news and reviews in molecular, cellular and developmental biology.

[61]  Leighton J. Core,et al.  Nascent RNA Sequencing Reveals Widespread Pausing and Divergent Initiation at Human Promoters , 2008, Science.