mTFkb: a knowledgebase for fundamental annotation of mouse transcription factors

Transcription factors (TFs) are well-known important regulators in cell biology and tissue development. However, in mouse, one of the most widely-used model species, currently the vast majority of the known TFs have not been functionally studied due to the lack of sufficient annotations. To this end, we collected and analyzed the whole transcriptome sequencing data from more than 30 major mouse tissues and used the expression profiles to annotate the TFs. We found that the expression patterns of the TFs are highly correlated with the histology of the tissue types thus can be used to infer the potential functions of the TFs. Furthermore, we found that as many as 30% TFs display tissue-specific expression pattern, and these tissue-specific TFs are among the key TFs in their corresponding tissues. We also observed signals of divergent transcription associated with many TFs with unique expression pattern. Lastly, we have integrated all the data, our analysis results as well as various annotation resources to build a web-based database named mTFkb freely accessible at http://www.myogenesisdb.org/mTFkb/. We believe that mTFkb could serve as a useful and valuable resource for TF studies in mouse.

[1]  David A. Orlando,et al.  Master Transcription Factors and Mediator Establish Super-Enhancers at Key Cell Identity Genes , 2013, Cell.

[2]  Leming Shi,et al.  Identification of Tissue-Specific Protein-Coding and Noncoding Transcripts across 14 Human Tissues Using RNA-seq , 2016, Scientific Reports.

[3]  R. Young,et al.  Transcription of eukaryotic protein-coding genes. , 2000, Annual review of genetics.

[4]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[5]  Lior Pachter,et al.  Sequence Analysis , 2020, Definitions.

[6]  P. Casaccia‐Bonnefil,et al.  The Yin and Yang of YY1 in the nervous system , 2008, Journal of neurochemistry.

[7]  A. Kuroiwa,et al.  Hoxa‐11 and Hoxa‐13 are involved in repression of MyoD during limb muscle development , 2003, Development, growth & differentiation.

[8]  Catherine May The role of Islet-1 in the endocrine pancreas: Lessons from pancreas specific Islet-1 deficient mice , 2010, Islets.

[9]  Sally Temple,et al.  A Systematic Approach to Identify Candidate Transcription Factors that Control Cell Identity , 2015, Stem cell reports.

[10]  Wei Chen,et al.  Mutually exclusive signaling signatures define the hepatic and pancreatic progenitor cell lineage divergence , 2013, Genes & development.

[11]  D. Sinclair,et al.  Controlled DNA double-strand break induction in mice reveals post-damage transcriptome stability , 2015, Nucleic acids research.

[12]  Sylvie Breton,et al.  The Forkhead Transcription Factor Foxi1 Is a Master Regulator of Vacuolar H+-ATPase Proton Pump Subunits in the Inner Ear, Kidney and Epididymis , 2009, PloS one.

[13]  Albert E. Almada,et al.  Divergent transcription of long noncoding RNA/mRNA gene pairs in embryonic stem cells , 2013, Proceedings of the National Academy of Sciences.

[14]  Hao Sun,et al.  YY1TargetDB: an integral information resource for Yin Yang 1 target loci , 2013, Database J. Biol. Databases Curation.

[15]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[16]  Hui Liu,et al.  AnimalTFDB: a comprehensive animal transcription factor database , 2011, Nucleic Acids Res..

[17]  Nicolas Servant,et al.  A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis , 2013, Briefings Bioinform..

[18]  William McGinnis,et al.  Evolution of transcription factor function. , 2003, Current opinion in genetics & development.

[19]  Lin Yang,et al.  TFBSshape: a motif database for DNA shape features of transcription factor binding sites , 2013, Nucleic Acids Res..

[20]  Hao Sun,et al.  Genome‐wide survey by ChIP‐seq reveals YY1 regulation of lincRNAs in skeletal myogenesis , 2013, The EMBO journal.

[21]  Debra L. Fulton,et al.  TFCat: the curated catalog of mouse and human transcription factors , 2009, Genome Biology.

[22]  Hao Sun,et al.  Linc-YY1 promotes myogenic differentiation and muscle regeneration through an interaction with the transcription factor YY1 , 2015, Nature Communications.

[23]  Takashi Tanaka,et al.  The biology of Stat4 and Stat6 , 2000, Oncogene.

[24]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[25]  E. Furlong,et al.  Transcription factors: from enhancer binding to developmental control , 2012, Nature Reviews Genetics.

[26]  S. Günther,et al.  VITO-1, a novel vestigial related protein is predominantly expressed in the skeletal muscle lineage , 2002, Mechanisms of Development.

[27]  J. Rinn,et al.  Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs , 2010, Nature Biotechnology.

[28]  B. Bonavida,et al.  Transcription factor YY1: structure, function, and therapeutic implications in cancer biology , 2006, Oncogene.

[29]  Juan M. Vaquerizas,et al.  A census of human transcription factors: function, expression and evolution , 2009, Nature Reviews Genetics.

[30]  S. Smale Pioneer factors in embryonic stem cells and differentiation. , 2010, Current opinion in genetics & development.

[31]  D. Latchman Transcription factors: an overview. , 1997, The international journal of biochemistry & cell biology.

[32]  M. Rudnicki,et al.  MyoD and Myf-5 differentially regulate the development of limb versus trunk skeletal muscle. , 1997, Development.

[33]  J. Kawai,et al.  A genome-wide and nonredundant mouse transcription factor database. , 2004, Biochemical and biophysical research communications.

[34]  A. Stewart,et al.  Mammalian Vestigial-like 2, a Cofactor of TEF-1 and MEF2 Transcription Factors That Promotes Skeletal Muscle Differentiation* , 2002, The Journal of Biological Chemistry.

[35]  Rudolf Jaenisch,et al.  Mechanisms and models of somatic cell reprogramming , 2013, Nature Reviews Genetics.

[36]  Marc Robinson-Rechavi,et al.  A benchmark of gene expression tissue-specificity metrics , 2015, bioRxiv.

[37]  Hao Sun,et al.  Sebnif: An Integrated Bioinformatics Pipeline for the Identification of Novel Large Intergenic Noncoding RNAs (lincRNAs) - Application in Human Skeletal Muscle Cells , 2014, PloS one.

[38]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[39]  Yu Xue,et al.  AnimalTFDB 2.0: a resource for expression, prediction and functional study of animal transcription factors , 2014, Nucleic Acids Res..

[40]  Tatiana A. Tatusova,et al.  NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy , 2011, Nucleic Acids Res..

[41]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[42]  Jens Nielsen,et al.  Transcriptomics resources of human tissues and organs , 2016, Molecular systems biology.

[43]  W. Cui,et al.  Sox2, a key factor in the regulation of pluripotency and neural differentiation. , 2014, World journal of stem cells.

[44]  Cole Trapnell,et al.  Improving RNA-Seq expression estimates by correcting for fragment bias , 2011, Genome Biology.

[45]  M. Buckingham,et al.  The formation of skeletal muscle: from somite to limb , 2003, Journal of anatomy.

[46]  J. Rinn,et al.  Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs , 2010, Nature biotechnology.

[47]  Canglin Wu,et al.  RegNetwork: an integrated database of transcriptional and post-transcriptional regulatory networks in human and mouse , 2015, Database J. Biol. Databases Curation.

[48]  Sarah A. Teichmann,et al.  DBD––taxonomically broad transcription factor predictions: new content and functionality , 2007, Nucleic Acids Res..

[49]  Denis Puthier,et al.  Divergent transcription is associated with promoters of transcriptional regulators , 2013, BMC Genomics.

[50]  A. Stewart,et al.  Transcription cofactor Vgl‐2 is required for skeletal muscle differentiation , 2004, Genesis.

[51]  D. Kleinjan,et al.  The Developmental Regulator Pax6 Is Essential for Maintenance of Islet Cell Function in the Adult Mouse Pancreas , 2013, PloS one.

[52]  Denglong Wu,et al.  Gene microarray analysis of the lncRNA expression profile in human urothelial carcinoma of the bladder. , 2014, International journal of clinical and experimental medicine.

[53]  T R Hughes,et al.  A catalogue of eukaryotic transcription factor types, their evolutionary origin, and species distribution. , 2011, Sub-cellular biochemistry.

[54]  V. Gallo,et al.  FOXN1: A Master Regulator Gene of Thymic Epithelial Development Program , 2013, Front. Immunol..

[55]  R. Young,et al.  Super-Enhancers in the Control of Cell Identity and Disease , 2013, Cell.

[56]  Raymond K. Auerbach,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[57]  K. Tréguer,et al.  Vestigial like gene family expression in Xenopus: common and divergent features with other vertebrates. , 2010, The International journal of developmental biology.

[58]  G. Rutter,et al.  Rfx6 Maintains the Functional Identity of Adult Pancreatic β Cells , 2014, Cell reports.

[59]  Ying Jin,et al.  Role of Oct4 in maintaining and regaining stem cell pluripotency , 2010, Stem Cell Research & Therapy.

[60]  K. Kadota,et al.  Detection of genes with tissue-specific expression patterns using Akaike's information criterion procedure. , 2003, Physiological genomics.

[61]  Minoru Kanehisa,et al.  KEGG: new perspectives on genomes, pathways, diseases and drugs , 2016, Nucleic Acids Res..

[62]  F. Gao,et al.  Expression and clinicopathological significance of the lncRNA HOXA11-AS in colorectal cancer. , 2016, Oncology letters.

[63]  S. Günther,et al.  VITO-1, a novel vestigial related protein is predominantly expressed in the skeletal muscle lineage. , 2002, Gene expression patterns : GEP.

[64]  M. Gautel,et al.  Transcriptional mechanisms regulating skeletal muscle differentiation, growth and homeostasis , 2011, Nature Reviews Molecular Cell Biology.