Construction and Analysis of an Integrated Regulatory Network Derived from High-Throughput Sequencing Data

We present a network framework for analyzing multi-level regulation in higher eukaryotes based on systematic integration of various high-throughput datasets. The network, namely the integrated regulatory network, consists of three major types of regulation: TF→gene, TF→miRNA and miRNA→gene. We identified the target genes and target miRNAs for a set of TFs based on the ChIP-Seq binding profiles, the predicted targets of miRNAs using annotated 3′UTR sequences and conservation information. Making use of the system-wide RNA-Seq profiles, we classified transcription factors into positive and negative regulators and assigned a sign for each regulatory interaction. Other types of edges such as protein-protein interactions and potential intra-regulations between miRNAs based on the embedding of miRNAs in their host genes were further incorporated. We examined the topological structures of the network, including its hierarchical organization and motif enrichment. We found that transcription factors downstream of the hierarchy distinguish themselves by expressing more uniformly at various tissues, have more interacting partners, and are more likely to be essential. We found an over-representation of notable network motifs, including a FFL in which a miRNA cost-effectively shuts down a transcription factor and its target. We used data of C. elegans from the modENCODE project as a primary model to illustrate our framework, but further verified the results using other two data sets. As more and more genome-wide ChIP-Seq and RNA-Seq data becomes available in the near future, our methods of data integration have various potential applications.

[1]  A. Yoo,et al.  LIN-12/Notch Activation Leads to MicroRNA-Mediated Down-Regulation of Vav in C. elegans , 2005, Science.

[2]  R. Russell,et al.  Principles of MicroRNA–Target Recognition , 2005, PLoS biology.

[3]  Kimberly Van Auken,et al.  WormBase: a comprehensive resource for nematode research , 2009, Nucleic Acids Res..

[4]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[5]  N. D. Clarke,et al.  Integration of External Signaling Pathways with the Core Transcriptional Network in Embryonic Stem Cells , 2008, Cell.

[6]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[7]  F. Slack,et al.  The time of appearance of the C. elegans let-7 microRNA is transcriptionally controlled utilizing a temporal regulatory element in its promoter. , 2003, Developmental biology.

[8]  A. Rougvie,et al.  Regulatory mutations of mir-48, a C. elegans let-7 family MicroRNA, cause developmental timing defects. , 2005, Developmental cell.

[9]  Job Harms,et al.  THE LANDSCAPE OF , 2010 .

[10]  A. van Oudenaarden,et al.  MicroRNA-mediated feedback and feedforward loops are recurrent network motifs in mammals. , 2007, Molecular cell.

[11]  D J Wolgemuth,et al.  Expression of the murine Hoxa4 gene requires both autoregulation and a conserved retinoic acid response element. , 1998, Development.

[12]  M. Moore From Birth to Death: The Complex Lives of Eukaryotic mRNAs , 2005, Science.

[13]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[14]  C. Burge,et al.  Most mammalian mRNAs are conserved targets of microRNAs. , 2008, Genome research.

[15]  Raymond K. Auerbach,et al.  Genome-Wide Identification of Binding Sites Defines Distinct Functions for Caenorhabditis elegans PHA-4/FOXA in Development and Environmental Response , 2010, PLoS genetics.

[16]  Hannah Lewis International outbreak of Salmonella Goldcoast infection in tourists returning from Majorca, September-October 2005: final summary. , 2005, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[17]  Yuval Kluger,et al.  Inter- and intra-combinatorial regulation by transcription factors and microRNAs , 2007, BMC Genomics.

[18]  M. Gerstein,et al.  Design principles of molecular networks revealed by global comparisons and composite motifs , 2006, Genome Biology.

[19]  Sergei Maslov,et al.  Computational architecture of the yeast regulatory network , 2005, Physical biology.

[20]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[21]  Zachary Pincus,et al.  Dynamic expression of small non-coding RNAs, including novel microRNAs and piRNAs/21U-RNAs, during Caenorhabditis elegans development , 2009, Genome Biology.

[22]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[23]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[24]  V. Ambros The functions of animal microRNAs , 2004, Nature.

[25]  M. Gerstein,et al.  Analysis of diverse regulatory networks in a hierarchical context shows consistent tendencies for collaboration in the middle levels , 2010, Proceedings of the National Academy of Sciences.

[26]  Andreas Wagner,et al.  Convergent evolution of gene circuits , 2003, Nature Genetics.

[27]  Matthew W. Hahn,et al.  The evolution of transcriptional regulation in eukaryotes. , 2003, Molecular biology and evolution.

[28]  Megan F. Cole,et al.  Core Transcriptional Regulatory Circuitry in Human Embryonic Stem Cells , 2005, Cell.

[29]  R. Russell,et al.  Animal MicroRNAs Confer Robustness to Gene Expression and Have a Significant Impact on 3′UTR Evolution , 2005, Cell.

[30]  Gos Micklem,et al.  Supporting Online Material Materials and Methods Figs. S1 to S50 Tables S1 to S18 References Identification of Functional Elements and Regulatory Circuits by Drosophila Modencode , 2022 .

[31]  Robert B. Russell,et al.  Principles of MicroRNATarget Recognition , 2005 .

[32]  Colin N. Dewey,et al.  A Genome-Wide Map of Conserved MicroRNA Targets in C. elegans , 2006, Current Biology.

[33]  S. Mangan,et al.  Article number: 2005.0006 , 2022 .

[34]  Thomas J. Begley,et al.  Global network analysis of phenotypic effects: Protein networks and toxicity modulation in Saccharomyces cerevisiae , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[35]  Sebastian Wernicke,et al.  FANMOD: a tool for fast network motif detection , 2006, Bioinform..

[36]  S. Mangan,et al.  Structure and function of the feed-forward loop network motif , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[37]  D. Zack,et al.  Computational analysis of tissue-specific combinatorial gene regulation: predicting interaction between transcription factors in human tissues , 2006, Nucleic acids research.

[38]  D. Bartel,et al.  Microarray profiling of microRNAs reveals frequent coexpression with neighboring miRNAs and host genes. , 2005, RNA.

[39]  K. Gunsalus,et al.  Combinatorial microRNA target predictions , 2005, Nature Genetics.

[40]  Edward C Stites,et al.  Network Analysis of Oncogenic Ras Activation in Cancer , 2007, Science.

[41]  F. Slack,et al.  RAS Is Regulated by the let-7 MicroRNA Family , 2005, Cell.

[42]  M S German,et al.  Autoregulation and Maturity Onset Diabetes of the Young Transcription Factors Control the Human PAX4 Promoter* , 2000, The Journal of Biological Chemistry.

[43]  Oliver Hobert,et al.  MicroRNAs acting in a double-negative feedback loop to control a neuronal cell fate decision. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Julie M. Sahalie,et al.  An experimentally derived confidence score for binary protein-protein interactions , 2008, Nature Methods.

[45]  Raymond K. Auerbach,et al.  PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls , 2009, Nature Biotechnology.

[46]  Alessandro Fatica,et al.  A Minicircuitry Comprised of MicroRNA-223 and Transcription Factors NFI-A and C/EBPα Regulates Human Granulopoiesis , 2005, Cell.

[47]  Lynn Doucette-Stamm,et al.  A C . elegans genome-scale microRNA network contains composite feedback motifs with high flux capacity , 2008 .

[48]  N. Rajewsky,et al.  The evolution of gene regulation by transcription factors and microRNAs , 2007, Nature Reviews Genetics.

[49]  D. Moerman,et al.  Towards a mutation in every gene in Caenorhabditis elegans. , 2008, Briefings in functional genomics & proteomics.

[50]  Nicholas T. Ingolia,et al.  Mammalian microRNAs predominantly act to decrease target mRNA levels , 2010, Nature.

[51]  Sebastian D. Mackowiak,et al.  The Landscape of C. elegans 3′UTRs , 2010, Science.

[52]  R. Tjian,et al.  Transcription regulation and animal diversity , 2003, Nature.

[53]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[54]  U. Alon Network motifs: theory and experimental approaches , 2007, Nature Reviews Genetics.

[55]  M. Gerstein,et al.  Genomic analysis of the hierarchical structure of regulatory networks , 2006, Proceedings of the National Academy of Sciences.

[56]  Mark Gerstein,et al.  Comparing genomes to computer operating systems in terms of the topology and evolution of their regulatory control networks , 2010, Proceedings of the National Academy of Sciences.

[57]  Ariel S. Schwartz,et al.  An Atlas of Combinatorial Transcriptional Regulation in Mouse and Man , 2010, Cell.

[58]  S. Aota,et al.  Pax6 autoregulation mediated by direct interaction of Pax6 protein with the head surface ectoderm-specific enhancer of the mouse Pax6 gene. , 2003, Developmental biology.

[59]  Emmitt R. Jolly,et al.  Inference of combinatorial regulation in yeast transcriptional networks: a case study of sporulation. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[60]  Mark Gerstein,et al.  Analysis of Combinatorial Regulation: Scaling of Partnerships between Regulators with the Number of Governed Targets , 2010, PLoS Comput. Biol..

[61]  U. Alon,et al.  Negative autoregulation speeds the response times of transcription networks. , 2002, Journal of molecular biology.

[62]  Wen-Hsiung Li,et al.  MicroRNA regulation of human protein protein interaction network. , 2007, RNA.

[63]  Sebastian Wernicke,et al.  Efficient Detection of Network Motifs , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[64]  A. Dinner,et al.  Signatures of combinatorial regulation in intrinsic biological noise , 2008, Proceedings of the National Academy of Sciences.

[65]  P. Green,et al.  Massively parallel sequencing of the polyadenylated transcriptome of C. elegans. , 2009, Genome research.

[66]  N. Rajewsky microRNA target predictions in animals , 2006, Nature Genetics.

[67]  Ryoichiro Kageyama,et al.  Id sustains Hes1 expression to inhibit precocious neurogenesis by releasing negative autoregulation of Hes1. , 2007, Developmental cell.

[68]  D. Zack,et al.  Analysis of regulatory network topology reveals functionally distinct classes of microRNAs , 2008, Nucleic acids research.

[69]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[70]  Kevin Y. Yip,et al.  A statistical framework for modeling gene expression using chromatin features and application to modENCODE datasets , 2011, Genome Biology.

[71]  Dustin E. Schones,et al.  Chromatin poises miRNA- and protein-coding genes for expression. , 2009, Genome research.

[72]  Stijn van Dongen,et al.  miRBase: tools for microRNA genomics , 2007, Nucleic Acids Res..

[73]  Yitzhak Pilpel,et al.  Global and Local Architecture of the Mammalian microRNA–Transcription Factor Regulatory Network , 2007, PLoS Comput. Biol..

[74]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[75]  Raymond K. Auerbach,et al.  Integrative Analysis of the Caenorhabditis elegans Genome by the modENCODE Project , 2010, Science.

[76]  S. Mangan,et al.  The coherent feedforward loop serves as a sign-sensitive delay element in transcription networks. , 2003, Journal of molecular biology.

[77]  M. Gerstein,et al.  Unlocking the secrets of the genome , 2009, Nature.

[78]  O. Hobert Common logic of transcription factor and microRNA action. , 2004, Trends in biochemical sciences.