Topological structure analysis of chromatin interaction networks

Current Hi-C technologies for chromosome conformation capture allow to understand a broad spectrum of functional interactions between genome elements. Although significant progress has been made into analysis of Hi-C data to identify biologically significant features, many questions still remain open, in particular regarding potential biological significance of various topological features that are characteristic for chromatin interaction networks. It has been previously observed that promoter capture Hi-C (PCHi-C) interaction networks tend to separate easily into well-defined connected components that can be related to certain biological functionality, however, such evidence was based on manual analysis and was limited. Here we present a novel method for analysis of chromatin interaction networks aimed towards identifying characteristic topological features of interaction graphs and confirming their potential significance in chromatin architecture. Our method automatically identifies all connected components with an assigned significance score above a given threshold. These components can be subjected afterwards to different assessment methods for their biological role and/or significance. The method was applied to the largest PCHi-C data set available to date that contains interactions for 17 haematopoietic cell types. The results demonstrate strong evidence of well-pronounced component structure of chromatin interaction networks and provide some characterisation of this component structure. We also performed an indicative assessment of potential biological significance of identified network components with the results confirming that the network components can be related to specific biological functionality. The obtained results show that the topological structure of chromatin interaction networks can be well described in terms of isolated connected components of the network and that formation of these components can be often explained by biological features of functionally related gene modules. The presented method allows automatic identification of all such components and evaluation of their significance in PCHi-C dataset for 17 haematopoietic cell types. The method can be adapted for exploration of other chromatin interaction data sets that include information about sufficiently large number of different cell types, and, in principle, also for analysis of other kinds of cell type-specific networks.

[1]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[2]  Anthony D. Schmitt,et al.  Genome-wide mapping and analysis of chromosome architecture , 2016, Nature Reviews Molecular Cell Biology.

[3]  Houda Belaghzal,et al.  Hi-C 2.0: An Optimized Hi-C Procedure for High-Resolution Genome-Wide Mapping of Chromosome Conformation , 2016, bioRxiv.

[4]  Geir Kjetil Sandve,et al.  In the loop: promoter–enhancer interactions and bioinformatics , 2015, Briefings Bioinform..

[5]  Tanya M. Teslovich,et al.  The Influence of Age and Sex on Genetic Associations with Adult Body Size and Shape: A Large-Scale Genome-Wide Interaction Study , 2015, PLoS Genetics.

[6]  Timothy J. Durham,et al.  Systematic analysis of chromatin state dynamics in nine human cell types , 2011, Nature.

[7]  Piero Carninci,et al.  CAGE (cap analysis of gene expression): a protocol for the detection of promoter and transcriptional networks. , 2012, Methods in molecular biology.

[8]  R. David Hawkins,et al.  Three-dimensional genome architecture and emerging technologies: looping in disease , 2017, Genome Medicine.

[9]  angesichts der Corona-Pandemie,et al.  UPDATE , 1973, The Lancet.

[10]  Michael P Snyder,et al.  Static and dynamic DNA loops form AP-1 bound activation hubs during macrophage development , 2017, bioRxiv.

[11]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[12]  Sushmita Roy,et al.  A multi-task graph-clustering approach for chromosome conformation capture data sets identifies conserved modules of chromosomal interactions , 2016, Genome Biology.

[13]  A. Stark,et al.  Assessing sufficiency and necessity of enhancer activities for gene expression and the mechanisms of transcription activation , 2018, Genes & development.

[14]  Manolis Kellis,et al.  Chromatin-state discovery and genome annotation with ChromHMM , 2017, Nature Protocols.

[15]  Ieuan Clay,et al.  The transcriptional interactome: gene expression in 3D. , 2010, Current opinion in genetics & development.

[16]  Daniel Doerr,et al.  GraphTeams: a method for discovering spatial gene clusters in Hi-C sequencing data , 2018, BMC Genomics.

[17]  S. Bicciato,et al.  Comparison of computational methods for Hi-C data analysis , 2017, Nature Methods.

[18]  Natasa Przulj,et al.  Network analytics in the age of big data , 2016, Science.

[19]  Steven J. M. Jones,et al.  The International Human Epigenome Consortium: A Blueprint for Scientific Collaboration and Discovery , 2016, Cell.

[20]  A. Ashworth,et al.  Unbiased analysis of potential targets of breast cancer susceptibility loci by Capture Hi-C , 2014, Genome research.

[21]  Jonathan M. Cairns,et al.  CHiCAGO: robust detection of DNA looping interactions in Capture Hi-C data , 2015, Genome Biology.

[22]  Noam Kaplan,et al.  The Hitchhiker's guide to Hi-C analysis: practical guidelines. , 2015, Methods.

[23]  J. Dekker,et al.  Capturing Chromosome Conformation , 2002, Science.

[24]  Edgars Celms,et al.  Graph-based Characterisations of Cell Types and Functionally Related Modules in Promoter Capture Hi-C Data , 2019, BIOINFORMATICS.

[25]  Derek W Wright,et al.  Gateways to the FANTOM5 promoter level mammalian expression atlas , 2015, Genome Biology.

[26]  Ivo L. Hofacker,et al.  AREsite2: an enhanced database for the comprehensive investigation of AU/GU/U-rich elements , 2015, Nucleic Acids Res..

[27]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[28]  Nadav Ahituv,et al.  Minor Loops in Major Folds: Enhancer–Promoter Looping, Chromatin Restructuring, and Their Association with Transcriptional Regulation and Disease , 2015, PLoS genetics.

[29]  Avi Ma'ayan,et al.  Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool , 2013, BMC Bioinformatics.

[30]  Rosela Golloshi,et al.  Iteratively improving Hi-C experiments one step at a time. , 2018, Methods.

[31]  Jonathan M. Cairns,et al.  Lineage-Specific Genome Architecture Links Enhancers and Non-coding Disease Variants to Target Gene Promoters , 2016, Cell.

[32]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[33]  Tijana Milenkovic,et al.  Rebuttal to the Letter to the Editor in response to the paper: proper evaluation of alignment‐free network comparison methods , 2017, Bioinform..

[34]  D. Ucar,et al.  Chromatin interaction networks revealed unique connectivity patterns of broad H3K4me3 domains and super enhancers in 3D chromatin , 2017, Scientific Reports.

[35]  Timothy J. Durham,et al.  "Systematic" , 1966, Comput. J..

[36]  Andrew D. Rouillard,et al.  Enrichr: a comprehensive gene set enrichment analysis web server 2016 update , 2016, Nucleic Acids Res..

[37]  Philip A. Ewels,et al.  Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C , 2015, Nature Genetics.

[38]  Deborah Chasman,et al.  Inference of cell type specific regulatory networks on mammalian lineages. , 2017, Current opinion in systems biology.

[39]  A. Stark,et al.  Transcriptional enhancers: from properties to genome-wide predictions , 2014, Nature Reviews Genetics.

[40]  Nataša Pržulj,et al.  Graphlet-based Characterization of Directed Networks , 2016, Scientific Reports.