FIREcaller: Detecting frequently interacting regions from Hi-C data

Hi-C experiments have been widely adopted to study chromatin spatial organization, which plays an essential role in genome function. We have recently identified frequently interacting regions (FIREs) and found that they are closely associated with cell-type-specific gene regulation. However, computational tools for detecting FIREs from Hi-C data are still lacking. In this work, we present FIREcaller, a stand-alone, user-friendly R package for detecting FIREs from Hi-C data. FIREcaller takes raw Hi-C contact matrices as input, performs within-sample and cross-sample normalization, and outputs continuous FIRE scores, dichotomous FIREs, and super-FIREs. Applying FIREcaller to Hi-C data from various human tissues, we demonstrate that FIREs and super-FIREs, identified in a tissue-specific manner, are closely related to gene regulation, are enriched for enhancer-promoter (E-P) interactions, tend to overlap with regions exhibiting epigenomic signatures of cis-regulatory roles, and aid the interpretation of GWAS variants. The FIREcaller package is implemented in R and is freely available at https://yunliweb.its.unc.edu/FIREcaller.

Highlights
- Frequently interacting regions (FIREs) can be used to identify tissue-specific and cell-type-specific cis-regulatory regions.
- An R package, FIREcaller, has been developed to identify FIREs and to cluster FIREs into super-FIREs.
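
The flow described in the abstract (raw contact matrix in, normalization, then continuous scores, dichotomous calls, and clusters out) can be illustrated with a small sketch. This is not FIREcaller's code or API: the package is written in R, while the sketch below is plain Python, and every function name, the z-score transformation, the one-sided normal cutoff, and the simple merging of adjacent bins are assumptions made for illustration. The within-sample bias correction is omitted entirely, and the ranking step that would single out super-FIREs among the merged clusters is not shown.

```python
from statistics import NormalDist, mean, pstdev


def local_cis_counts(contacts, bin_size, max_dist):
    """Sum each bin's contacts with other bins on the same chromosome
    that lie within max_dist base pairs (the bin itself is excluded)."""
    n = len(contacts)
    k = max_dist // bin_size  # window half-width, in bins
    counts = []
    for i in range(n):
        lo, hi = max(0, i - k), min(n - 1, i + k)
        counts.append(sum(contacts[i][j] for j in range(lo, hi + 1) if j != i))
    return counts


def quantile_normalize(samples):
    """Cross-sample step (assumed here to be quantile normalization):
    each sample's r-th smallest value is replaced by the mean of the
    r-th smallest values across all samples."""
    n_bins = len(samples[0])
    sorted_cols = [sorted(s) for s in samples]
    rank_means = [mean(col[r] for col in sorted_cols) for r in range(n_bins)]
    normalized = []
    for s in samples:
        order = sorted(range(n_bins), key=lambda i: s[i])
        out = [0.0] * n_bins
        for rank, i in enumerate(order):
            out[i] = rank_means[rank]
        normalized.append(out)
    return normalized


def z_scores(values):
    """Continuous score per bin (assumed form: a plain z-score)."""
    mu, sd = mean(values), pstdev(values)
    return [(v - mu) / sd for v in values]


def call_fires(z, alpha=0.05):
    """Dichotomous call: bins whose score exceeds a one-sided normal cutoff."""
    cutoff = NormalDist().inv_cdf(1 - alpha)
    return [score > cutoff for score in z]


def merge_adjacent(is_fire):
    """Merge runs of consecutive called bins into inclusive (start, end)
    index pairs; these are only candidate clusters, not ranked super-FIREs."""
    runs, start = [], None
    for i, flag in enumerate(is_fire):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            runs.append((start, i - 1))
            start = None
    if start is not None:
        runs.append((start, len(is_fire) - 1))
    return runs
```

Under these assumptions, a caller would compute `local_cis_counts` for each sample's matrix, pass the resulting per-sample lists to `quantile_normalize`, and then apply `z_scores`, `call_fires`, and `merge_adjacent` to each normalized sample. The bin size, distance window, and significance level are left as arguments because the abstract does not state them.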
