HiCeekR: A Novel Shiny App for Hi-C Data Analysis

The High-throughput Chromosome Conformation Capture (Hi-C) technique combines the power of the Next Generation Sequencing technologies with chromosome conformation capture approach to study the 3D chromatin organization at the genome-wide scale. Although such a technique is quite recent, many tools are already available for pre-processing and analyzing Hi-C data, allowing to identify chromatin loops, topological associating domains and A/B compartments. However, only a few of them provide an exhaustive analysis pipeline or allow to easily integrate and visualize other omic layers. Moreover, most of the available tools are designed for expert users, who have great confidence with command-line applications. In this paper, we present HiCeekR (https://github.com/lucidif/HiCeekR), a novel R Graphical User Interface (GUI) that allows researchers to easily perform a complete Hi-C data analysis. With the aid of the Shiny libraries, it integrates several R/Bioconductor packages for Hi-C data analysis and visualization, guiding the user during the entire process. Here, we describe its architecture and functionalities, then illustrate its capabilities using a publicly available dataset.

[1]  C. Nusbaum,et al.  Chromosome Conformation Capture Carbon Copy (5C): a massively parallel solution for mapping interactions between genomic elements. , 2006, Genome research.

[2]  Steven L. Brody,et al.  Modulation of Th1 Activation and Inflammation by the NF-κB Repressor Foxj1 , 2004, Science.

[3]  Céline Lévy-Leduc,et al.  Two-dimensional segmentation for analyzing Hi-C data , 2014, Bioinform..

[4]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[5]  Fidel Ramírez,et al.  Galaxy HiCExplorer: a web server for reproducible Hi-C data analysis, quality control and visualization , 2018, Nucleic Acids Res..

[6]  Mattia Forcato,et al.  Computational methods for analyzing genome-wide chromosome conformation capture data. , 2018, Current opinion in biotechnology.

[7]  Pietro Liò,et al.  NuChart: An R Package to Study Gene Spatial Neighbourhoods with Multi-Omics Annotations , 2013, PloS one.

[8]  Francesco Russo,et al.  Advancements in RNASeqGUI towards a Reproducible Analysis of RNA-Seq Experiments , 2016, BioMed research international.

[9]  Philip A. Ewels,et al.  HiCUP: pipeline for mapping and processing Hi-C data , 2015, F1000Research.

[10]  Ivan Merelli,et al.  NuChart-II: The road to a fast and scalable tool for Hi-C data analysis , 2017, Int. J. High Perform. Comput. Appl..

[11]  William Stafford Noble,et al.  Analysis methods for studying the 3D architecture of the genome , 2015, Genome Biology.

[12]  L. Mirny,et al.  Iterative Correction of Hi-C Data Reveals Hallmarks of Chromosome Organization , 2012, Nature Methods.

[13]  L. Milanesi,et al.  Integrating multi-omic features exploiting Chromosome Conformation Capture data , 2015, Front. Genet..

[14]  Cathy Lundmark,et al.  A Hitchhiker's Guide to… , 2005 .

[15]  X. Zhou,et al.  TopDom: an efficient and deterministic method for identifying topological domains in genomes , 2015, Nucleic acids research.

[16]  P. Bayrak-Toydemir,et al.  Hereditary hemorrhagic telangiectasia: genetics and molecular diagnostics in a new era , 2015, Front. Genet..

[17]  Sheng Zhong,et al.  GITAR: An Open Source Tool for Analysis and Visualization of Hi-C Data , 2018, bioRxiv.

[18]  L. Mirny,et al.  Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data , 2013, Nature Reviews Genetics.

[19]  Anna Tonazzini,et al.  ChromStruct 4: A Python Code to Estimate the Chromatin Structure from Hi-C Data , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[20]  Yoli Shavit,et al.  Combining a wavelet change point and the Bayes factor for analysing chromosomal interaction data. , 2014, Molecular bioSystems.

[21]  Koustav Pal,et al.  Hi-C analysis: from data generation to integration , 2018, Biophysical Reviews.

[22]  Marc W. Schmid,et al.  HiCdat: a fast and easy-to-use Hi-C data analysis tool , 2015, BMC Bioinformatics.

[23]  G. Ciriello,et al.  Comparison of computational methods for the identification of topologically associating domains , 2018, Genome Biology.

[24]  Judith B. Zaugg,et al.  Genetic Control of Chromatin States in Humans Involves Local and Distal Chromosomal Interactions , 2015, Cell.

[25]  Noam Kaplan,et al.  The Hitchhiker's guide to Hi-C analysis: practical guidelines. , 2015, Methods.

[26]  Thomas Cremer,et al.  Spatial preservation of nuclear chromatin architecture during three-dimensional fluorescence in situ hybridization (3D-FISH). , 2002, Experimental cell research.

[27]  Stefan Schoenfelder,et al.  Capture Hi-C reveals novel candidate genes and complex long-range interactions with related autoimmune risk loci , 2015, Nature Communications.

[28]  Ming Hu,et al.  HiCNorm: removing biases in Hi-C data via Poisson regression , 2012, Bioinform..

[29]  A. Tanay,et al.  Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture , 2011, Nature Genetics.

[30]  A. Suzuki,et al.  Affixin interacts with α-actinin and mediates integrin signaling for reorganization of F-actin induced by initial cell–substrate interaction , 2004, The Journal of cell biology.

[31]  J. Vilo,et al.  g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update) , 2019, Nucleic Acids Res..

[32]  A. Kallies,et al.  Transcription Factor T-bet Orchestrates Lineage Development and Function in the Immune System. , 2017, Trends in immunology.

[33]  Jesse R. Dixon,et al.  Topological Domains in Mammalian Genomes Identified by Analysis of Chromatin Interactions , 2012, Nature.

[34]  K. Sandhu,et al.  Circular chromosome conformation capture (4C) uncovers extensive networks of epigenetically regulated intra- and interchromosomal interactions , 2006, Nature Genetics.

[35]  Daniel Ruiz,et al.  A Fast Algorithm for Matrix Balancing , 2013, Web Information Retrieval and Linear Algebra Algorithms.

[36]  Aaron T. L. Lun,et al.  diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data , 2015, BMC Bioinformatics.

[37]  Aristotelis Tsirigos,et al.  HiC-bench: comprehensive and reproducible Hi-C data analysis designed for parameter exploration and benchmarking , 2016, BMC Genomics.

[38]  Claudia Angelini,et al.  RNASeqGUI: a GUI for analysing RNA-Seq data , 2014, Bioinform..

[39]  David J. Chen,et al.  Complex Formation by the Human Rad51B and Rad51C DNA Repair Proteins and Their Activities in Vitro * , 2002, The Journal of Biological Chemistry.

[40]  Claudia Angelini,et al.  Advantages and Limits in the Adoption of Reproducible Research and R-Tools for the Analysis of Omic Data , 2015, CIBB.

[41]  Emmanuel Barillot,et al.  HiTC - Exploration of High Throughput ’C’ experiments , 2013 .

[42]  J. Dekker,et al.  Capturing Chromosome Conformation , 2002, Science.

[43]  William Stafford Noble,et al.  Software tools for visualizing Hi-C data , 2016, Genome Biology.

[44]  S. Bicciato,et al.  Comparison of computational methods for Hi-C data analysis , 2017, Nature Methods.

[45]  J. Dekker,et al.  Hi-C: a comprehensive technique to capture the conformation of genomes. , 2012, Methods.

[46]  Zhijun Han,et al.  Computational tools for Hi-C data analysis , 2017, Quantitative Biology.

[47]  Jean-Philippe Vert,et al.  HiC-Pro: an optimized and flexible pipeline for Hi-C data processing , 2015, Genome Biology.