HiPiler: Visual Exploration of Large Genome Interaction Matrices with Interactive Small Multiples

This paper presents an interactive visualization interface—HiPiler—for the exploration and visualization of regions-of-interest in large genome interaction matrices. Genome interaction matrices approximate the physical distance of pairs of genomic regions to each other and can contain up to 3 million rows and columns with many sparse regions. Traditional matrix aggregation or pan-and-zoom interfaces largely fail in supporting search, inspection, and comparison of local regions-of-interest (ROIs). ROIs can be defined, e.g., by sets of adjacent rows and columns, or by specific visual patterns in the matrix. ROIs are first-class objects in HiPiler, which represents them as thumbnail-like “snippets”. Snippets can be laid out automatically based on their data and meta attributes. They are linked back to the matrix and can be explored interactively. The design of HiPiler is based on a series of semi-structured interviews with 10 domain experts involved in the analysis and interpretation of genome interaction matrices. We describe six exploration tasks that are crucial for analysis of interaction matrices and demonstrate how HiPiler supports these tasks. We report on a user study with a series of data exploration sessions with domain experts to assess the usability of HiPiler as well as to demonstrate respective findings in the data.

[1]  Thomas Zichner,et al.  DELLY: structural variant discovery by integrated paired-end and split-read analysis , 2012, Bioinform..

[2]  Jean-Daniel Fekete,et al.  Melange: space folding for multi-focus interaction , 2008, CHI.

[3]  Jean-Daniel Fekete,et al.  Magnostics: Image-Based Search of Interesting Matrix Views for Guided Network Exploration , 2017, IEEE Transactions on Visualization and Computer Graphics.

[4]  W. Marsden I and J , 2012 .

[5]  S. Bicciato,et al.  Comparison of computational methods for Hi-C data analysis , 2017, Nature Methods.

[6]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[7]  Jean-Daniel Fekete,et al.  NodeTrix: a Hybrid Visualization of Social Networks , 2007, IEEE Transactions on Visualization and Computer Graphics.

[8]  Peter J. Park,et al.  HiGlass: Web-based visual comparison and exploration of genome interaction maps , 2017 .

[9]  Job Dekker,et al.  Two ways to fold the genome during the cell cycle: insights obtained with chromosome conformation capture , 2014, Epigenetics & Chromatin.

[10]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[11]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[12]  M. Rosenfeld,et al.  Enhancers as non-coding RNA transcription units: recent insights and future perspectives , 2016, Nature Reviews Genetics.

[13]  Darawalee Wangsa,et al.  Nucleome Analysis Reveals Structure–Function Relationships for Colon Cancer , 2017, Molecular Cancer Research.

[14]  Arjan Kuijper,et al.  Visual Analysis of Large Graphs: State‐of‐the‐Art and Future Research Challenges , 2011, Eurographics.

[15]  K. Tan,et al.  Global view of enhancer–promoter interactome in human cells , 2014, Proceedings of the National Academy of Sciences.

[16]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[17]  Lee E. Edsall,et al.  A map of the cis-regulatory sequences in the mouse genome , 2012, Nature.

[18]  Falk Schreiber,et al.  MAVisto: a tool for the exploration of network motifs , 2005, Bioinform..

[19]  Jean-Daniel Fekete,et al.  Matrix Reordering Methods for Table and Network Visualization , 2016, Comput. Graph. Forum.

[20]  Teuvo Kohonen,et al.  The self-organizing map , 1990, Neurocomputing.

[21]  Ann Dean,et al.  Enhancer and promoter interactions-long distance calls. , 2012, Current opinion in genetics & development.

[22]  William Stafford Noble,et al.  Software tools for visualizing Hi-C data , 2017, Genome Biology.

[23]  Jarke J. van Wijk,et al.  Compressed Adjacency Matrices: Untangling Gene Regulatory Networks , 2012, IEEE Transactions on Visualization and Computer Graphics.

[24]  Ben Shneiderman,et al.  Motif simplification: improving network visualization readability with fan, connector, and clique glyphs , 2013, CHI.

[25]  Lovelace J. Luquette,et al.  Diverse Mechanisms of Somatic Structural Variations in Human Cancer Genomes , 2013, Cell.

[26]  Pierre Dragicevic,et al.  Time Curves: Folding Time to Visualize Patterns of Temporal Evolution in Data , 2016, IEEE Transactions on Visualization and Computer Graphics.

[27]  L. Mirny,et al.  Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data , 2013, Nature Reviews Genetics.

[28]  J. Dekker,et al.  The hierarchy of the 3D genome. , 2013, Molecular cell.

[29]  Tobias Schreck,et al.  Visual analysis of graphs with multiple connected components , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.

[30]  J. Dekker,et al.  Capturing Chromosome Conformation , 2002, Science.

[31]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[32]  A. Visel,et al.  Disruptions of Topological Chromatin Domains Cause Pathogenic Rewiring of Gene-Enhancer Interactions , 2015, Cell.

[33]  Stuart K. Card,et al.  The effect of information scent on searching information: visualizations of large tree structures , 2000, AVI '00.

[34]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[35]  Jean-Daniel Fekete,et al.  ZAME: Interactive Large-Scale Graph Visualization , 2008, 2008 IEEE Pacific Visualization Symposium.

[36]  P. Fraser,et al.  Nuclear organization of the genome and the potential for gene regulation , 2007, Nature.

[37]  Jean-Daniel Fekete,et al.  Visualizing dynamic networks with matrix cubes , 2014, CHI.

[38]  Jean-Daniel Fekete,et al.  Small MultiPiles: Piling Time to Explore Temporal Patterns in Dynamic Networks , 2015, Comput. Graph. Forum.

[39]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[40]  Daniel S. Day,et al.  Activation of proto-oncogenes by disruption of chromosome neighborhoods , 2015, Science.

[41]  James T. Robinson,et al.  Juicebox Provides a Visualization System for Hi-C Contact Maps with Unlimited Zoom. , 2016, Cell systems.

[42]  Josée Dostie,et al.  An Overview of Genome Organization and How We Got There: from FISH to Hi-C , 2015, Microbiology and Molecular Reviews.

[43]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[44]  Alexander Lex,et al.  From Visual Exploration to Storytelling and Back Again , 2016, bioRxiv.