Iterative Design and Evaluation of Regulatory Network Visualisation at Scale

Over the last decade, the development of a range of Next Generation Sequencing (NGS) technologies has led to an enormous increase in the size of the data sets available in molecular biology. The scale of these data presents new challenges for researchers, and visualisation is widely regarded as an essential tool for exploration and detailed analysis of candidate relationships. Inevitably, there are cognitive and technical limits on the information which may usefully be displayed on a particular device, and there may be some tension between the analytical utility of a representation and its coverage of the relationships available within the data. Careful attention must be given to the overall design of the visualisation, and to the channels selected, and these tasks are further complicated if the intent is to support interactive exploration by a number of collocated researchers or inclusion within a collaborative workflow. This paper is concerned with the design of a visualisation for regulatory interactions in bacteria, the complex relationships that exist between a set of proteins and the much larger set of genes whose action they control. Modelling these interactions yields equally complex network diagrams, and even classical hairball representations when visualised. In this work we explore the iterative refinement of an alternative visualisation for data of this kind, moving away from the traditional hairball to a 'field' of smaller structures, the intent being to support effective comparison across many dozens of strains and species rather than the exhaustive documentation of a full set of interactions for the one organism. While the study did not directly compare insights obtained using TRNDiff with those obtained using other tools, formal evaluations have allowed us to settle on an effective set of representations and visual channels, and interactive features to support analysis. Our approach has produced a far more effective visualisation of these important data sets, and offers useful lessons for tool developers and insights into the utility of touch devices and larger displays for visual analytics and generation of insight at scale.

[1]  Xin-Yi Chua,et al.  Prediction of transcriptional regulatory interactions in bacteria : a comparative genomics approach , 2012 .

[2]  Samuel S. Silva,et al.  Using color in visualization: A survey , 2011, Comput. Graph..

[3]  William J. R. Longabaugh,et al.  Combing the hairball with BioFabric: a new approach for visualization of large networks , 2012, BMC Bioinformatics.

[4]  Sarala M. Wimalaratne,et al.  The Systems Biology Graphical Notation , 2009, Nature Biotechnology.

[5]  Christopher Andrews,et al.  Information visualization on large, high-resolution displays: Issues, challenges, and opportunities , 2011, Inf. Vis..

[6]  Matthew A. Hibbs,et al.  Visualization of omics data for systems biology , 2010, Nature Methods.

[7]  Chris North,et al.  Visualizing Biological Pathways: Requirements Analysis, Systems Evaluation and Research Agenda , 2005, Inf. Vis..

[8]  Elaine R. Mardis,et al.  A decade’s perspective on DNA sequencing technology , 2011, Nature.

[9]  Lawrence Buckingham,et al.  Large Scale Comparative Visualisation of Regulatory Networks with TRNDiff , 2015, ICCS.

[10]  Donald P. Greenberg,et al.  Color-defective vision and computer graphics displays , 1988, IEEE Computer Graphics and Applications.

[11]  Debra Polson,et al.  The cube: a very large-scale interactive engagement space , 2013, ITS.

[12]  Inanç Birol,et al.  Hive plots - rational approach to visualizing networks , 2012, Briefings Bioinform..

[13]  Monique W. M. Jaspers,et al.  The think aloud method: a guide to user interface design , 2004, Int. J. Medical Informatics.

[14]  Chris North,et al.  Toward measuring visualization insight , 2006, IEEE Computer Graphics and Applications.

[15]  Michael Garland,et al.  On the Visualization of Social and other Scale-Free Networks , 2008, IEEE Transactions on Visualization and Computer Graphics.

[16]  William J. Riehl,et al.  RegPrecise 3.0 – A resource for genome-scale exploration of transcriptional regulation in bacteria , 2013, BMC Genomics.

[17]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[18]  Chris North,et al.  An insight-based methodology for evaluating bioinformatics visualizations , 2005, IEEE Transactions on Visualization and Computer Graphics.

[19]  Jock D. Mackinlay,et al.  Automating the design of graphical presentations of relational information , 1986, TOGS.

[20]  Chris North,et al.  A comparison of benchmark task and insight evaluation methods for information visualization , 2011, Inf. Vis..