Visualization of Bioinformatics Workflows for Ease of Understanding and Design Activities

Bioinformatics analyses are growing in size and complexity. They are often described as workflows, with the workflow specifications also becoming more complex due to the diversity of data, tools, and computational resources involved. A number of workflow management systems (WMS) have been developed recently to help bioinformaticians in their workflow design activities. Many of these WMS visualize workflows as graphs, where the nodes are analysis steps and the edges are interactions and constraints between analysis steps. These graphs usually represent a data flow of the analysis. We know that in software visualization, similar graphs are used to show a data flow in software systems. However, the WMS do not use any widely accepted standards for workflow visualization, particularly not in the bioinformatics domain. As a result, workflows are visualized in different ways in different WMS and workflows describing the same analysis look different in different WMS. Furthermore, the visualization techniques used in WMS for bioinformatics are quite limited. Here, we argue that applying some of the visual analytics methods and techniques used in software field, such as UML (unified modelling language) diagrams combined with quality metrics, can help to enhance understanding and sharing of the workflow, and ease workflow analysis and design activities.

[1]  C.J.H. Mann,et al.  Object-Oriented Metrics in Practice: Using Software Metrics to Characterize, Evaluate, and Improve the Design of Object-Oriented Systems , 2007 .

[2]  Morris A. Swertz,et al.  Beyond standardization: dynamic software infrastructures for systems biology , 2007, Nature Reviews Genetics.

[3]  Daniel J. Blankenberg,et al.  A framework for collaborative analysis of ENCODE data: making large-scale analyses biologist-friendly. , 2007, Genome research.

[4]  Morris A. Swertz,et al.  The MOLGENIS toolkit: rapid prototyping of biosoftware at the push of a button , 2010, BMC Bioinformatics.

[5]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[6]  Alfred Inselberg,et al.  Parallel Coordinates: Visual Multidimensional Geometry and Its Applications , 2003, KDIR.

[7]  Alexandru Telea,et al.  The Metric Lens: Visualizing Metrics and Structure on Software Diagrams , 2008, 2008 15th Working Conference on Reverse Engineering.

[8]  P. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 1999 .

[9]  Alexandru Telea,et al.  To appear in an IEEE VGTC sponsored conference proceedings Visualizing Metrics on Areas of Interest in Software Architecture Diagrams , 2022 .

[10]  Ganesh S. Oak Information Visualization Introduction , 2022 .

[11]  Robert Spence,et al.  Information Visualization: Design for Interaction (2nd Edition) , 2007 .

[12]  Stephan Diehl,et al.  Software Visualization - Visualizing the Structure, Behaviour, and Evolution of Software , 2007 .

[13]  Robert Spence,et al.  Information Visualization: Design for Interaction (2nd Edition) , 2006 .

[14]  Edward A. Lee,et al.  CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE Concurrency Computat.: Pract. Exper. 2000; 00:1–7 Prepared using cpeauth.cls [Version: 2002/09/19 v2.02] Taverna: Lessons in creating , 2022 .

[15]  Herman H. Goldstine,et al.  Planning and coding of problems for an Electronic Computing Instrument , 1947 .

[16]  Morris A. Swertz,et al.  Towards a MOLGENIS Based Computational Framework , 2011, 2011 19th International Euromicro Conference on Parallel, Distributed and Network-Based Processing.

[17]  Stéphane Ducasse,et al.  Understanding software evolution using a combination of software visualization and software metrics , 2002, Obj. Logiciel Base données Réseaux.

[18]  D. Rosenberg Cartographies of Time: A History of the Timeline , 2010 .

[19]  Morris A. Swertz,et al.  Introducing Data Provenance and Error Handling for NGS Workflows within the MOLGENIS Computational Framework , 2012, BIOINFORMATICS.

[20]  Michele Lanza,et al.  Object-Oriented Metrics in Practice - Using Software Metrics to Characterize, Evaluate, and Improve the Design of Object-Oriented Systems , 2006 .

[21]  Alexandru Telea,et al.  Visual Exploration of Combined Architectural and Metric Information , 2005, 3rd IEEE International Workshop on Visualizing Software for Understanding and Analysis.

[22]  Ramana Rao,et al.  The table lens: merging graphical and symbolic representations in an interactive focus + context visualization for tabular information , 1994, CHI '94.