Visualization of Biological Pathways : An Ethnographic Study and Systems Evaluation

Life scientists use pathway diagrams to represent complex biochemical interactions at the molecular level. Although several visualization software systems have been developed to assist them in analyzing pathways, relatively few scientists use them extensively, and those that do rarely exploit their full capabilities. In collaboration with life scientists, we found that many are reluctant to use these systems due to the steep learning curve and amount of effort required to construct biologically relevant pathways. They find minimal value in a system that provides only simple visual or dynamic pictures, without providing adequate means to manipulate pathways in terms of analytic requirements. This work reports the results of ethnographic field studies conducted to understand life scientists’ needs for pathway analysis, and heuristic evaluations of five popular pathway visualization systems based on the identified data analysis requirements. These studies suggest several critical new requirements for pathway visualization systems that provide guidance for future development of these tools. The critical requirements include: 1) automated construction and updating of pathways by searching literature databases, 2) overlaying rich biological information on pathways in a biologically relevant format, 3) linking pathways to multidimensional data from high throughput experiments such as microarrays, 4) overviewing multiple pathways simultaneously and analysis of inter-connections between them, 5) scaling pathways to higher levels of abstraction to analyze effects of complex molecular interactions across levels of biological organization, and 6) logical organization and sharing of accumulated knowledge among researchers.

[1]  Edward Rolf Tufte,et al.  The visual display of quantitative information , 1985 .

[2]  G. Yarrington Molecular Cell Biology , 1987, The Yale Journal of Biology and Medicine.

[3]  Jakob Nielsen,et al.  Finding usability problems through heuristic evaluation , 1992, CHI.

[4]  James D. Hollan,et al.  Pad++: a zooming graphical interface for exploring alternate interface physics , 1994, UIST '94.

[5]  Peter D. Karp,et al.  EcoCyc: Encyclopedia of Escherichia coli genes and metabolism , 1998, Nucleic Acids Res..

[6]  M. Bittner,et al.  Expression profiling using cDNA microarrays , 1999, Nature Genetics.

[7]  Ivan Herman,et al.  Graph Visualization and Navigation in Information Visualization: A Survey , 2000, IEEE Trans. Vis. Comput. Graph..

[8]  Carole A. Goble,et al.  A classification of tasks in bioinformatics , 2001, Bioinform..

[9]  Michael Krauthammer,et al.  GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles , 2001, ISMB.

[10]  Ian M. Donaldson,et al.  BIND: THE BIOMOLECULAR INTERACTION DATABASE , 2001 .

[11]  Allan Kuchinsky,et al.  Biological storytelling: a software tool for biological information organization based upon narrative structure , 2002, AVI '02.

[12]  G. Churchill Fundamentals of experimental design for cDNA microarrays , 2002, Nature Genetics.

[13]  Steven C. Lawlor,et al.  GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways , 2002, Nature Genetics.

[14]  Mary Beth Rosson,et al.  Usability Engineering: Scenario-based Development of Human-Computer Interaction , 2001 .

[15]  Steven C. Lawlor,et al.  MAPPFinder: using Gene Ontology and GenMAPP to create a global gene-expression profile from microarray data , 2003, Genome Biology.

[16]  Naren Ramakrishnan,et al.  The Emerging Landscape of Bioinformatics Software Systems , 2002, Computer.

[17]  John Quackenbush Microarray data normalization and transformation , 2002, Nature Genetics.

[18]  Emek Demir,et al.  PATIKA: an integrated visual environment for collaborative construction and analysis of cellular pathways , 2002, Bioinform..

[19]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[20]  Akihiko Konagaya,et al.  KnowledgeEditor: a new tool for interactive modeling and analyzing biological pathways based on microarray data , 2003, Bioinform..

[21]  Min Su Lee,et al.  UniPath: A Knowledge Representation System for Biological Pathways , 2003 .

[22]  Ivan Bratko,et al.  GenePath: a system for automated construction of genetic networks from mutant data , 2003, Bioinform..

[23]  Xiaoyan Zhu,et al.  PathwayFinder: Paving the Way Towards Automatic Pathway Extraction , 2004, APBC.

[24]  S. Rhee,et al.  MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. , 2004, The Plant journal : for cell and molecular biology.

[25]  Chris North,et al.  An Evaluation of Microarray Visualization Tools for Biological Insight , 2004, IEEE Symposium on Information Visualization.