MediSyn: uncertainty-aware visualization of multiple biomedical datasets to support drug treatment selection

Background Dispersed biomedical databases limit user exploration to generate structured knowledge. Linked Data unifies data structures and makes the dispersed data easy to search across resources, but it lacks supporting human cognition to achieve insights. In addition, potential errors in the data are difficult to detect in their free formats. Devising a visualization that synthesizes multiple sources in such a way that links between data sources are transparent, and uncertainties, such as data conflicts, are salient is challenging. Results To investigate the requirements and challenges of uncertainty-aware visualizations of linked data, we developed MediSyn, a system that synthesizes medical datasets to support drug treatment selection. It uses a matrix-based layout to visually link drugs, targets (e.g., mutations), and tumor types. Data uncertainties are salient in MediSyn; for example, (i) missing data are exposed in the matrix view of drug-target relations; (ii) inconsistencies between datasets are shown via overlaid layers; and (iii) data credibility is conveyed through links to data provenance. Conclusions Through the synthesis of two manually curated datasets, cancer treatment biomarkers and drug-target bioactivities, a use case shows how MediSyn effectively supports the discovery of drug-repurposing opportunities. A study with six domain experts indicated that MediSyn benefited the drug selection and data inconsistency discovery. Though linked publication sources supported user exploration for further information, the causes of inconsistencies were not easy to find. Additionally, MediSyn could embrace more patient data to increase its informativeness. We derive design implications from the findings.

[1]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[2]  Silvia Miksch,et al.  Visualizing Sets and Set-typed Data: State-of-the-Art and Future Challenges , 2014, EuroVis.

[3]  Charles Perin,et al.  Revisiting Bertin Matrices: New Interactions for Crafting Tabular Visualizations , 2014, IEEE Transactions on Visualization and Computer Graphics.

[4]  Michael P. Schroeder,et al.  In silico prescription of anticancer drugs to cohorts of 28 tumor types reveals targeting opportunities. , 2015, Cancer cell.

[5]  Dieter Schmalstieg,et al.  VisBricks: Multiform Visualization of Large, Inhomogeneous Data , 2011, IEEE Transactions on Visualization and Computer Graphics.

[6]  Barend Mons,et al.  Integrated Bio-Search: challenges and trends for the integration, search and comprehensive processing of biological information , 2014, BMC Bioinformatics.

[7]  Katrien Verbert,et al.  Scalable Exploration of Relevance Prospects to Support Decision Making , 2016, IntRS@RecSys.

[8]  Jock D. Mackinlay,et al.  Automating the design of graphical presentations of relational information , 1986, TOGS.

[9]  Jordi Mestres,et al.  PharmaTrek: A Semantic Web Explorer for Open Innovation in Multitarget Drug Discovery , 2012, Molecular informatics.

[10]  Li Chen,et al.  A user-centric evaluation framework for recommender systems , 2011, RecSys '11.

[11]  Dieter Schmalstieg,et al.  ConTour: Data-Driven Exploration of Multi-Relational Datasets for Drug Discovery , 2014, IEEE Transactions on Visualization and Computer Graphics.

[12]  Hanspeter Pfister,et al.  UpSet: Visualization of Intersecting Sets , 2014, IEEE Transactions on Visualization and Computer Graphics.

[13]  Gem Stapleton,et al.  Visualizing Sets with Linear Diagrams , 2015, TCHI.

[14]  Adrien Ugon,et al.  Using visual analytics for presenting comparative information on new drugs , 2017, J. Biomed. Informatics.

[15]  Mark Gahegan,et al.  A typology for visualizing uncertainty , 2005, IS&T/SPIE Electronic Imaging.

[16]  P. Riehmann,et al.  Interactive Sankey diagrams , 2005, IEEE Symposium on Information Visualization, 2005. INFOVIS 2005..

[17]  Xiaohui Liu,et al.  Visualization of high throughput biological data , 2012, Proceedings of the ITI 2012 34th International Conference on Information Technology Interfaces.

[18]  Krister Wennerberg,et al.  Axitinib effectively inhibits BCR-ABL1(T315I) with a distinct binding conformation , 2015, Nature.

[19]  Lydia B. Chilton,et al.  Tabulator: Exploring and Analyzing linked data on the Semantic Web , 2006 .

[20]  Kenneth A. Perrine,et al.  Interactive visualization of multiple query results , 2001, IEEE Symposium on Information Visualization, 2001. INFOVIS 2001..

[21]  Nicole Tourigny,et al.  Bio2RDF: Towards a mashup to build bioinformatics knowledge systems , 2008, J. Biomed. Informatics.

[22]  Stefan Decker,et al.  Linked Biomedical Dataspace: Lessons Learned Integrating Data for Drug Discovery , 2014, SEMWEB.

[23]  Stefan Decker,et al.  ReVeaLD: A user-driven domain-specific interactive search platform for biomedical research , 2014, J. Biomed. Informatics.

[24]  Dieter Schmalstieg,et al.  StratomeX: Visual Analysis of Large‐Scale Heterogeneous Genomics Data for Cancer Subtype Characterization , 2012, Comput. Graph. Forum.

[25]  Niklas Elmqvist,et al.  Exploring the design space of composite visualization , 2012, 2012 IEEE Pacific Visualization Symposium.

[26]  Vassilis Virvilis,et al.  Literature mining, ontologies and information visualization for drug repurposing , 2011, Briefings Bioinform..

[27]  J. Sweller COGNITIVE LOAD THEORY, LEARNING DIFFICULTY, AND INSTRUCTIONAL DESIGN , 1994 .

[28]  Funda Meric-Bernstam,et al.  The right drugs at the right time for the right patient: the MD Anderson precision oncology decision support platform. , 2015, Drug discovery today.

[29]  Barend Mons,et al.  Open PHACTS: semantic interoperability for drug discovery. , 2012, Drug discovery today.

[30]  Hanspeter Pfister,et al.  Domino: Extracting, Comparing, and Manipulating Subsets Across Multiple Tabular Datasets , 2014, IEEE Transactions on Visualization and Computer Graphics.

[31]  Bart P. Knijnenburg,et al.  Explaining the user experience of recommender systems , 2012, User Modeling and User-Adapted Interaction.

[32]  S. J. Campbell,et al.  Visualizing the drug target landscape. , 2010, Drug discovery today.

[33]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[34]  Ricardo J. G. B. Campello,et al.  On the selection of appropriate distances for gene expression data clustering , 2014, BMC Bioinformatics.

[35]  Stefan Decker,et al.  GenomeSnip: Fragmenting the Genomic Wheel to augment discovery in cancer research , 2014 .

[36]  Egon L. Willighagen,et al.  Linked open drug data for pharmaceutical research and development , 2011, J. Cheminformatics.