ReVeaLD: A user-driven domain-specific interactive search platform for biomedical research

Bioinformatics research relies heavily on the ability to discover and correlate data from various sources. The specialization of life sciences over the past decade, coupled with an increasing number of biomedical datasets available through standardized interfaces, has created opportunities towards new methods in biomedical discovery. Despite the popularity of semantic web technologies in tackling the integrative bioinformatics challenge, there are many obstacles towards its usage by non-technical research audiences. In particular, the ability to fully exploit integrated information needs using improved interactive methods intuitive to the biomedical experts. In this report we present ReVeaLD (a Real-time Visual Explorer and Aggregator of Linked Data), a user-centered visual analytics platform devised to increase intuitive interaction with data from distributed sources. ReVeaLD facilitates query formulation using a domain-specific language (DSL) identified by biomedical experts and mapped to a self-updated catalogue of elements from external sources. ReVeaLD was implemented in a cancer research setting; queries included retrieving data from in silico experiments, protein modeling and gene expression. ReVeaLD was developed using Scalable Vector Graphics and JavaScript and a demo with explanatory video is available at http://www.srvgal78.deri.ie:8080/explorer. A set of user-defined graphic rules controls the display of information through media-rich user interfaces. Evaluation of ReVeaLD was carried out as a game: biomedical researchers were asked to assemble a set of 5 challenge questions and time and interactions with the platform were recorded. Preliminary results indicate that complex queries could be formulated under less than two minutes by unskilled researchers. The results also indicate that supporting the identification of the elements of a DSL significantly increased intuitiveness of the platform and usability of semantic web technologies by domain users.

[1]  Tiziana Catarci,et al.  Visual Query Systems for Databases: A Survey , 1997, J. Vis. Lang. Comput..

[2]  Bin Chen,et al.  Chem2Bio2RDF: A Linked Open Data Portal for Chemical Biology , 2010, ArXiv.

[3]  Jeffrey Heer,et al.  SpanningAspectRatioBank Easing FunctionS ArrayIn ColorIn Date Interpolator MatrixInterpola NumObjecPointI Rectang ISchedu Parallel Pause Scheduler Sequen Transition Transitioner Transiti Tween Co DelimGraphMLCon IData JSONCon DataField DataSc Dat DataSource Data DataUtil DirtySprite LineS RectSprite , 2011 .

[4]  Nigel Shadbolt,et al.  NITELIGHT: A Graphical Tool for Semantic Query Construction , 2008 .

[5]  Arthur Dalby,et al.  Description of several chemical structure file formats used by computer programs developed at Molecular Design Limited , 1992, J. Chem. Inf. Comput. Sci..

[6]  Virpi Roto,et al.  User Experience Evaluation Methods in Academic and Industrial Contexts , 2009 .

[7]  Brian McBride,et al.  Jena: A Semantic Web Toolkit , 2002, IEEE Internet Comput..

[8]  Helena F. Deus,et al.  Data integration gets 'Sloppy' , 2006, Nature Biotechnology.

[9]  Hubert Cahuzac,et al.  From intuitive mapping to concept mapping: an application within an anthropological urban field study , 2004 .

[10]  Nikos Loutas,et al.  A collaborative methodology for developing a semantic model for interlinking Cancer Chemoprevention linked-data sources , 2014, Semantic Web.

[11]  Jiajie Zhang,et al.  The Nature of External Representations in Problem Solving , 1997, Cogn. Sci..

[12]  M. Scott Marshall,et al.  Translating standards into practice - One Semantic Web API for Gene Expression , 2012, J. Biomed. Informatics.

[13]  Masao Nagasaki,et al.  Cell Illustrator 4.0: A Computational Platform for Systems Biology , 2010, Silico Biol..

[14]  Enrico Motta,et al.  Overcoming Schema Heterogeneity between Linked Semantic Repositories to Improve Coreference Resolution , 2009, ASWC.

[15]  HeerJeffrey,et al.  D3 Data-Driven Documents , 2011 .

[16]  David H. Jonassen,et al.  Structural Knowledge: Techniques for Representing, Conveying, and Acquiring Structural Knowledge , 1993 .

[17]  J. Mintzes,et al.  The concept map as a research tool: Exploring conceptual change in biology , 1990 .

[18]  Huajun Chen,et al.  Semantic Web meets Integrative Biology: a survey , 2013, Briefings Bioinform..

[19]  Kei-Hoi Cheung,et al.  Advancing translational research with the Semantic Web , 2007, BMC Bioinformatics.

[20]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[21]  Amit P. Sheth,et al.  From Questions to Effective Answers: On the Utility of Knowledge-Driven Querying Systems for Life Sciences Data , 2013, DILS.

[22]  Christopher G. Chute,et al.  BioPortal: ontologies and integrated data resources at the click of a mouse , 2009, Nucleic Acids Res..

[23]  Stephen G. Kobourov,et al.  Force-Directed Drawing Algorithms , 2013, Handbook of Graph Drawing and Visualization.

[24]  Roberto Tamassia,et al.  Handbook on Graph Drawing and Visualization , 2013 .

[25]  Ronan Fitzpatrick,et al.  Usable Software and Its Attributes: A Synthesis of Software Quality, European Community Law and Human-Computer Interaction , 1998, BCS HCI.

[26]  Kevin Lano,et al.  Slicing of UML models using model transformations , 2010, MODELS'10.

[27]  Zhisheng Huang,et al.  Linked Life Data , 2012 .

[28]  David R. Karger,et al.  Fresnel: A Browser-Independent Presentation Vocabulary for RDF , 2005, SEMWEB.

[29]  Nicola Dell,et al.  VIQUEN: A visual query engine for RDF , 2010 .

[30]  Nicole Tourigny,et al.  Bio2RDF: Towards a mashup to build bioinformatics knowledge systems , 2008, J. Biomed. Informatics.

[31]  Mark A. Musen,et al.  A System for Ontology-Based Annotation of Biomedical Data , 2008, DILS.

[32]  John N. Weinstein,et al.  Exposing the cancer genome atlas as a SPARQL endpoint , 2010, J. Biomed. Informatics.

[33]  Didier Buchs,et al.  GReg : a domain specific language for the modeling of genetic regulator y mechanisms , 2011 .

[34]  Jerri L. Ledford,et al.  Google Analytics , 2006 .

[35]  Nigel Shadbolt,et al.  A Visual Approach to Semantic Query Design Using a Web-Based Graphical Query Designer , 2008, EKAW.

[36]  E. Heiss,et al.  Mechanism-based in vitro screening of potential cancer chemopreventive agents. , 2003, Mutation research.

[37]  Wolfgang Maass,et al.  Development of Integrative Bioinformatics Applications using Cloud Computing resources and Knowledge Organization Systems (KOS). , 2011 .

[38]  Adrian Paschke,et al.  A journey to Semantic Web query federation in the life sciences , 2009, BMC Bioinformatics.

[39]  J. Novak,et al.  The use of concept mapping and knowledge vee mapping with junior high school science students , 1983 .

[40]  David R. Karger,et al.  Exhibit: lightweight structured data publishing , 2007, WWW '07.

[41]  Helena Futscher de Deus Improving discovery in the life sciences using semantic Web technologies and linked data: design principles for life sciences knowledge organization systems , 2011 .

[42]  Nigel Bevan,et al.  What is the difference between the purpose of usability and user experience evaluation methods , 2009 .

[43]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[44]  T. Walter Combining Domain-Specific Languages and Ontology Technologies ? , 2009 .

[45]  Steffen Staab,et al.  OntoDSL: An Ontology-Based Framework for Domain-Specific Languages , 2009, MoDELS.

[46]  Bruce Phillips,et al.  Tracking real-time user experience (TRUE): a comprehensive instrumentation solution for complex systems , 2008, CHI.

[47]  Colin Ware,et al.  Visual Queries: The Foundation of Visual Thinking , 2005, Knowledge and Information Visualization.

[48]  Carole A. Goble,et al.  BioCatalogue: a universal catalogue of web services for the life sciences , 2010, Nucleic Acids Res..

[49]  Amit P. Sheth,et al.  TcruziKB: Enabling Complex Queries for Genomic Data Exploration , 2008, 2008 IEEE International Conference on Semantic Computing.

[50]  Gavin Powell,et al.  Towards Fuzzy Query-Relaxation for RDF , 2012, ESWC.

[51]  Yuki Hayashi,et al.  AGUIA: autonomous graphical user interface assembly for clinical trials semantic data services , 2010, BMC Medical Informatics Decis. Mak..

[52]  Wolfgang Maass,et al.  S3QL: A distributed domain specific language for controlled semantic integration of life sciences data , 2011, BMC Bioinformatics.

[53]  Thomas Ertl,et al.  SemLens: visual analysis of semantic data with scatter plots and semantic lenses , 2011, I-Semantics '11.

[54]  Egon L. Willighagen,et al.  Emerging practices for mapping and linking life sciences data using RDF - A case series , 2012, J. Web Semant..

[55]  S. Kramer Application of concept mapping to systems engineering , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[56]  A. Bernstein,et al.  Ginseng : A Guided Input Natural Language Search Engine for Querying Ontologies , 2006 .

[57]  L. Ohno-Machado Journal of Biomedical Informatics , 2001 .

[58]  Stefan Decker,et al.  Cataloguing and Linking Life Sciences LOD Cloud , 2009 .

[59]  Marianne Winslett,et al.  Scientific and Statistical Database Management, 21st International Conference, SSDBM 2009, New Orleans, LA, USA, June 2-4, 2009, Proceedings , 2009, SSDBM.

[60]  Thomas Franz,et al.  LENA - Browsing RDF Data More Complex Than Foaf , 2008, International Semantic Web Conference.

[61]  Carlos Alberto Heuser,et al.  Integrating Biological Databases , 2003, SBBD.

[62]  D Thieffry,et al.  GINsim: a software suite for the qualitative modelling, simulation and analysis of regulatory networks. , 2006, Bio Systems.

[63]  Alexandra Poulovassilis,et al.  Event-Condition-Action Rule Languages for the Semantic Web , 2006, EDBT Workshops.

[64]  Dan Suciu,et al.  A Dataflow Graph Transformation Language and Query Rewriting System for RDF Ontologies , 2012, SSDBM.

[65]  Eyal Oren,et al.  Extending Faceted Navigation for RDF Data , 2006, SEMWEB.

[66]  Alexandra Poulovassilis,et al.  RDFTL : An Event-Condition-Action Language for RDF , 2004 .

[67]  Benedikt Nordhoff,et al.  Dijkstra’s Algorithm , 2013 .

[68]  Jeffrey Heer,et al.  D³ Data-Driven Documents , 2011, IEEE Transactions on Visualization and Computer Graphics.

[69]  R.W. Harrison,et al.  Domain specific languages for cellular interactions , 2004, The 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[70]  Wilbert O. Galitz,et al.  The Essential Guide to User Interface Design: An Introduction to GUI Design Principles and Techniques , 1996 .