Model-driven user interfaces for bioinformatics data resources: regenerating the wheel as an alternative to reinventing it

Background
The proliferation of data repositories in bioinformatics has resulted in the development of numerous interfaces that allow scientists to browse, search and analyse the data they contain. Interfaces typically support repository access by means of web pages, but other means are also used, such as desktop applications and command line tools. Interfaces often duplicate one another's functionality, which means that the associated development activities are repeated in different laboratories. Interfaces developed by public laboratories are often created with limited developer resources. In such environments, reducing the time spent on creating user interfaces frees resources for specialised tasks, such as data integration or analysis. Laboratories maintaining data resources are challenged to reconcile requirements for software that is reliable, functional and flexible with limitations on software development resources.

Results
This paper proposes a model-driven approach for the partial generation of user interfaces for searching and browsing bioinformatics data repositories. Inspired by the Model Driven Architecture (MDA) of the Object Management Group (OMG), we have developed a system that generates interfaces designed for use with bioinformatics resources. This approach helps laboratory domain experts reduce the time they spend on the repetitive aspects of user interface development. As a result, they can spend more time gathering requirements and helping to develop specialised features. The resulting system is known as Pierre, and has been validated through its application to use cases in the life sciences, including the PEDRoDB proteomics database and the e-Fungi data warehouse.

Conclusion
MDAs focus on generating software from models that describe aspects of service capabilities, and can be applied to support rapid development of repository interfaces in bioinformatics. The Pierre MDA is capable of supporting common database access requirements with a variety of auto-generated interfaces and across a variety of repositories. Pierre generates four kinds of interfaces: web, stand-alone application, text-menu and command line. Pierre interfaces have been used with relational, XML and object databases.
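The idea of deriving several kinds of interface from one model can be illustrated with a minimal sketch. This is not Pierre's model format or API: the `Field` and `RecordModel` classes, the two rendering functions and the example protein fields are all hypothetical names chosen for illustration. The sketch only shows how a single declarative description of searchable fields might drive both a text-menu and a command-line interface.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Field:
    """One attribute of a repository record (hypothetical structure)."""
    name: str                # machine-readable field name
    label: str               # label shown to the user
    searchable: bool = True  # whether the field appears in search interfaces


@dataclass
class RecordModel:
    """Declarative description of a record type (hypothetical structure)."""
    name: str
    fields: List[Field]


def render_text_menu(model: RecordModel) -> str:
    """Render a numbered text menu listing the searchable fields."""
    lines = [f"Search {model.name} by:"]
    searchable = [f for f in model.fields if f.searchable]
    for number, field in enumerate(searchable, start=1):
        lines.append(f"  {number}. {field.label}")
    return "\n".join(lines)


def render_cli_usage(model: RecordModel) -> str:
    """Render a command-line usage string from the same model."""
    options = " ".join(
        f"[--{f.name} VALUE]" for f in model.fields if f.searchable
    )
    return f"usage: search-{model.name.lower()} {options}"


# Illustrative model only; these fields are assumptions, not taken from
# PEDRoDB or e-Fungi.
protein_model = RecordModel(
    name="Protein",
    fields=[
        Field("accession", "Accession number"),
        Field("organism", "Organism"),
        Field("sequence", "Sequence", searchable=False),
    ],
)

print(render_text_menu(protein_model))
print(render_cli_usage(protein_model))
```

Because both renderers read the same model, adding a searchable field to the model would update both interfaces without any interface-specific code being rewritten, which is the kind of repetition a model-driven approach aims to remove.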
