A Comparison of Federated Databases with Web Services for the Integration of Bioinformatics Data

Driven by technological breakthroughs and new computational approaches, the amount of biological data is increasing explosively. Many data integration approaches now aim to give biologists central and uniform access to all kinds of biological data. To minimize the disruption of current operations, maintain local autonomy, and handle heterogeneities, Federated Databases and Web Services have been proposed as candidate solutions. The Web Services approach offers the most flexibility; however, its performance has been a concern for many developers. This paper compares the two approaches based on our experience with both. It discusses the trade-offs among performance, support for heterogeneity, robustness, and scalability. A significant finding is that the Web Services approach performs very competitively.
