Finding and sharing: new approaches to registries of databases and services for the biomedical sciences

The recent explosion of biological data and the concomitant proliferation of distributed databases make it challenging for biologists and bioinformaticians to discover the best data resources for their needs, and the most efficient way to access and use them. Despite a rapid acceleration in uptake of syntactic and semantic standards for interoperability, it is still difficult for users to find which databases support the standards and interfaces that they need. To solve these problems, several groups are developing registries of databases that capture key metadata describing the biological scope, utility, accessibility, ease-of-use and existence of web services allowing interoperability between resources. Here, we describe some of these initiatives including a novel formalism, the Database Description Framework, for describing database operations and functionality and encouraging good database practise. We expect such approaches will result in improved discovery, uptake and utilization of data resources. Database URL: http://www.casimir.org.uk/casimir_ddf

[1]  Carole A. Goble,et al.  BioCatalogue: a universal catalogue of web services for the life sciences , 2010, Nucleic Acids Res..

[2]  Sarah Cohen Boulakia,et al.  Data Integration in the Life Sciences, 5th International Workshop, DILS 2008, Evry, France, June 25-27, 2008. Proceedings , 2008, DILS.

[3]  Michael Zouberakis,et al.  CASIMIR: Coordination and Sustainability of International Mouse Informatics Resources , 2008, 2008 8th IEEE International Conference on BioInformatics and BioEngineering.

[4]  Michael Zouberakis,et al.  Mouse Resource Browser—a database of mouse databases , 2010, Database J. Biol. Databases Curation.

[5]  Michelle D. Brazas,et al.  Evolution in bioinformatic resources: 2009 update on the Bioinformatics Links Directory , 2009, Nucleic Acids Res..

[6]  Steve Pettifer,et al.  An active registry for bioinformatics web services , 2009, Bioinform..

[7]  Nigel W. Hardy,et al.  Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project , 2008, Nature Biotechnology.

[8]  I. Foster,et al.  Service-Oriented Science , 2005, Science.

[9]  Carole A. Goble,et al.  Taverna: a tool for building and running workflows of services , 2006, Nucleic Acids Res..

[10]  Radha Boddepalli,et al.  DoD2007: 1082 molecular biology databases , 2007, Bioinformation.

[11]  Patricia Rodriguez-Tomé The BioCatalog , 1998, Bioinform..

[12]  Michael Y. Galperin,et al.  The 2010 Nucleic Acids Research Database Issue and online Database Collection: a community of data resources , 2009, Nucleic Acids Res..

[13]  Anne E. Trefethen,et al.  Cyberinfrastructure for e-Science , 2005, Science.

[14]  Joel H. Saltz,et al.  caGrid: design and implementation of the core architecture of the cancer biomedical informatics grid , 2006, Bioinform..

[15]  Michael Zouberakis,et al.  Solutions for data integration in functional genomics: a critical assessment and case study , 2008, Briefings Bioinform..

[16]  Lincoln D. Stein,et al.  Towards a cyberinfrastructure for the biological sciences: progress, visions and challenges , 2008, Nature Reviews Genetics.