Evolution of the Catalogue of Life Architecture

The Species 2000 and ITIS Catalogue of Life aims to create and deliver a catalogue of all known species, using a distributed set of data sources. The current Species 2000 software has developed over a number of years, and the system requirements have evolved substantially over the same period. In this paper we discuss the current Catalogue of Life software, the way the requirements are evolving, and major elements of a planned new architectural design being developed as part of the 4D4Life EU e-Infrastructure project. Of particular importance in the new design is to be able to maintain the catalogue, dealing with potential overlaps between supplier databases; to keep it up to date and manage revisions that arise out of changes of scientific opinion; to be able to map between different taxonomies within and outside the catalogue; to be able to provide a wider range of services to other electronic systems which need the catalogue as their "taxonomic backbone"; and to support third-party applications by means of an open platform architecture.

[1]  A. Peterson,et al.  Predicting Species Invasions Using Ecological Niche Modeling: New Approaches from Bioinformatics Attack a Pressing Problem , 2001 .

[2]  Meng Li,et al.  Stream Operators for Querying Data Streams , 2005, WAIM.

[3]  R. J. White,et al.  Experiences with a Hybrid Implementation of a Globally Distributed Federated Database System , 2001, WAIM.

[4]  Alan Paton,et al.  Biodiversity informatics and the plant conservation baseline. , 2009, Trends in plant science.

[5]  Wenfei Fan,et al.  Keys with Upward Wildcards for XML , 2001, DEXA.

[6]  Carliss Y. Baldwin,et al.  The Architecture of Platforms: A Unified View , 2008 .

[7]  A. Gawer Platforms, Markets and Innovation , 2011 .

[8]  R. J. White,et al.  Techniques for effective integration, maintenance and evolution of species databases , 2000, Proceedings. 12th International Conference on Scientific and Statistica Database Management.

[9]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[10]  R. J. White,et al.  SPICE: A Flexible Architecture for Integrating Autonomous Databases to Comprise a Distributed Catalogue of Life , 2000, DEXA.

[11]  Phil Cryer Adoption of Persistent Identifiers for Biodiversity Informatics , 2010 .

[12]  V. Heywood,et al.  Global Biodiversity Assessment , 1996 .

[13]  Catherine N. Norton,et al.  Taxonomic indexing--extending the role of taxonomy. , 2006, Systematic biology.

[14]  Sean Martin,et al.  Globally distributed object identification for biological knowledgebases , 2004, Briefings Bioinform..