MorphoBank: phylophenomics in the “cloud”

A highly interoperable informatics infrastructure rapidly emerged to handle genomic data used for phylogenetics and was instrumental in the growth of molecular systematics. Parallel growth in software and databases to address needs peculiar to phylophenomics has been relatively slow and fragmented. Systematists currently face the challenge that Earth may hold tens of millions of species (living and fossil) to be described and classified. Research on this scale is increasingly carried out by teams, many of which construct large phenomic supermatrices. Until now, phylogeneticists have managed data in single‐user, file‐based desktop software wholly unsuitable for real‐time, team‐based collaborative work. Furthermore, phenomic data often differ from genomic data in readily lending themselves to media representation (e.g. 2D and 3D images, video, sound). Phenomic data are a growing component of phylogenetics, and thus teams require the ability to record homology hypotheses using media and to share and archive these data. Here we describe MorphoBank, a web application and database for the construction of matrices of phenomic data, which follows a software‐as‐a‐service approach compatible with “cloud” computing technology. In its tenth year, and fully available to the scientific community at large since inception, MorphoBank enables interactive collaboration not possible with desktop software, permitting self‐assembling teams to develop matrices, in real time, with linked media in a secure web environment. MorphoBank also provides any user with tools to build character and media ontologies (rule sets) within matrices, and to display these as directed acyclic graphs. These rule sets record the phylogenetic interrelatedness of characters (e.g. if X is absent, Y is inapplicable, or X–Z characters share a media view).
MorphoBank has enabled an order‐of‐magnitude increase in phylophenomic data collection: a recent collaboration by more than 25 researchers has produced a database of > 4500 phenomic characters supported by > 10 000 media.
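The rule sets mentioned above ("if X is absent, Y is inapplicable") can be pictured as a directed acyclic graph over characters. The following is a minimal sketch of that idea only, not MorphoBank's implementation: the character names, the `rules` tuple layout, the `"-"` code for inapplicable, and the choice to propagate inapplicability down the graph are all assumptions made for illustration.

```python
from collections import defaultdict

INAPPLICABLE = "-"  # assumed code for an inapplicable score

# Hypothetical rules: (parent character, triggering state, child character).
# When the parent has the triggering state, the child is inapplicable.
rules = [
    ("tail", "absent", "tail_length"),
    ("tail", "absent", "tail_fin"),
    ("tail_fin", "absent", "tail_fin_shape"),
]


def build_graph(rules):
    """Map each parent character to its (trigger, child) edges."""
    graph = defaultdict(list)
    for parent, trigger, child in rules:
        graph[parent].append((trigger, child))
    return graph


def topological_order(graph):
    """Return characters parents-first; raise ValueError if rules form a cycle."""
    state = {}  # character -> "visiting" or "done"
    order = []

    def visit(node):
        if state.get(node) == "done":
            return
        if state.get(node) == "visiting":
            raise ValueError(f"rule set is not acyclic (cycle through {node!r})")
        state[node] = "visiting"
        for _, child in graph.get(node, []):
            visit(child)
        state[node] = "done"
        order.append(node)

    for node in list(graph):
        visit(node)
    order.reverse()
    return order


def apply_rules(scores, rules):
    """Return a copy of one taxon's scores with dependent characters set
    to inapplicable. Assumption: an inapplicable parent also makes its
    children inapplicable."""
    graph = build_graph(rules)
    result = dict(scores)
    for parent in topological_order(graph):
        for trigger, child in graph.get(parent, []):
            if result.get(parent) in (trigger, INAPPLICABLE):
                result[child] = INAPPLICABLE
    return result


if __name__ == "__main__":
    taxon = {"tail": "absent", "tail_length": "long",
             "tail_fin": "present", "tail_fin_shape": "forked"}
    print(apply_rules(taxon, rules))
```

Processing characters in topological order means each character is settled before anything that depends on it, so one pass over the graph is enough, and the cycle check rejects rule sets that could not be drawn as a directed acyclic graph.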
© The Willi Hennig Society 2011.
