Building compatible and dynamic character matrices – Current and future use of specimen-based character data

Abstract Herbarium specimens have always played a central role in plant sciences and constitute the cornerstone for systematic and taxonomy. This role is further strengthened with the ongoing digitisation and growing online-availability of collections all over the globe. The increasing usability of specimens demands, however, an improved use and sustainable handling of specimen data not only in new scientific uses correlated with the digitisation, but also by modern workflows applied to the traditional purpose of specimens. A crucial step in the comparative analyses of organisms is the preparation of a character matrix to observe and assess the morphological extent and variability of taxa on the basis of individual specimens. This process and the resulting matrix often are of ephemeral nature since only its results are published in a condensed form. The data relationships are usually not stored, making a re-use impossible and a new analysis inevitable. To overcome the limitations of conventional taxonomy, we here introduce a comprehensive workflow that is currently being implemented on the EDIT Platform for Cybertaxonomy.

[1]  Vincent S. Smith,et al.  Actionable, long-term stable and semantic web compatible identifiers for access to biological collection objects , 2017, Database J. Biol. Databases Curation.

[2]  Walter G. Berendsohn,et al.  A Comprehensive and Standards-Aware Common Data Model (CDM) for Taxonomic Research , 2017 .

[3]  H. Ross Principles of Numerical Taxonomy , 1964 .

[4]  R Vignes,et al.  Computer-aided identification of insect vectors. , 1989, Parasitology today.

[5]  Pankaj Jaiswal,et al.  Gramene database: a hub for comparative plant genomics. , 2011, Methods in molecular biology.

[6]  W. P. Maddison,et al.  Mesquite: a modular system for evolutionary analysis. Version 2.01 (Build j28) , 2007 .

[7]  Andy Pereira,et al.  Plant Reverse Genetics , 2011, Methods in Molecular Biology.

[8]  D. Maddison,et al.  Mesquite: a modular system for evolutionary analysis. Version 2.6 , 2009 .

[9]  Cynthia L. Smith,et al.  Integrating phenotype ontologies across multiple species , 2010, Genome Biology.

[10]  Birgitta König-Ries,et al.  Towards an Integrated Biodiversity and Ecological Research Data Management and Archiving Platform: The German Federation for the Curation of Biological Data (GFBio) , 2014, GI-Jahrestagung.

[11]  Gregor Hagedorn,et al.  A comprehensive reference model for biological collections and surveys , 1999 .

[12]  Cedric Raguenaud,et al.  The Prometheus Description Model: an examination of the taxonomic description-building process and its representation , 2005 .

[13]  J. M. Heberling,et al.  Herbarium specimens as exaptations: New uses for old collections. , 2017, American journal of botany.

[14]  V. Funk,et al.  The importance of vouchers , 2005 .

[15]  Walter G. Berendsohn,et al.  Devising the EDIT Platform for Cybertaxonomy , 2010 .

[16]  Ben C. Stöver,et al.  Sample data processing in an additive and reproducible taxonomic workflow by using character data persistently linked to preserved individual specimens , 2015, Database J. Biol. Databases Curation.

[17]  Barry Smith,et al.  The Plant Ontology as a Tool for Comparative Plant Anatomy and Genomic Analyses , 2012, Plant & cell physiology.

[18]  S. Higgins,et al.  TRY – a global database of plant traits , 2011, Global Change Biology.

[19]  Anton Güntsch,et al.  Ageing: Rejuvenation study stirs old memories , 2017, Nature.

[20]  M. Watson,et al.  The Prometheus Taxonomic Model: a practical approach to representing multiple classifications. , 2000 .

[21]  M. J. Dallwitz,et al.  Definition of the DELTA format , 2015 .

[22]  Robert Tolksdorf,et al.  A Terminology Service Supporting Semantic Annotation, Integration, Discovery and Analysis of Interdisciplinary Research Data , 2016, Datenbank-Spektrum.

[23]  Barry Smith,et al.  Ontologies as Integrative Tools for Plant Science Nih Public Access Author Manuscript $watermark-text Ontology 101 $watermark-text , 2022 .

[24]  A METHOD FOR DATA CAPTURE , 1972 .

[25]  Pamela S Soltis,et al.  Old Plants, New Tricks: Phenological Research Using Herbarium Specimens. , 2017, Trends in ecology & evolution.

[26]  Isabelle Mougenot,et al.  ThesauForm - Traits: A web based collaborative tool to develop a thesaurus for plant functional diversity research , 2012, Ecol. Informatics.

[27]  W. G. Berendsohn,et al.  Biodiversity information platforms: From standards to interoperability , 2011, ZooKeys.

[28]  Brian Macisaac,et al.  Common data model , 1999 .

[29]  P. Soltis Digitization of herbaria enables novel research. , 2017, American journal of botany.

[30]  Hong Cui,et al.  Building the “Plant Glossary”—A controlled botanical vocabulary using terms extracted from the Floras of North America and China , 2017 .

[31]  Jim Diederich Basic properties for biological databases: Character development and support , 1997 .

[32]  J. Balhoff,et al.  Time to change how we describe biodiversity. , 2012, Trends in ecology & evolution.

[33]  Walter G. Berendsohn,et al.  An integrative and dynamic approach for monographing species-rich plant groups - Building the global synthesis of the angiosperm order Caryophyllales , 2015 .

[34]  Hilmar Lapp,et al.  Evolutionary Characters, Phenotypes and Ontologies: Curating Data from the Systematic Biology Literature , 2010, PloS one.

[35]  R. J. PANKHURST Key Generation by Computer , 1970, Nature.

[36]  Gregor Hagedorn,et al.  A method to establish and revise descriptive data sets over the Internet , 2000 .

[37]  Régine Vignes Lebbe,et al.  Xper²: managing descriptive data from their collection to e-monographs , 2010 .

[38]  M. J. Dallwitz,et al.  A General System for Coding Taxonomic Descriptions , 1980 .

[39]  Gregor Hagedorn Structuring Descriptive Data of Organisms — Requirement Analysis and Information Models , 2007 .

[40]  Anton Güntsch,et al.  Bottom-up Taxon Characterisations with Shared Knowledge: Describing Specimens in a Semantic Context , 2017, S4BioDiv@ISWC.