Measures for interoperability of phenotypic data: minimum information requirements and formatting

BackgroundPlant phenotypic data shrouds a wealth of information which, when accurately analysed and linked to other data types, brings to light the knowledge about the mechanisms of life. As phenotyping is a field of research comprising manifold, diverse and time-consuming experiments, the findings can be fostered by reusing and combining existing datasets. Their correct interpretation, and thus replicability, comparability and interoperability, is possible provided that the collected observations are equipped with an adequate set of metadata. So far there have been no common standards governing phenotypic data description, which hampered data exchange and reuse.ResultsIn this paper we propose the guidelines for proper handling of the information about plant phenotyping experiments, in terms of both the recommended content of the description and its formatting. We provide a document called “Minimum Information About a Plant Phenotyping Experiment”, which specifies what information about each experiment should be given, and a Phenotyping Configuration for the ISA-Tab format, which allows to practically organise this information within a dataset. We provide examples of ISA-Tab-formatted phenotypic data, and a general description of a few systems where the recommendations have been implemented.ConclusionsAcceptance of the rules described in this paper by the plant phenotyping community will help to achieve findable, accessible, interoperable and reusable data.

[1]  A. Baron Experimental Designs , 1990, The Behavior analyst.

[2]  Helen Parkinson,et al.  MIAME/Plant – adding value to plant microarrray experiments , 2006, Plant Methods.

[3]  Pelin Yilmaz,et al.  The genomic standards consortium: bringing standards to life for microbial ecology , 2011, The ISME Journal.

[4]  Aneta Sawikowska,et al.  Quantitative Trait Loci for Yield and Yield-Related Traits in Spring Barley Populations Derived from Crosses between European and Syrian Cultivars , 2016, PloS one.

[5]  Uwe Scholz,et al.  PGP repository: a plant phenomics and genomics data publication infrastructure , 2016, Database J. Biol. Databases Curation.

[6]  Ute Roessner,et al.  Minimum reporting standards for plant biology context information in metabolomic studies , 2007, Metabolomics.

[7]  B. Usadel,et al.  Xeml Lab: a tool that supports the design of experiments at a graphical interface and generates computer-readable metadata files, which capture information about genotypes, growth conditions, environmental perturbations and sampling strategy. , 2009, Plant, cell & environment.

[8]  Hadi Quesneville,et al.  GnpIS: an information system to integrate genetic and genomic data from plants and fungi , 2013, Database J. Biol. Databases Curation.

[9]  Jane Hunter,et al.  PODD: An Ontology-Driven Data Repository for Collaborative Phenomics Research , 2010, ICADL.

[10]  Arllet M. Portugal,et al.  Bridging the phenotypic and genetic data useful for integrated breeding through a data annotation using the Crop Ontology developed by the crop communities of practice , 2012, Front. Physio..

[11]  Uwe Scholz,et al.  Data Management Experiences and Best Practices from the Perspective of a Plant Research Institute , 2014, DILS.

[12]  Nigel W. Hardy,et al.  The metabolomics standards initiative (MSI) , 2007, Metabolomics.

[13]  Uwe Scholz,et al.  Quantitative monitoring of Arabidopsis thaliana growth and development using high-throughput plant phenotyping , 2016, Scientific Data.

[14]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[15]  Hilla Peretz,et al.  The , 1966 .

[16]  William G. Cochran,et al.  Experimental Designs, 2nd Edition , 1950 .

[17]  Uwe Scholz,et al.  e!DAL - a framework to store, share and publish research data , 2014, BMC Bioinformatics.

[18]  A. Alercia,et al.  FAO/Bioversity Multi-Crop Passport Descriptors V.2.1 [MCPD V.2.1] - December 2015 , 2015 .

[19]  John McNeill,et al.  International code of botanical nomenclature (Vienna Code) : adopted by the Seventeenth International Botanical Congress, Vienna, Austria, July 2005 , 2006 .

[20]  B. Hammond Ontology , 2004, Lawrence Booth’s Book of Visions.

[21]  Daniel R. Zerbino,et al.  Ensembl regulation resources , 2016, Database J. Biol. Databases Curation.

[22]  Oliver Hofmann,et al.  ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level , 2010, Bioinform..

[23]  Matthias Lange,et al.  Towards recommendations for metadata and data handling in plant phenotyping. , 2015, Journal of experimental botany.

[24]  Bjarni J. Vilhjálmsson,et al.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines , 2010 .

[25]  P. Reich,et al.  New handbook for standardised measurement of plant functional traits worldwide , 2013 .

[26]  Lennart Martens,et al.  The minimum information about a proteomics experiment (MIAPE) , 2007, Nature Biotechnology.

[27]  Astrid Junker,et al.  Optimizing experimental procedures for quantitative evaluation of crop plant performance in high throughput phenotyping systems , 2015, Front. Plant Sci..

[28]  Matej Oresic,et al.  Data standards can boost metabolomics research, and if there is a will, there is a way , 2015, Metabolomics.

[29]  Susanna-Assunta Sansone,et al.  Standard reporting requirements for biological samples in metabolomics experiments: environmental context , 2007, Metabolomics.

[30]  Paul T. Spellman,et al.  A simple spreadsheet-based, MIAME-supportive format for microarray data: MAGE-TAB , 2006, BMC Bioinformatics.

[31]  Fabio Fiorani,et al.  The art of growing plants for experimental purposes: a practical guide for the plant biologist. , 2012, Functional plant biology : FPB.