Enabling reusability of plant phenomic datasets with MIAPPE 1.1

Summary

Enabling data reuse and knowledge discovery is increasingly critical in modern science and requires an effort towards standardising data publication practices. This is particularly challenging in the plant phenotyping domain, due to its complexity and heterogeneity. We have produced the MIAPPE 1.1 release, which enhances the existing MIAPPE standard in three respects: in coverage, to support perennial plants; in structure, through an explicit data model; and in clarity, through definitions and examples. We evaluated MIAPPE 1.1 by using it to express several heterogeneous phenotyping experiments in a range of different formats, demonstrating both its applicability and the interoperability between the various implementations. The extended coverage is further demonstrated by the fact that one of the datasets could not have been described under MIAPPE 1.0. MIAPPE 1.1 marks a major step towards enabling plant phenotyping data reusability, thanks to its extended coverage and especially the formalisation of its data model, which facilitates its implementation in different formats. Community feedback has been critical to this development and will be a key part of ensuring adoption of the standard.
