MDEmic in a use case for microscopy metadata harmonization: Facilitating FAIR principles in practical application with metadata annotation tools

While the FAIR principles are well accepted in the scientific community, the implementation of appropriate metadata editing and transfer to ensure FAIR research data in practice is significantly lagging behind. On the one hand, it strongly depends on the availability of tools that efficiently support this step in research data management. On the other hand, it depends on the available standards regarding the interpretability of metadata. Here, we introduce a tool, MDEmic, for editing metadata of microscopic imaging data in an easy and comfortable way that provides high flexibility in terms of adjustment of metadata sets. This functionality was in great demand by many researchers applying microscopic techniques. MDEmic has already become a part of the standard installation package of the image database OMERO as OMERO.mde. This database helps to organize and visualize microscopic image data and keep track of their further processing and linkage to other data sets. For this reason, many imaging core facilities provide OMERO to their users. We present a use case scenario for the tailored application of OMERO.mde to imaging data of an institutional OMERO-based Membrane Dye Database, which requires specific experimental metadata. Similar to public image data repositories like the Image Data Resource, IDR, this database facilitates image data storage including rich metadata which enables data mining and re-use, one of the major goals of the FAIR principles. Although today the majority of scientific data including microscopy and imaging data are available in digital format, a real benefit from easy sharing and re-using digital data according to the FAIR principles [9] only exists if data are understandable and unambiguously interpretable. Collecting and maintaining the relevant metadata is key to ensuring that data are reliable, reusable and can be found and accessed by the scientific community. Imaging data are usually extremely rich data files as they report on various parameters in a multidimensional space and are acquired with complex microscopy instruments. The metadata or data models are very diverse due to the wide range of e.g. modalities, scales,

[1]  Thomas Pengo,et al.  Imaging methods are vastly underreported in biomedical research , 2020, eLife.

[2]  B. S. Manjunath,et al.  Biological imaging software tools , 2012, Nature Methods.

[3]  Sandeep Ravindran Core curriculum: learning to manage a shared microscopy facility. , 2020, Nature.

[4]  Satoshi Kume,et al.  RIKEN MetaDatabase: A Database Platform for Health Care and Life Sciences as a Microcosm of Linked Open Data Cloud , 2018, Int. J. Semantic Web Inf. Syst..

[5]  Douglas Russell,et al.  Maintaining the provenance of microscopy metadata using OMERO.forms software , 2017, bioRxiv.

[6]  A. Sporbert,et al.  Advanced light microscopy core facilities: Balancing service, science and career , 2016, Microscopy research and technique.

[7]  Bálint Antal,et al.  Image Data Resource: a bioimage data integration and publication platform , 2017, Nature Methods.

[8]  Satoshi Kume,et al.  Development of an Ontology for an Integrated Image Analysis Platform to enable Global Sharing of Microscopy Imaging Data , 2016, International Semantic Web Conference.

[9]  Douglas A. Creager,et al.  The Open Microscopy Environment (OME) Data Model and XML file: open tools for informatics and quantitative analysis in biological imaging , 2005, Genome Biology.

[10]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[11]  Christopher W. Belter,et al.  Data sharing in PLOS ONE: An analysis of Data Availability Statements , 2018, PloS one.

[12]  Vassilios Ioannidis,et al.  FAIRness Literacy: The Achilles' Heel of Applying FAIR Principles , 2020, Data Sci. J..

[13]  Erik Brauner,et al.  Informatics and Quantitative Analysis in Biological Imaging , 2003, Science.

[14]  Ivan B. N. Clark,et al.  PyOmeroUpload: A Python toolkit for uploading images and metadata to OMERO , 2020, Wellcome open research.

[15]  Ardan Patwardhan,et al.  A call for public archives for biological image data , 2018, Nature Methods.

[16]  R. Pepperkok,et al.  Institutional core facilities: prerequisite for breakthroughs in the life sciences , 2016, EMBO reports.

[17]  Maximiliaan Huisman,et al.  Minimum Information guidelines for fluorescence microscopy: increasing the value, quality, and fidelity of image data , 2019, ArXiv.

[18]  Michael S. Nelson,et al.  QUAREP-LiMi: A community-driven initiative to establish guidelines for quality assessment and reproducibility for instruments and images in light microscopy , 2021, Journal of microscopy.

[19]  Lynn Yarmey,et al.  Make scientific data FAIR , 2019, Nature.

[20]  Ryan P. Sullivan,et al.  A global view of standards for open image data formats and repositories , 2020, Nature Methods.

[21]  C. Rueden,et al.  Metadata matters: access to image data in the real world , 2010, The Journal of cell biology.