Training Biomedical Researchers in Metadata with a MIBBI-Based Ontology

Recent initiatives in data management recognize that involving the researchers is one of the more problematic issues and that taking into account the practices of each domain can ease this process. We describe here an experiment in the adoption of data description by researchers in the biomedical domain. We started with a generic lightweight ontology based on the Minimum Information for Biological and Biomedical Investigations (MIBBI) standard and presented it to researchers from the Institute of Innovation and Investigation in Health (I3S) in Porto. This resulted in seven interviews and four data description sessions using a RDM platform. The feedback from researchers shows that this intentionally restricted ontology favours an easy entry point into RDM but does not prevent them from identifying the limitations of the model and pinpointing their specific domain requirements. To complete the experiment, we collected the extra descriptors suggested by the researchers and compared them to the full MIBBI. Part of these new descriptors can be obtained from the standard, reinforcing the importance of common metadata models for broad domains such as biomedical research.

[1]  Donatella Castelli,et al.  Are Scientific Data Repositories Coping with Research Data Publishing? , 2016, Data Sci. J..

[2]  Craig Willis,et al.  Analysis and synthesis of metadata goals for scientific data , 2012, J. Assoc. Inf. Sci. Technol..

[3]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[4]  Elizabeth D. Dalton,et al.  Changes in Data Sharing and Data Reuse Practices and Perceptions among Scientists Worldwide , 2015, PloS one.

[5]  Paul N. Schofield,et al.  The role of ontologies in biological and biomedical research: a functional perspective , 2015, Briefings Bioinform..

[6]  Read Kevin Common Metadata Elements for Cataloging Biomedical Datasets , 2015 .

[7]  Nigel W. Hardy,et al.  Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project , 2008, Nature Biotechnology.

[8]  Jessica A. Turner,et al.  The Ontology for Biomedical Investigations , 2016, PloS one.

[9]  Lennart Martens,et al.  The minimum information about a proteomics experiment (MIAPE) , 2007, Nature Biotechnology.

[10]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[11]  T. Cech,et al.  Fostering innovation and discovery in biomedical research. , 2005, JAMA.

[12]  Cristina Ribeiro,et al.  Ranking Dublin Core descriptor lists from user interactions: a case study with Dublin Core Terms using the Dendro platform , 2018, International Journal on Digital Libraries.

[13]  Massimiliano Izzo,et al.  FAIRsharing as a community approach to standards, repositories and policies , 2019, Nature Biotechnology.

[14]  Michael L. Thomas,et al.  Minimization of Childhood Maltreatment Is Common and Consequential: Results from a Large, Multinational Sample Using the Childhood Trauma Questionnaire , 2016, PloS one.

[15]  Susanna-Assunta Sansone,et al.  linkedISA: semantic representation of ISA-Tab experimental metadata , 2014, BMC Bioinformatics.

[16]  Martin Eisenacher,et al.  Controlled vocabularies and ontologies in proteomics: Overview, principles and practice , 2014, Biochimica et biophysica acta.

[17]  K. Law,et al.  LABEX L-IPSL Arctic Metadata Portal , 2016, Data Sci. J..

[18]  Jeremy G. Frey,et al.  Creating Context for the Experiment Record. User-Defined Metadata: Investigations into Metadata Usage in the LabTrove ELN , 2014, J. Chem. Inf. Model..

[19]  Paolo Manghi,et al.  A vision towards Scientific Communication Infrastructures , 2013, International Journal on Digital Libraries.

[20]  Jane Greenberg,et al.  Functional and Architectural Requirements for Metadata: Supporting Discovery and Management of Scientific Data , 2012, Dublin Core Conference.

[21]  Tony Hey,et al.  The Fourth Paradigm: Data-Intensive Scientific Discovery , 2009 .