Towards better traceability of field sampling data

Ensuring traceability of field experimental data or laboratory sampling data to conduct reproducible research is a challenge at the present time. Between the time when geolocalized specimens (biotic or abiotic) are taken, and the time the resulting data ends up in analysis published within a study, many manual operations take place and may generate errors. The French LTSER have joined forces at the national level to propose a solution as generic as possible to this problem of monitoring of the samples and the data associated with them. Compared to existing solutions (such as Laboratory Information Management Systems), we target a robust labeling solution adapted to outdoor working conditions, with the management of stocks and movements of samples. We designed and realized a prototype tested from end to end, using an open source software (https://github.com/Irstea/collec), cheap Zebra printers (mobile or not) and raspberries as devices. This solution provides sufficient flexibility for the wide variety of existing protocols. Its strength is the record of all contextual data associated with the samples, which constitute important parameters of the subsequent analyzes. At last, not only traceability is guaranteed, but also a gain of time and a rationalization of the storage of samples that will induce a return on investment.

[1]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[2]  Gregory Butler,et al.  The EnzymeTracker: an open-source laboratory information management system for sample tracking , 2011, BMC Bioinformatics.

[3]  Vincent S. Smith,et al.  Actionable, long-term stable and semantic web compatible identifiers for access to biological collection objects , 2017, Database J. Biol. Databases Curation.

[4]  Louise McRae,et al.  Global biodiversity monitoring: From data sources to Essential Biodiversity Variables , 2017 .

[5]  Craig Willis,et al.  Analysis and synthesis of metadata goals for scientific data , 2012, J. Assoc. Inf. Sci. Technol..

[6]  Katherine C Sexton,et al.  Development of the ISBER Best Practices for Repositories: Collection, Storage, Retrieval and Distribution of Biological Materials for Research. , 2012, Biopreservation and biobanking.

[7]  Honglin He,et al.  Fostering ecological data sharing: collaborations in the International Long Term Ecological Research Network , 2015 .

[8]  Simon J. D. Cox,et al.  Ontology for observations and sampling features, with alignments to existing models , 2016, Semantic Web.

[9]  Alan Christoffels,et al.  Baobab Laboratory Information Management System: Development of an Open-Source Laboratory Information Management System for Biobanking , 2017, Biopreservation and biobanking.

[10]  Jean Clobert,et al.  Integrating ongoing biodiversity monitoring: potential benefits and methods , 2008, Biodiversity and Conservation.

[11]  Heimo Müller,et al.  From the evaluation of existing solutions to an all-inclusive package for biobanks , 2017, Health and Technology.

[12]  Markus List,et al.  Efficient Sample Tracking With OpenLabFramework , 2014, Scientific reports.

[13]  Randall T. Schuh,et al.  Integrating specimen databases and revisionary systematics , 2012, ZooKeys.

[14]  Dave Roberts,et al.  Towards mainstreaming of biodiversity data publishing: recommendations of the GBIF Data Publishing Framework Task Group , 2011, BMC Bioinformatics.

[15]  Vicki A. Funk,et al.  Utility of QR codes in biological collections , 2013, PhytoKeys.

[16]  Brooks Hanson,et al.  Liberating field science samples and data , 2016, Science.

[17]  Jens Klump,et al.  A Digital Repository for Physical Samples: Concepts, Solutions and Management , 2017, TPDL.

[18]  Suzanne A. Pierce,et al.  Toward the Geoscience Paper of the Future: Best practices for documenting and sharing research from data to software to provenance , 2016 .