An open-source framework for neuroscience metadata management applied to digital reconstructions of neuronal morphology

Research advancements in neuroscience entail the production of a substantial amount of data requiring interpretation, analysis, and integration. The complexity and diversity of neuroscience data necessitate the development of specialized databases and associated standards and protocols. NeuroMorpho.Org is an online repository of over one hundred thousand digitally reconstructed neurons and glia shared by hundreds of laboratories worldwide. Every entry of this public resource is associated with essential metadata describing animal species, anatomical region, cell type, experimental condition, and additional information relevant to contextualize the morphological content. Until recently, the lack of a user-friendly, structured metadata annotation system relying on standardized terminologies constituted a major hindrance in this effort, limiting the data release pace. Over the past 2 years, we have transitioned the original spreadsheet-based metadata annotation system of NeuroMorpho.Org to a custom-developed, robust, web-based framework for extracting, structuring, and managing neuroscience information. Here we release the metadata portal publicly and explain its functionality to enable usage by data contributors. This framework facilitates metadata annotation, improves terminology management, and accelerates data sharing. Moreover, its open-source development provides the opportunity of adapting and extending the code base to other related research projects with similar requirements. This metadata portal is a beneficial web companion to NeuroMorpho.Org which saves time, reduces errors, and aims to minimize the barrier for direct knowledge sharing by domain experts. The underlying framework can be progressively augmented with the integration of increasingly autonomous machine intelligence components.

[1]  Martin J. O'Connor,et al.  Using association rule mining and ontologies to generate metadata recommendations from multiple biomedical databases , 2019, Database J. Biol. Databases Curation.

[2]  Paul Clements,et al.  Software architecture in practice , 1999, SEI series in software engineering.

[3]  Douglas M. Bowden,et al.  NeuroNames: An Ontology for the BrainInfo Portal to Neuroscience on the Web , 2011, Neuroinformatics.

[4]  Giorgio A. Ascoli,et al.  An open repository for single-cell reconstructions of the brain forest , 2018, Scientific Data.

[5]  Alan Ruttenberg,et al.  A strategy for building neuroanatomy ontologies , 2012, Bioinform..

[6]  Jessica A. Turner,et al.  The NIFSTD and BIRNLex Vocabularies: Building Comprehensive Ontologies for Neuroscience , 2008, Neuroinformatics.

[7]  Payam Meyer,et al.  The NIH Open Citation Collection: A public access, broad coverage resource , 2019, PLoS biology.

[8]  Kayvan Bijari,et al.  Leveraging deep graph-based text representation for sentiment polarity applications , 2020, Expert Syst. Appl..

[9]  Matthew E Falagas,et al.  Comparison of PubMed, Scopus, Web of Science, and Google Scholar: strengths and weaknesses , 2007, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[10]  Hagen U. Tilgner,et al.  SynGO: An Evidence-Based, Expert-Curated Knowledge Base for the Synapse , 2019, Neuron.

[11]  Hans-Michael Müller,et al.  A hybrid human and machine resource curation pipeline for the Neuroscience Information Framework , 2012, Database J. Biol. Databases Curation.

[12]  Jason Puckett,et al.  Zotero: A Guide for Librarians, Researchers and Educators , 2011 .

[13]  Abha Agrawal,et al.  EndNote 1-2-3 Easy!: Reference Management for the Professional , 2005 .

[14]  Kimberly Van Auken,et al.  Textpresso Central: a customizable platform for searching, text mining, viewing, and curating biomedical literature , 2018, BMC Bioinformatics.

[15]  Maryann E. Martone,et al.  An ontological approach to describing neurons and their relationships , 2012, Front. Neuroinform..

[16]  Michael L. Hines,et al.  Open Source Brain: a collaborative resource for visualizing, analyzing, simulating and developing standardized models of neurons and circuits , 2018, bioRxiv.

[17]  Martin J. O'Connor,et al.  Using semantic technologies to enhance metadata submissions to public repositories in biomedicine , 2018, SWAT4LS.

[18]  Jan Grewe,et al.  A Bottom-up Approach to Data Annotation in Neurophysiology , 2011, Front. Neuroinform..

[19]  Nigel H. Goddard,et al.  Towards NeuroML: model description methods for collaborative modelling in neuroscience. , 2001, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[20]  Benjamin M. Gyori,et al.  FamPlex: a resource for entity recognition and relationship resolution of human protein families and complexes in biomedical text mining , 2018, bioRxiv.

[21]  Giorgio A. Ascoli,et al.  Name-calling in the hippocampus (and beyond): coming to terms with neuron types and properties , 2016, Brain Informatics.

[22]  Michael L. Hines,et al.  NeuroML: A Language for Describing Data Driven Models of Neurons and Networks with a High Degree of Biological Detail , 2010, PLoS Comput. Biol..

[23]  Martin J. O'Connor,et al.  The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata that Describe Scientific Experiments , 2017, SEMWEB.

[24]  Stephen D. Larson,et al.  NeuroLex.org: an online framework for neuroscience knowledge , 2013, Front. Neuroinform..

[25]  Giorgio A. Ascoli,et al.  The importance of metadata to assess information content in digital reconstructions of neuronal morphology , 2014, Cell and Tissue Research.

[26]  Christof Koch,et al.  Neurodata Without Borders: Creating a Common Data Format for Neurophysiology , 2015, Neuron.

[27]  Kristofer E. Bouchard,et al.  BRAINformat: A Data Standardization Framework for Neuroscience Data , 2015, bioRxiv.

[28]  Hanchuan Peng,et al.  Design and implementation of multi-signal and time-varying neural reconstructions , 2018, Scientific Data.

[29]  Hans-Michael Müller,et al.  Textpresso for Neuroscience: Searching the Full Text of Thousands of Neuroscience Research Papers , 2008, Neuroinformatics.

[30]  Daniel Gardner,et al.  Terminology for Neuroscience Data Discovery: Multi-tree Syntax and Investigator-Derived Semantics , 2008, Neuroinformatics.

[31]  Domenico Beneventano,et al.  Computing inter-document similarity with Context Semantic Analysis , 2018, Inf. Syst..

[32]  G. Ascoli,et al.  NeuroMorpho.Org: A Central Resource for Neuronal Morphologies , 2007, The Journal of Neuroscience.

[33]  Richard E. West,et al.  Mendeley: Creating Communities of Scholarly Inquiry Through Research Collaboration , 2011 .

[34]  Giorgio A. Ascoli,et al.  An ontology-based search engine for digital reconstructions of neuronal morphology , 2017, Brain Informatics.

[35]  Michael L. Hines,et al.  Neuron Names: A Gene- and Property-Based Name Format, With Special Reference to Cortical Neurons , 2019, Front. Neuroanat..

[36]  Michael L. Hines,et al.  Open Source Brain: A Collaborative Resource for Visualizing, Analyzing, Simulating, and Developing Standardized Models of Neurons and Circuits , 2018, Neuron.

[37]  Sophia Ananiadou,et al.  A Text Mining Pipeline Using Active and Deep Learning Aimed at Curating Information in Computational Neuroscience , 2018, Neuroinformatics.

[38]  Giorgio A. Ascoli,et al.  PaperBot: open-source web-based search and metadata organization of scientific literature , 2019, BMC Bioinformatics.

[39]  Giorgio A. Ascoli,et al.  Digital Reconstructions of Neuronal Morphology: Three Decades of Research Trends , 2012, Front. Neurosci..

[40]  Sridevi Polavaram,et al.  Win–win data sharing in neuroscience , 2017, Nature Methods.

[41]  Christian O'Reilly,et al.  A Framework for Collaborative Curation of Neuroscientific Literature , 2017, Front. Neuroinform..

[42]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..