Text-mining and neuroscience.

The wealth and diversity of neuroscience research are inherent characteristics of the discipline that can give rise to some complications. As the field continues to expand, we generate a great deal of data about all aspects, and from multiple perspectives, of the brain, its chemistry, biology, and how these affect behavior. The vast majority of research scientists cannot afford to spend their time combing the literature to find every article related to their research, nor do they wish to spend time adjusting their neuroanatomical vocabulary to communicate with other subdomains in the neurosciences. As such, there has been a recent increase in the amount of informatics research devoted to developing digital resources for neuroscience research. Neuroinformatics is concerned with the development of computational tools to further our understanding of the brain and to make sense of the vast amount of information that neuroscientists generate (French & Pavlidis, 2007). Many of these tools are related to the use of textual data. Here, we review some of the recent developments for better using the vast amount of textual information generated in neuroscience research and publication and suggest several use cases that will demonstrate how bench neuroscientists can take advantage of the resources that are available.

[1]  Aaron M. Cohen,et al.  An Effective General Purpose Approach for Automated Biomedical Document Classification , 2006, AMIA.

[2]  Cathy H. Wu,et al.  Studying Biocuration Workflows , 2009 .

[3]  Maryann E. Martone,et al.  Ontologies for Neuroscience: What are they and What are they Good for? , 2008, Frontiers in neuroscience.

[4]  Andrew McCallum,et al.  A comparison of event models for naive bayes text classification , 1998, AAAI 1998.

[5]  Giorgio A. Ascoli Twenty Questions for Neuroscience Metadata , 2012, Neuroinformatics.

[6]  Bruce R. Rosen,et al.  Enabling collaborative research using the Biomedical Informatics Research Network (BIRN) , 2011, J. Am. Medical Informatics Assoc..

[7]  P. Bork,et al.  Literature mining for the biologist: from information retrieval to biological discovery , 2006, Nature Reviews Genetics.

[8]  Chris Mungall,et al.  A knowledge based approach to matching human neurodegenerative disease and animal models , 2013, Front. Neuroinform..

[9]  Bradley Voytek,et al.  Automated cognome construction and semi-automated hypothesis generation , 2012, Journal of Neuroscience Methods.

[10]  Leon French,et al.  Neuroinformatics Original Research Article , 2022 .

[11]  Eduard H. Hovy,et al.  Layout-aware text extraction from full-text PDF of scientific articles , 2012, Source Code for Biology and Medicine.

[12]  Aaron M. Cohen,et al.  Case Report: Five-way Smoking Status Classification Using Text Hot-Spot Identification and Error-correcting Output Codes , 2008, J. Am. Medical Informatics Assoc..

[13]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[14]  Maryann E. Martone,et al.  An ontological approach to describing neurons and their relationships , 2012, Front. Neuroinform..

[15]  D. Detmer American Medical Informatics Association (AMIA) 2007 Annual Symposium , 2008 .

[16]  Giorgio A. Ascoli,et al.  The Coming of Age of the Hippocampome , 2010, Neuroinformatics.

[17]  Judith A. Blake,et al.  The Mouse Genome Database (MGD): mouse biology and model systems , 2007, Nucleic Acids Res..

[18]  David N. Kennedy The Benefits of Preparing Data for Sharing Even When You Don’t , 2012, Neuroinformatics.

[19]  Douglas M. Bowden,et al.  NeuroNames 2002 , 2003, Neuroinformatics.

[20]  Jessica A. Turner,et al.  The NIFSTD and BIRNLex Vocabularies: Building Comprehensive Ontologies for Neuroscience , 2008, Neuroinformatics.

[21]  Nathan J. Bahr,et al.  Discovering Synergistic Qualities of Published Authors to Enhance Translational Research , 2008, AMIA.

[22]  Peter A. Groblewski,et al.  Drug-induced conditioned place preference and aversion in mice , 2006, Nature Protocols.

[23]  Douglas M. Bowden,et al.  Scientific Demonstration Abstracts. Demonstration Abstracts: Educational and Tutoring Systems: NeuroNames©: Human/Macaque Neuroanatomical Nomenclature , 1990 .

[24]  Jeremy D. Schmahmann,et al.  A Proposal for a Coordinated Effort for the Determination of Brainwide Neuroanatomical Connectivity in Model Organisms at a Mesoscopic Scale , 2009, PLoS Comput. Biol..

[25]  Aaron M. Cohen,et al.  Research Paper: Cross-Topic Learning for Work Prioritization in Systematic Review Creation and Update , 2009, J. Am. Medical Informatics Assoc..

[26]  K. Bretonnel Cohen,et al.  Original article Text mining for the biocuration workflow , 2012 .

[27]  M. F. Huerta,et al.  The National Institutes of Health Blueprint for Neuroscience Research , 2006, The Journal of Neuroscience.

[28]  Hans-Michael Müller,et al.  Textpresso for Neuroscience: Searching the Full Text of Thousands of Neuroscience Research Papers , 2008, Neuroinformatics.

[29]  Nello Cristianini,et al.  Comparison of vector space model methodologies to reconcile cross-species neuroanatomical concepts , 2005, Neuroinformatics.

[30]  Paul W. Sternberg,et al.  The gene lin-3 encodes an inductive signal for vulval development in C. elegans , 1992, Nature.

[31]  Amarnath Gupta,et al.  Development and use of Ontologies Inside the Neuroscience Information Framework: A Practical Approach , 2012, Front. Gene..

[32]  S. Mitchell,et al.  Effects of multiple delayed rewards on delay discounting in an adjusting amount procedure , 2003, Behavioural Processes.

[33]  Philip S. Yu,et al.  Evidence-based medicine, the essential role of systematic reviews, and the need for automated text mining tools , 2010, IHI.

[34]  Ellen Riloff,et al.  The Role of Information Extraction in the Design of a Document Triage Application for Biocuration , 2011, BioNLP@ACL.

[35]  Paul Pavlidis,et al.  Using text mining to link journal articles to neuroanatomical databases , 2012, The Journal of comparative neurology.

[36]  Leon French,et al.  Informatics in neuroscience , 2007, Briefings Bioinform..

[37]  G Tononi,et al.  Theoretical neuroanatomy: relating anatomical and functional connectivity in graphs and cortical connection matrices. , 2000, Cerebral cortex.

[38]  Larry W. Swanson,et al.  The neuron classification problem , 2007, Brain Research Reviews.

[39]  Douglas M. Bowden,et al.  NeuroNames Brain Hierarchy , 1995, NeuroImage.

[40]  M. Haendel,et al.  Dealing with Data: A Case Study on Information and Data Management Literacy , 2012, PLoS biology.

[41]  Hans-Michael Müller,et al.  The Neuroscience Information Framework: A Data and Knowledge Environment for Neuroscience , 2008, Neuroinformatics.

[42]  Philip V. Ogren,et al.  Knowtator: A Protégé plug-in for annotated corpus construction , 2006, NAACL.

[43]  Perry L. Miller,et al.  The Human Brain Project: neuroinformatics tools for integrating, searching and modeling multidisciplinary neuroscience data , 1998, Trends in Neurosciences.

[44]  Aaron M. Cohen,et al.  k-Information Gain Scaled Nearest Neighbors: A Novel Approach to Classifying Protein-Protein Interaction-Related Documents , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[45]  H AmbertKyle,et al.  k-Information Gain Scaled Nearest Neighbors , 2012 .

[46]  K. Bretonnel Cohen,et al.  Text mining for the biocuration workflow , 2012, Database J. Biol. Databases Curation.

[47]  Jack Park,et al.  Creating neuroscience ontologies. , 2007, Methods in molecular biology.

[48]  Leon Hayes French Bioinformatics for neuroanatomical connectivity , 2012 .

[49]  Aaron M. Cohen,et al.  Research Paper: A System for Classifying Disease Comorbidity Status from Medical Discharge Summaries Using Automated Hotspot and Negated Concept Detection , 2009, J. Am. Medical Informatics Assoc..

[50]  Mark Ellisman,et al.  e-Neuroscience: challenges and triumphs in integrating distributed data from molecules to brains , 2004, Nature Neuroscience.

[51]  Amarnath Gupta,et al.  NIFSTD and NeuroLex: Comprehensive Neuroscience Ontology Development Based on Multiple Biomedical Ontologies and Community Involvement , 2011, ICBO.

[52]  Martin Rf,et al.  NeuroNames©: Human/Macaque Neuroanatomical Nomenclature. , 1990 .

[53]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[54]  Hong Wei Dong,et al.  Allen reference atlas : a digital color brain atlas of the C57Black/6J male mouse , 2008 .

[55]  Eduard H. Hovy,et al.  Intelligent Approaches to Mining the Primary Research Literature: Techniques, Systems, and Examples , 2008, Computational Intelligence in Medical Informatics.

[56]  Hans-Michael Müller,et al.  Textpresso: An Ontology-Based Information Retrieval and Extraction System for Biological Literature , 2004, PLoS biology.

[57]  Allan R. Jones,et al.  Genome-wide atlas of gene expression in the adult mouse brain , 2007, Nature.

[58]  R. Rodgers,et al.  Anxiety, defence and the elevated plus-maze , 1997, Neuroscience & Biobehavioral Reviews.

[59]  Martone Maryann A multi-scale parts list for the brain: community-based ontology curation for neuroinformatics with NeuroLex.org , 2010 .

[60]  L. S. Jacyna,et al.  Nineteenth-Century Origins of Neuroscientific Concepts , 1987 .

[61]  J. Price :Allen Reference Atlas: A Digital Color Brain Atlas of the C57BL/6J Male Mouse , 2008 .

[62]  Aaron M. Cohen,et al.  SYRIAC: The SYstematic Review Information Automated Collection System A Data Warehouse for Facilitating Automated Biomedical Text Classification , 2008, AMIA.

[63]  William R. Hersh,et al.  Information Retrieval: A Health and Biomedical Perspective , 2002 .

[64]  Hans-Michael Müller,et al.  Federated Access to Heterogeneous Information Resources in the Neuroscience Information Framework (NIF) , 2008, Neuroinformatics.

[65]  William R. Hersh,et al.  A survey of current work in biomedical text mining , 2005, Briefings Bioinform..

[66]  Mark A. Musen,et al.  Creating Mappings For Ontologies in Biomedicine: Simple Methods Work , 2009, AMIA.

[67]  Lars Kai Hansen,et al.  Mining for associations between text and brain activation in a functional neuroimaging database , 2007, Neuroinformatics.

[68]  Leon French,et al.  Relationships between Gene Expression and Brain Wiring in the Adult Rodent Brain , 2011, PLoS Comput. Biol..

[69]  Larry W. Swanson,et al.  BAMS Neuroanatomical Ontology: Design and Implementation , 2008, Frontiers Neuroinformatics.