Menagerie: A text-mining tool to support animal-human translation in neurodegeneration research

Discovery studies in animals constitute a cornerstone of biomedical research, but suffer from lack of generalizability to human populations. We propose that large-scale interrogation of these data could reveal patterns of animal use that could narrow the translational divide. We describe a text-mining approach that extracts translationally useful data from PubMed abstracts. These comprise six modules: species, model, genes, interventions/disease modifiers, overall outcome and functional outcome measures. Existing National Library of Medicine natural language processing tools (SemRep, GNormPlus and the Chemical annotator) underpin the program and are further augmented by various rules, term lists, and machine learning models. Evaluation of the program using a 98-abstract test set achieved F1 scores ranging from 0.75–0.95 across all modules, and exceeded F1 scores obtained from comparable baseline programs. Next, the program was applied to a larger 14,481 abstract data set (2008–2017). Expected and previously identified patterns of species and model use for the field were obtained. As previously noted, the majority of studies reported promising outcomes. Longitudinal patterns of intervention type or gene mentions were demonstrated, and patterns of animal model use characteristic of the Parkinson’s disease field were confirmed. The primary function of the program is to overcome low external validity of animal model systems by aggregating evidence across a diversity of models that capture different aspects of a multifaceted cellular process. Some aspects of the tool are generalizable, whereas others are field-specific. In the initial version presented here, we demonstrate proof of concept within a single disease area, Parkinson’s disease. However, the program can be expanded in modular fashion to support a wider range of neurodegenerative diseases.

[1]  J. V. Van Raamsdonk,et al.  Modeling Parkinson’s Disease in C. elegans , 2018, Journal of Parkinson's disease.

[2]  A. Pisani,et al.  Promising rodent models in Parkinson's disease. , 2018, Parkinsonism & related disorders.

[3]  K. Mullane,et al.  Preclinical Models of Alzheimer's Disease: Relevance and Translational Validity , 2019, Current protocols in pharmacology.

[4]  A. Lang,et al.  α-Synuclein-Based Animal Models of Parkinson's Disease: Challenges and Opportunities in a New Era , 2016, Trends in Neurosciences.

[5]  Michael B Bracken,et al.  Is animal research sufficiently evidence based to be a cornerstone of biomedical research? , 2014, BMJ : British Medical Journal.

[6]  Isabelle Boutron,et al.  Misrepresentation and distortion of research in biomedical literature , 2018, Proceedings of the National Academy of Sciences.

[7]  V. Francardo,et al.  Animal models of l-DOPA–induced dyskinesia: an update on the current options , 2012, Neuroscience.

[8]  S. Lazic,et al.  A call for transparent reporting to optimize the predictive value of preclinical research , 2012, Nature.

[9]  L. Breger,et al.  Genetically engineered animal models of Parkinson's disease: From worm to rodent , 2019, The European journal of neuroscience.

[10]  Zhiyong Lu,et al.  PubTator: a web-based text mining tool for assisting biocuration , 2013, Nucleic Acids Res..

[11]  Marcus R Munafò,et al.  Significance chasing in research practice: causes, consequences and possible solutions. , 2014, Addiction.

[12]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[13]  Laurie M. Ryan,et al.  NIA Commentary: Translational Issues in Alzheimer’s Disease Drug Development , 2012, Neuropsychopharmacology.

[14]  F. Scheperjans Gut microbiota, 1013 new pieces in the Parkinson's disease puzzle. , 2016, Current opinion in neurology.

[15]  G. Novack Translating Drugs From Animals to Humans: Do We Need to Prove Efficacy? , 2013, Translational vision science & technology.

[16]  R. Harrison,et al.  Phase II and phase III failures: 2013–2015 , 2016, Nature Reviews Drug Discovery.

[17]  Zhiyong Lu,et al.  tmBioC: improving interoperability of text-mining tools with BioC , 2014, Database J. Biol. Databases Curation.

[18]  R. Porter,et al.  Failure to publish negative results: A systematic bias in psychiatric literature , 2017, The Australian and New Zealand journal of psychiatry.

[19]  J. Scannell,et al.  Diagnosing the decline in pharmaceutical R&D efficiency , 2012, Nature Reviews Drug Discovery.

[20]  D. Howells,et al.  Publication Bias in Reports of Animal Stroke Studies Leads to Major Overstatement of Efficacy , 2010, PLoS biology.

[21]  R. Takahashi,et al.  Parkinson’s disease pathogenesis from the viewpoint of small fish models , 2017, Journal of Neural Transmission.

[22]  Allan R. Jones,et al.  Conserved cell types with divergent features in human versus mouse cortex , 2019, Nature.

[23]  Jianhua Li,et al.  Analysis of Polarity Information in Medical Text , 2005, AMIA.

[24]  C. Zeiss Improving the predictive value of interventional animal models data. , 2015, Drug discovery today.

[25]  I. Cuthill,et al.  Survey of the Quality of Experimental Design, Statistical Analysis and Reporting of Research Using Animals , 2009, PloS one.

[26]  J. Arrowsmith,et al.  Trial Watch: Phase II and Phase III attrition rates 2011–2012 , 2013, Nature Reviews Drug Discovery.

[27]  T. Südhof,et al.  Cell Biology and Pathophysiology of α-Synuclein. , 2018, Cold Spring Harbor perspectives in medicine.

[28]  J. Ioannidis Extrapolating from Animals to Humans , 2012, Science Translational Medicine.

[29]  J. Ioannidis Acknowledging and Overcoming Nonreproducibility in Basic and Preclinical Research. , 2017, JAMA.

[30]  J. Bolker,et al.  Model organisms: There's more to life than rats and flies , 2012, Nature.

[31]  P. Snoy Establishing Efficacy of Human Products Using Animals , 2010, Veterinary pathology.

[32]  Sophia Ananiadou,et al.  Thalia: semantic search engine for biomedical abstracts , 2018, Bioinform..

[33]  H. Allore,et al.  Established patterns of animal study design undermine translation of disease-modifying therapies for Parkinson’s disease , 2017, PloS one.

[34]  B. Brembs Prestigious Science Journals Struggle to Reach Even Average Reliability , 2018, Front. Hum. Neurosci..

[35]  B. Liu,et al.  The lipopolysaccharide Parkinson’s disease animal model: mechanistic studies and drug discovery , 2008, Fundamental & clinical pharmacology.

[36]  M. Delong,et al.  Primate models of movement disorders of basal ganglia origin , 1990, Trends in Neurosciences.

[37]  A. Crossman,et al.  Animal models of l‐dopa‐induced dyskinesia in Parkinson's disease , 2018, Movement disorders : official journal of the Movement Disorder Society.

[38]  Ana Mlinarić,et al.  Dealing with the positive publication bias: Why you should really publish your negative results , 2017, Biochemia medica.

[39]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[40]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[41]  G. Perry,et al.  Alzheimer disease research in the 21st century: past and current failures, new perspectives and funding priorities , 2016, Oncotarget.

[42]  R. Nussbaum The Identification of Alpha-Synuclein as the First Parkinson Disease Gene , 2017, Journal of Parkinson's disease.

[43]  Laura Avanzino,et al.  Rehabilitation for Parkinson's disease: Current outlook and future challenges. , 2016, Parkinsonism & related disorders.

[44]  S. Fox,et al.  Symptomatic Models of Parkinson's Disease and L-DOPA-Induced Dyskinesia in Non-human Primates. , 2015, Current topics in behavioral neurosciences.

[45]  R. Ransohoff All (animal) models (of neurodegeneration) are wrong. Are they also useful? , 2018, The Journal of experimental medicine.

[46]  J. Jankovic,et al.  Targeting α-Synuclein in Parkinson’s Disease: Progress Towards the Development of Disease-Modifying Therapeutics , 2019, Drugs.

[47]  S. Lewis,et al.  Current Treatment Options for Alzheimer's Disease and Parkinson's Disease Dementia , 2016, Current neuropharmacology.

[48]  C. Zeiss From Reproducibility to Translation in Neurodegenerative Disease. , 2017, ILAR journal.

[49]  Sampo Pyysalo,et al.  brat: a Web-based Tool for NLP-Assisted Text Annotation , 2012, EACL.

[50]  J. E. Kranz,et al.  Design, power, and interpretation of studies in the standard murine model of ALS , 2008, Amyotrophic lateral sclerosis : official publication of the World Federation of Neurology Research Group on Motor Neuron Diseases.

[51]  J. Ioannidis,et al.  P values in display items are ubiquitous and almost invariably significant: A survey of top science journals , 2018, PloS one.

[52]  F. Collins,et al.  Policy: NIH plans to enhance reproducibility , 2014, Nature.

[53]  A. Lang,et al.  Pharmacological treatment of Parkinson disease: a review. , 2014, JAMA.

[54]  C. Elliott,et al.  Neurophysiology of Drosophila Models of Parkinson's Disease , 2015, Parkinson's disease.

[55]  Guy Beauchamp,et al.  ARRIVE has not ARRIVEd: Support for the ARRIVE (Animal Research: Reporting of in vivo Experiments) guidelines does not improve the reporting quality of papers in animal welfare, analgesia or anesthesia , 2018, PloS one.

[56]  John P. Overington,et al.  Classification and analysis of a large collection of in vivo bioassay descriptions , 2017, PLoS Comput. Biol..

[57]  Pierre Tufféry,et al.  Accessing external innovation in drug discovery and development , 2015, Expert opinion on drug discovery.

[58]  O. Rascol,et al.  New treatments for levodopa‐induced motor complications , 2015, Movement disorders : official journal of the Movement Disorder Society.

[59]  H. Bergman,et al.  Reversal of experimental parkinsonism by lesions of the subthalamic nucleus. , 1990, Science.

[60]  J. Volkmann DEEP BRAIN STIMULATION II Deep Brain Stimulation for the Treatment of Parkinson’s Disease , 2004 .

[61]  J. Lafuente,et al.  Nanoformulation: A Useful Therapeutic Strategy for Improving Neuroprotection and the Neurorestorative Potential in Experimental Models of Parkinson's Disease. , 2017, International review of neurobiology.

[62]  Marlies Leenaars,et al.  Publication Bias in Laboratory Animal Research: A Survey on Magnitude, Drivers, Consequences and Potential Solutions , 2012, PloS one.

[63]  P. Gubellini,et al.  Animal models of Parkinson's disease: An updated overview. , 2015, Revue neurologique.

[64]  S. Perrin Preclinical research: Make mouse studies work , 2014, Nature.

[65]  Zhiyong Lu,et al.  GNormPlus: An Integrative Approach for Tagging Genes, Gene Families, and Protein Domains , 2015, BioMed research international.

[66]  J. Ioannidis,et al.  Evaluation of Excess Significance Bias in Animal Studies of Neurological Diseases , 2013, PLoS biology.

[67]  Giuseppe Di Giovanni,et al.  Animal models of early-stage Parkinson’s disease and acute dopamine deficiency to study compensatory neurodegenerative mechanisms , 2018, Journal of Neuroscience Methods.

[68]  David S. Park,et al.  Animal Models of Parkinson's Disease , 2011, Parkinson's disease.

[69]  Alberto J Espay,et al.  Disease Modification in Parkinson's Disease: Current Approaches, Challenges, and Future Considerations , 2018, Movement disorders : official journal of the Movement Disorder Society.

[70]  C. Perier,et al.  Neurotoxin-based models of Parkinson's disease , 2012, Neuroscience.

[71]  A. Schapira,et al.  Why have we failed to achieve neuroprotection in Parkinson's disease? , 2008, Annals of neurology.

[72]  J. Ioannidis,et al.  Reproducibility in Science: Improving the Standard for Basic and Preclinical Research , 2015, Circulation research.

[73]  Olivier Bodenreider,et al.  Aggregating UMLS Semantic Types for Reducing Conceptual Complexity , 2001, MedInfo.

[74]  H. Heng,et al.  Heng Reductionism The Conflict Between Complex Systems and Correction , 2008 .

[75]  E. Bézard,et al.  Animal models of Parkinson's disease: Limits and relevance to neuroprotection studies , 2013, Movement disorders : official journal of the Movement Disorder Society.

[76]  P. Calabresi,et al.  Motor complications in Parkinson's disease: Striatal molecular and electrophysiological mechanisms of dyskinesias , 2018, Movement disorders : official journal of the Movement Disorder Society.

[77]  D. Howells,et al.  Can Animal Models of Disease Reliably Inform Human Studies? , 2010, PLoS medicine.

[78]  T. Curran Reproducibility of academic preclinical translational research: lessons from the development of Hedgehog pathway inhibitors to treat cancer , 2018, Open Biology.

[79]  I. Cuthill,et al.  Animal Research: Reporting In Vivo Experiments: The ARRIVE Guidelines , 2010, British journal of pharmacology.

[80]  P. Pound,et al.  Is it possible to overcome issues of external validity in preclinical animal research? Why most animal models are bound to fail , 2018, Journal of Translational Medicine.

[81]  David B. Fogel,et al.  Factors associated with clinical trials that fail and opportunities for improving the likelihood of success: A review , 2018, Contemporary clinical trials communications.

[82]  Aidong Zhang,et al.  A survey on literature based discovery approaches in biomedical domain , 2019, J. Biomed. Informatics.

[83]  J. Winderickx,et al.  Yeast models of Parkinson's disease-associated molecular pathologies. , 2017, Current opinion in genetics & development.

[84]  M. Morissette,et al.  Non-human primate models of PD to test novel therapies , 2018, Journal of Neural Transmission.

[85]  Marcelo Fiszman,et al.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text , 2003, J. Biomed. Informatics.