Using Deep Learning to Support High-Quality Evidence Mapping

The Living Evidence Map Project at the Norwegian Institute of Public Health (NIPH) gives an updated overview of research results and publications. As part of NIPH’s mandate to inform evidence-based infection prevention, control and treatment, a large group of experts are continously monitoring, assessing, coding and summarising new COVID-19 publications. Screening tools, coding practice and workflow are incrementally improved, but remain largely manual. This paper describes how deep learning methods have been employed to learn classification and coding from the steadily growing NIPH COVID-19 dashboard data, so as to aid manual classification, screening and preprocessing of the rapidly growing influx of new papers on the subject. Our main objective is to make manual screening scalable through semi-automation, while ensuring high-quality Evidence Map content. This paper reports on early results on classifying publication topic and type from titles and abstracts, showing that even simple neural network architectures and text representations can yield acceptable performance.

[1]  Isabelle Boutron,et al.  Future of evidence ecosystem series: 2. Current opportunities and need for better tools and methods. , 2020, Journal of clinical epidemiology.

[2]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[3]  Christian Riess,et al.  A Gentle Introduction to Deep Learning in Medical Image Processing , 2018, Zeitschrift fur medizinische Physik.

[4]  Bo Zhao,et al.  Deep learning in clinical natural language processing: a methodical review , 2019, J. Am. Medical Informatics Assoc..

[5]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[6]  T. Hoffmann,et al.  Waste in covid-19 research , 2020, BMJ.

[7]  L. Hayden,et al.  Ten Commandments for Effective Clinical Decision Support: Making the Practice of Evidence-based Medicine a Reality , 2011 .

[8]  Sophia Ananiadou,et al.  Topic detection using paragraph vectors to support active learning in systematic reviews , 2016, J. Biomed. Informatics.

[9]  Linn Brandt,et al.  Evidence ecosystems and learning health systems: why bother? , 2020, Journal of clinical epidemiology.

[10]  Yi Pan,et al.  Automated ICD-9 Coding via A Deep Learning Approach , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[11]  S. Ananiadou,et al.  Using text mining for study identification in systematic reviews: a systematic review of current approaches , 2015, Systematic Reviews.

[12]  Oren Etzioni,et al.  CORD-19: The Covid-19 Open Research Dataset , 2020, NLPCOVID19.

[13]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[14]  Matthew Michelson,et al.  A Deep Learning Method to Automatically Identify Reports of Scientifically Rigorous Clinical Research from the Biomedical Literature: Comparative Analytic Study , 2018, Journal of medical Internet research.

[15]  Stephane M Meystre,et al.  Impact of De-Identification on Clinical Text Classification Using Traditional and Deep Learning Classifiers , 2019, MedInfo.

[16]  Sophia Ananiadou,et al.  Prioritising references for systematic reviews with RobotAnalyst: A user study , 2018, Research synthesis methods.

[17]  Sandy Oliver,et al.  The politics of evidence and methodology: lessons from the EPPI-Centre , 2005 .

[18]  Tingting Mu,et al.  Self-Tuned Descriptive Document Clustering Using a Predictive Network , 2018, IEEE Transactions on Knowledge and Data Engineering.

[19]  M. Taylor,et al.  In Deep , 2016 .

[20]  Jianfeng Gao,et al.  Deep Learning Based Text Classification: A Comprehensive Review , 2020, ArXiv.