MMiDaS-AE: multi-modal missing data aware stacked autoencoder for biomedical abstract screening

Systematic review (SR) is an essential process to identify, evaluate, and summarize the findings of all relevant individual studies concerning health-related questions. However, conducting an SR is labor-intensive: identifying relevant studies requires multiple researchers to screen thousands of articles for relevance. In this paper, we propose MMiDaS-AE, a Multi-modal Missing Data aware Stacked Autoencoder, for semi-automating screening for SRs. We use a multi-modal view that exploits three representations: 1) documents, 2) topics, and 3) citation networks. Documents that contain similar words will be nearby in the document embedding space. Models can also exploit the relationship between documents and the associated SR MeSH terms to capture article relevancy. Finally, related works will likely share the same citations, so closely related articles would, intuitively, be trained to be close to each other in the citation embedding space. However, using all three learned representations directly as features results in an unwieldy number of parameters. Thus, motivated by recent work on multi-modal auto-encoders, we adopt a multi-modal stacked autoencoder that learns a shared representation encoding all three representations in a compressed space. In practice, one or more of these modalities may be missing for an article (e.g., if we cannot recover citation information). Therefore, we propose to learn to impute the shared representation even when specific inputs are missing. We find this new model significantly improves performance on a dataset consisting of 15 SRs compared to existing approaches.
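To make the shared-representation idea concrete, the following is a minimal sketch rather than the authors' implementation. It assumes a single encoder/decoder layer in NumPy (the model described above is stacked), hypothetical embedding sizes in `DIMS` and `SHARED_DIM`, and zero-filling as the way a missing modality is presented to the encoder. The reconstruction target is the full three-modality input, so minimizing this loss would push the encoder to produce a usable shared code even when the citation embedding is absent. All function names here are illustrative assumptions.

```python
import numpy as np

# Hypothetical embedding sizes for the three modalities (not taken from the paper).
DIMS = {"document": 8, "topic": 6, "citation": 4}
SHARED_DIM = 5


def init_params(rng, dims=DIMS, shared_dim=SHARED_DIM):
    """Randomly initialise a one-layer autoencoder over the concatenated modalities."""
    total = sum(dims.values())
    return {
        "W_enc": rng.normal(scale=0.1, size=(total, shared_dim)),
        "b_enc": np.zeros(shared_dim),
        "W_dec": rng.normal(scale=0.1, size=(shared_dim, total)),
        "b_dec": np.zeros(total),
    }


def concat_with_missing(inputs, dims=DIMS):
    """Concatenate modalities; a modality that is None or absent is zero-filled.

    Returns the concatenated matrix and a 0/1 mask marking observed columns.
    """
    n = next(v.shape[0] for v in inputs.values() if v is not None)
    blocks, mask_blocks = [], []
    for name, d in dims.items():
        x = inputs.get(name)
        if x is None:
            blocks.append(np.zeros((n, d)))
            mask_blocks.append(np.zeros((n, d)))
        else:
            blocks.append(x)
            mask_blocks.append(np.ones((n, d)))
    return np.hstack(blocks), np.hstack(mask_blocks)


def encode(params, x):
    """Map the (possibly zero-filled) input to the shared representation."""
    return np.tanh(x @ params["W_enc"] + params["b_enc"])


def decode(params, h):
    """Reconstruct all three modalities from the shared representation."""
    return h @ params["W_dec"] + params["b_dec"]


def reconstruction_loss(params, x_input, x_target):
    """Mean squared error between the reconstruction of x_input and x_target."""
    recon = decode(params, encode(params, x_input))
    return float(np.mean((recon - x_target) ** 2))


rng = np.random.default_rng(0)
params = init_params(rng)

# Three toy articles with all modalities observed.
full = {name: rng.normal(size=(3, d)) for name, d in DIMS.items()}
x_full, _ = concat_with_missing(full)

# The same articles with the citation embedding treated as missing.
x_partial, mask = concat_with_missing({**full, "citation": None})

shared = encode(params, x_partial)                      # shared code from partial input
loss = reconstruction_loss(params, x_partial, x_full)   # target is the full input
```

In a training loop, one plausible design (again an assumption) is to drop modalities at random for articles where everything is observed, so the model sees the same missingness patterns at training time that it will encounter at screening time.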
