论文信息 - Spá: A Web-Based Viewer for Text Mining in Evidence Based Medicine

Spá: A Web-Based Viewer for Text Mining in Evidence Based Medicine

Summarizing the evidence about medical interventions is an immense undertaking, in part because unstructured Portable Document Format (PDF) documents remain the main vehicle for disseminating scientific findings. Clinicians and researchers must therefore manually extract and synthesise information from these PDFs. We introduce Spa1,2 a web-based viewer that enables automated annotation and summarisation of PDFs via machine learning. To illustrate its functionality, we use Spa to semi-automate the assessment of bias in clinical trials. Spa has a modular architecture, therefore the tool may be widely useful in other domains with a PDF-based literature, including law, physics, and biology.

[1] J. Sterne,et al. The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials , 2011, BMJ : British Medical Journal.

[2] Lisa Hartling,et al. Risk of bias versus quality assessment of randomised controlled trials: cross sectional study , 2009, BMJ : British Medical Journal.

[3] Lisa Hartling,et al. Applying the Risk of Bias Tool in a Systematic Review of Combination Long-Acting Beta-Agonists and Inhaled Corticosteroids for Persistent Asthma , 2011, PloS one.

[4] Massimiliano Pontil,et al. Regularized multi--task learning , 2004, KDD.

[5] Tommi Tervonen,et al. Deficiencies in the transfer and availability of clinical trials evidence: a review of existing systems and standards , 2012, BMC Medical Informatics and Decision Making.

[6] Daniel Jurafsky,et al. Distant supervision for relation extraction without labeled data , 2009, ACL.

[7] Alessandro Moschitti,et al. End-to-End Relation Extraction Using Distant Supervision from External Semantic Repositories , 2011, ACL.

[8] Hal Daumé,et al. Frustratingly Easy Domain Adaptation , 2007, ACL.

[9] D. Sackett,et al. Evidence based medicine: what it is and what it isn't , 1996, BMJ.