Constructing a Syndromic Terminology Resource for Veterinary Text Mining

Public health surveillance systems rely on the automated monitoring of large amounts of text. While building a text mining system for veterinary syndromic surveillance, we exploit automatic and semi-automatic methods for terminology construction at different stages. Our approaches include term extraction from free-text, grouping of term variants based on string similarity, and linking to an existing medical ontology.

[1]  Son Doan,et al.  BioCaster: detecting public health rumors with a Web-based text mining system , 2008, Bioinform..

[2]  John Berezowski,et al.  Mining free-text medical records for companion animal enteric syndrome surveillance. , 2014, Preventive veterinary medicine.

[3]  Wendy W. Chapman,et al.  Fever detection from free-text clinical records for biosurveillance , 2004, Journal of Biomedical Informatics.

[4]  Wendy W. Chapman,et al.  Document-level classification of CT pulmonary angiography reports based on an extension of the ConText algorithm , 2011, J. Biomed. Informatics.

[5]  E. Morignat,et al.  Unsupervised clustering of wildlife necropsy data for syndromic surveillance , 2010, BMC veterinary research.

[6]  Ben Y. Reis,et al.  Surveillance Sans Frontières: Internet-Based Emerging Infectious Disease Intelligence and the HealthMap Project , 2008, PLoS medicine.

[7]  Marianthi Markatou,et al.  Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection , 2011, J. Am. Medical Informatics Assoc..

[8]  Nigel Collier,et al.  A multilingual ontology for infectious disease surveillance: rationale, design and challenges , 2007, Lang. Resour. Evaluation.

[9]  Ralf Steinberger,et al.  Text Mining from the Web for Medical Intelligence , 2007, NATO ASI Mining Massive Data Sets for Security.

[10]  Naoaki Okazaki,et al.  Simple and Efficient Algorithm for Approximate Dictionary Matching , 2010, COLING.

[11]  Crawford W. Revie,et al.  Exploratory Analysis of Methods for Automated Classification of Laboratory Test Orders into Syndromic Groups in Veterinary Medicine , 2013, PloS one.

[12]  J. Marc Overhage,et al.  Using Natural Language Processing to Improve Accuracy of Automated Notifiable Disease Reporting , 2008, AMIA.

[13]  Fabio Rinaldi,et al.  Terminology as knowledge in answer extraction , 2002 .

[14]  A Mawudeku,et al.  Landscape of international event-based biosurveillance , 2010, Emerging health threats journal.

[15]  Toon Calders,et al.  A Hybrid Disambiguation Measure for Inaccurate Cultural Heritage Data , 2014, LaTeCH@EACL.

[16]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[17]  W. Chapman,et al.  Syndrome and outbreak detection using chief-complaint data--experience of the Real-Time Outbreak and Disease Surveillance project. , 2004, MMWR supplements.

[18]  Maria Kvist,et al.  Negation Scope Delimitation in Clinical Text Using Three Approaches: NegEx, PyConTextNLP and SynNeg , 2013, NODALIDA.

[19]  Kurt L Zimmerman,et al.  Uses of informatics to solve real world problems in veterinary medicine. , 2011, Journal of veterinary medical education.

[20]  Ron Artstein,et al.  Survey Article: Inter-Coder Agreement for Computational Linguistics , 2008, CL.

[21]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[22]  Gonzalo Navarro,et al.  A guided tour to approximate string matching , 2001, CSUR.

[23]  Elmer V. Bernstam,et al.  Toward a veterinary informatics research agenda: An analysis of the PubMed-indexed literature , 2007, Int. J. Medical Informatics.

[24]  Peter Christen,et al.  A Comparison of Personal Name Matching: Techniques and Practical Issues , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).