Ir-Man: An Information Retrieval Framework for Marine Animal Necropsy Analysis

This paper proposes Ir-Man (Information Retrieval for Marine Animal Necropsies), a framework for retrieving discrete information from marine mammal post-mortem reports for statistical analysis. When a marine mammal is reported dead after stranding in Scotland, the carcass is examined by the Scottish Marine Animal Strandings Scheme (SMASS) to establish the circumstances of the animal's death. This involves the creation of a "post-mortem" (or necropsy) report, which systematically describes the body. These semi-structured reports record lesions (damage or abnormalities to anatomical regions) as well as other observations. Observations embedded within these texts are used to determine cause of death. While a cause of death is recorded separately, many other descriptions may be of pathological and epidemiological significance when aggregated and analysed collectively. As manual extraction of these descriptions is costly, time consuming and at times erroneous, there is a need for an automated information retrieval mechanism which is a non-trivial task given the wide variety of possible descriptions, pathologies and species. The Ir-Man framework consists of a new ontology, a lexicon of observations and anatomical terms and an entity relation engine for information retrieval and statistics generation from a pool of necropsy reports. We demonstrate the effectiveness of our framework by creating a rule-based binary classifier for identifying bottlenose dolphin attacks (BDA) in harbour porpoise gross pathology reports and achieved an accuracy of 83.4%.

[1]  P. Jepson,et al.  Levels of Polychlorinated Biphenyls Are Still Associated with Toxic Effects in Harbor Porpoises (Phocoena phocoena) Despite Having Fallen below Proposed Toxicity Thresholds. , 2020, Environmental science & technology.

[2]  Lejun Gong,et al.  A dictionary-based approach to identify biomedical concepts , 2015, 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD).

[3]  B. Godley,et al.  Microplastics in marine mammals stranded around the British coast: ubiquitous but transitory? , 2019, Scientific Reports.

[4]  Hongyuan Gao,et al.  Using natural language processing to extract mammographic findings , 2015, J. Biomed. Informatics.

[5]  M. Craven,et al.  Machine learning for syndromic surveillance using veterinary necropsy reports , 2020, PloS one.

[6]  S. Bhanumathi,et al.  Identifying symptoms and treatment for heart disease from biomedical literature using text data mining , 2017, 2017 International Conference on Computation of Power, Energy Information and Commuincation (ICCPEIC).

[7]  P. Jepson,et al.  Juvenile harbor porpoises in the UK are exposed to a more neurotoxic mixture of polychlorinated biphenyls than adults. , 2019, The Science of the total environment.

[8]  S. Lewis,et al.  Uberon, an integrative multi-species anatomy ontology , 2012, Genome Biology.

[9]  Hans-Ulrich Prokosch,et al.  Ontology-Based Data Integration between Clinical and Research Systems , 2015, PloS one.

[10]  Fabio Rinaldi,et al.  Constructing a Syndromic Terminology Resource for Veterinary Text Mining , 2015, TIA.

[11]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[12]  Samson W. Tu,et al.  Protégé-2000: An Open-Source Ontology-Development and Knowledge-Acquisition Environment: AMIA 2003 Open Source Expo , 2003, AMIA.

[13]  Haibo Yu,et al.  An ontology-based approach for text mining of stroke electronic medical records , 2013, 2013 IEEE International Conference on Bioinformatics and Biomedicine.

[14]  Ming Liu,et al.  CausalTriad: Toward Pseudo Causal Relation Discovery and Hypotheses Generation from Medical Text Data , 2018, BCB.

[15]  D. K. Lobiyal,et al.  Concepts extraction for medical documents using ontology , 2015, 2015 International Conference on Advances in Computer Engineering and Applications.

[16]  Salvatore Vitabile,et al.  An ontology-based retrieval system for mammographic reports , 2015, 2015 IEEE Symposium on Computers and Communication (ISCC).

[17]  Geng Yang,et al.  Extraction of biomedical informtion related to breast cancer using text mining , 2016, 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD).

[18]  Fabio Rinaldi,et al.  The value of necropsy reports for animal health surveillance , 2018, BMC Veterinary Research.

[19]  Wendy W. Chapman,et al.  A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries , 2001, J. Biomed. Informatics.

[20]  Yonghwa Choi,et al.  A Neural Named Entity Recognition and Multi-Type Normalization Tool for Biomedical Text Mining , 2019, IEEE Access.

[21]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[22]  Joyce C. Ho,et al.  NamedKeys: Unsupervised Keyphrase Extraction for Biomedical Documents , 2019, BCB.

[23]  Josette F. Jones,et al.  Knowledge Discovery and Data Mining of Free Text Radiology Reports , 2011, 2011 IEEE First International Conference on Healthcare Informatics, Imaging and Systems Biology.

[24]  Wen Qu,et al.  Named Entity Recognition From Biomedical Texts Using a Fusion Attention-Based BiLSTM-CRF , 2019, IEEE Access.