Extraction and Exploration of Correlations in Patient Status Data

The paper discusses an Information Extraction approach, which is applied for the automatic processing of hospital Patient Records (PRs) in Bulgarian language. The main task reported here is retrieval of status descriptions related to anatomical organs. Due to the specific telegraphic PR style, the approach is focused on shallow analysis. Missing text descriptions and default values are another obstacle. To overcome it, we propose an algorithm for exploring the correlations between patient status data and the corresponding diagnosis. Rules for interdependencies of the patient status data are generated by clustering according to chosen metrics. In this way it becomes possible to fill in status templates for each patient when explicit descriptions are unavailable in the text. The article summarises evaluation results which concern the performance of the current IE prototype.

[1]  Douglas G. Altman,et al.  Practical statistics for medical research , 1990 .

[2]  Ralph Grishman,et al.  Scenario customization for information extraction , 2000 .

[3]  Carol Friedman,et al.  Towards a comprehensive medical language processing system: methods and issues , 1997, AMIA.

[4]  G Demetriou,et al.  Two applications of information extraction to biological science journal articles: enzyme interactions and protein structures. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[5]  KaiserKatharina,et al.  How can information extraction ease formalizing treatment processes in clinical practice guidelines , 2007 .

[6]  Christopher G. Chute,et al.  Technical Brief: Mayo Clinic NLP System for Patient Smoking Status Identification , 2008, J. Am. Medical Informatics Assoc..

[7]  Fabio Rinaldi,et al.  Relation Mining over a Corpus of Scientific Literature , 2005, AIME.

[8]  Svetla Boytcheva,et al.  Some Aspects of Negation Processing in Electronic Health Records , 2005 .

[9]  S. A. Jalaee,et al.  Abstract , 1999, Veterinary Record.

[10]  Ralph Grishman,et al.  Information Extraction: Techniques and Challenges , 1997, SCIE.

[11]  Jean Charlet,et al.  Building Medical Ontologies Based on Terminology Extraction from Texts: Methodological Propositions , 2005, AIME.

[12]  Peter Willett,et al.  Protein Structures and Information Extraction from Biological Texts: The PASTA System , 2003, Bioinform..

[13]  Florence Reeder,et al.  MiTAP for Bio-Security: A Case Study , 2002 .