Validation of text-mining and content analysis techniques using data collected from veterinary practice management software systems in the UK.

Electronic patient records from practice management software systems have been used extensively in medicine for the investigation of clinical problems leading to the creation of decision support frameworks. To date, technologies that have been utilised for this purpose such as text mining and content analysis have not been employed significantly in veterinary medicine. The aim of this research was to pilot the use of content analysis and text-mining software for the synthesis and analysis of information extracted from veterinary electronic patient records. The purpose of the work was to be able to validate this approach for future employment across a number of practices for the purposes of practice based research. The approach utilised content analysis (Prosuite) and text mining (WordStat) software to aggregate the extracted text. Text mining tools such as Keyword in Context (KWIC) and Keyword Retrieval (KR) were employed to identify specific occurrences of data across the records. Two different datasets were interrogated, a bespoke test dataset that had been set up specifically for the purpose of the research, and a functioning veterinary clinic dataset that had been extracted from one veterinary practice. Across both datasets, the KWIC analysis was found to have a high level of accuracy with the search resulting in a sensitivity of between 85.3-100%, a specificity of between 99.1-99.7%, a positive predictive value between 93.5-95.8% and a negative predictive value between 97.7-100%. The KR search, based on machine learning, was utilised for the clinic-based dataset and was found to perform slightly better than the KWIC analysis. This study is the first to demonstrate the application of content analysis and text mining software for validation purposes across a number of different datasets for the purpose of search and recall of specific information across electronic patient records. This has not been demonstrated previously for small animal veterinary epidemiological research for the purposes of large scale analysis for practice-based research. Extension of this work to investigate more complex diseases across larger populations is required to fully explore the use of this approach in veterinary practice.

[1]  Julie A Jacko,et al.  The informatics imperative in veterinary medicine: collaboration across disciplines. , 2011, Journal of veterinary medical education.

[2]  R. Fletcher,et al.  Clinical Epidemiology: The Essentials , 1982 .

[3]  Alexandra G. Smith,et al.  Emergency admission and survival from aggressive non-Hodgkin lymphoma: A report from the UK's population-based Haematological Malignancy Research Network , 2017, European journal of cancer.

[4]  P. McGreevy,et al.  Longevity and mortality of cats attending primary care veterinary practices in England , 2015, Journal of feline medicine and surgery.

[5]  Hsinchun Chen,et al.  Knowledge Management, Data Mining, and Text Mining in Medical Informatics , 2005 .

[6]  R M Anholt,et al.  The application of medical informatics to the veterinary management programs at companion animal practices in Alberta, Canada: a case study. , 2014, Preventive veterinary medicine.

[7]  P. J. Armstrong,et al.  Health status and population characteristics of dogs and cats examined at private veterinary practices in the United States , 2022 .

[8]  E. B. Wilson Probable Inference, the Law of Succession, and Statistical Inference , 1927 .

[9]  John Berezowski,et al.  Mining free-text medical records for companion animal enteric syndrome surveillance. , 2014, Preventive veterinary medicine.

[10]  J. Hall,et al.  Urinary incontinence in bitches under primary veterinary care in England: prevalence and risk factors , 2017, The Journal of small animal practice.

[11]  John F. Hurdle,et al.  Extracting Information from Textual Documents in the Electronic Health Record: A Review of Recent Research , 2008, Yearbook of Medical Informatics.

[12]  J. Mccambridge,et al.  Alcohol consumption and all-cause mortality: an analysis of general practice database records for patients with long-term conditions , 2017, Journal of Epidemiology & Community Health.

[13]  P. McGreevy,et al.  Prevalence of disorders recorded in cats attending primary-care veterinary practices in England. , 2014, Veterinary journal.

[14]  Perry L. Miller,et al.  Research Paper: Exploring the Degree of Concordance of Coded and Textual Data in Answering Clinical Queries from a Clinical Data Repository , 2000, J. Am. Medical Informatics Assoc..

[15]  Kurt L Zimmerman,et al.  Uses of informatics to solve real world problems in veterinary medicine. , 2011, Journal of veterinary medical education.

[16]  Paul J. Watson,et al.  Statistics for Veterinary and Animal Science , 1999 .

[17]  Linda A. Watson,et al.  Information Retrieval: A Health and Biomedical Perspective. , 2005 .

[18]  E. DeLong,et al.  Discordance of Databases Designed for Claims Payment versus Clinical Information Systems: Implications for Outcomes Research , 1993, Annals of Internal Medicine.

[19]  K. Faunt,et al.  The power of practice: harnessing patient outcomes for clinical decision making. , 2007, The Veterinary clinics of North America. Small animal practice.

[20]  A. Majeed,et al.  Identifying undiagnosed diabetes: cross-sectional survey of 3.6 million patients' electronic records. , 2008, The British journal of general practice : the journal of the Royal College of General Practitioners.

[21]  T. Parkin,et al.  Use of free text clinical records in identifying syndromes and analysing health data , 2007, Veterinary Record.

[22]  S. Dawson,et al.  Antibacterial prescribing patterns in small animal veterinary practice identified via SAVSNET: the small animal veterinary surveillance network , 2011, Veterinary Record.

[23]  Edward H. Shortliffe,et al.  The Computer Meets Medicine and Biology: Emergence of a Discipline , 2001 .

[24]  David Moher,et al.  The REporting of Studies Conducted Using Observational Routinely-Collected Health Data (RECORD) Statement: Methods for Arriving at Consensus and Developing Reporting Guidelines , 2015, PloS one.

[25]  Agreement between veterinary patient data collected from different sources. , 2015, Veterinary journal.

[26]  L. Glickman,et al.  Adverse events after vaccine administration in cats: 2,560 cases (2002-2005). , 2007, Journal of the American Veterinary Medical Association.

[27]  S. Dawson,et al.  Surveillance of diarrhoea in small animal practice through the Small Animal Veterinary Surveillance Network (SAVSNET). , 2014, Veterinary journal.

[28]  J. Jones-Diette,et al.  Accuracy of the electronic patient record in a first opinion veterinary practice. , 2017, Preventive veterinary medicine.

[29]  M. Ward,et al.  Adverse events diagnosed within three days of vaccine administration in dogs. , 2005, Journal of the American Veterinary Medical Association.

[30]  Alan Radford,et al.  A Semi-Automated Approach to Building Text Summarisation Classifiers , 2012, MLDM.

[31]  Marco Duz,et al.  Validation of an Improved Computer-Assisted Technique for Mining Free-Text Electronic Medical Records , 2017, JMIR medical informatics.

[32]  J. Hall,et al.  Urinary incontinence in male dogs under primary veterinary care in England: prevalence and risk factors , 2018, The Journal of small animal practice.

[33]  W. DuMouchel,et al.  Unlocking Clinical Data from Narrative Reports: A Study of Natural Language Processing , 1995, Annals of Internal Medicine.

[34]  Ross M. Mullner,et al.  Clinical Epidemiology: The Essentials, 4th Edition , 2006 .

[35]  Hannah Doit,et al.  A method for extracting electronic patient record data from practice management software systems used in veterinary practice , 2016, BMC Veterinary Research.

[36]  G. Lim,et al.  Measures of Diagnostic Accuracy: Sensitivity, Specificity, PPV and NPV , 2011 .

[37]  I. Buchan,et al.  Developing a network for small animal disease surveillance , 2010, Veterinary Record.