Technical Brief: Automatic Classification of Foot Examination Findings Using Clinical Notes and Machine Learning

We examine the feasibility of a machine learning approach to identification of foot examination (FE) findings from the unstructured text of clinical reports. A Support Vector Machine (SVM) based system was constructed to process the text of physical examination sections of in- and out-patient clinical notes to identify if the findings of structural, neurological, and vascular components of a FE revealed normal or abnormal findings or were not assessed. The system was tested on 145 randomly selected patients for each FE component using 10-fold cross validation. The accuracy was 80%, 87% and 88% for structural, neurological, and vascular component classifiers, respectively. Our results indicate that using machine learning to identify FE findings from clinical reports is a viable alternative to manual review and warrants further investigation. This application may improve quality and safety by providing inexpensive and scalable methodology for quality and risk factor assessments at the point of care.

[1]  Peter J. Haug,et al.  Automatic identification of pneumonia related concepts on chest x-ray reports , 1999, AMIA.

[2]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[3]  Aaron M. Cohen,et al.  An Effective General Purpose Approach for Automated Biomedical Document Classification , 2006, AMIA.

[4]  George Hripcsak,et al.  Automated detection of adverse events using natural language processing of discharge summaries. , 2005, Journal of the American Medical Informatics Association : JAMIA.

[5]  Stephen D. Persell,et al.  Assessing the validity of national quality measures for coronary artery disease using an electronic health record. , 2006, Archives of internal medicine.

[6]  Ronald Horswell,et al.  The impact of a staged management approach to diabetes foot care in the Louisiana public hospital system. , 2003, The Journal of the Louisiana State Medical Society : official organ of the Louisiana State Medical Society.

[7]  Serguei V. S. Pakhomov,et al.  Epidemiology of angina pectoris: role of natural language processing of the medical record. , 2007, American heart journal.

[8]  Naomi Sager,et al.  Research Paper: Natural Language Processing and the Representation of Clinical Data , 1994, J. Am. Medical Informatics Assoc..

[9]  F Harris,et al.  Performance indicators. , 1985, Health and social service journal.

[10]  Paul C Tang,et al.  Research Paper: Comparison of Methodologies for Calculating Quality Measures Based on Administrative Data versus Clinical Data from an Electronic Health Record System: Implications for Performance Measures , 2007, J. Am. Medical Informatics Assoc..

[11]  Serguei V. S. Pakhomov,et al.  Electronic medical records for clinical research: application to the identification of heart failure. , 2007, The American journal of managed care.

[12]  Hazel Roddam,et al.  Assessing adherence to the evidence base in the management of poststroke dysphagia , 2006, Clinical rehabilitation.

[13]  Eric W. Ford,et al.  Predicting the adoption of electronic health records by physicians: when will health care be paperless? , 2006, Journal of the American Medical Informatics Association : JAMIA.

[14]  Tapio Salakoski,et al.  Towards Automated Classification of Intensive Care Nursing Narratives , 2006, MIE.

[16]  P. Gompertz,et al.  Reliability and validity of the Intercollegiate Stroke Audit Package. , 2001, Journal of evaluation in clinical practice.

[17]  Theodore J Thompson,et al.  The Association between Quality of Care and the Intensity of Diabetes Disease Management Programs , 2006, Annals of Internal Medicine.

[18]  Carol Friedman,et al.  A broad-coverage natural language processing system , 2000, AMIA.

[19]  Yindalon Aphinyanagphongs,et al.  Text Categorization Models for Retrieval of High Quality Articles in Internal Medicine , 2003, AMIA.

[20]  M. Mangan,et al.  Project LEAP of New Jersey: lower extremity amputation prevention in persons with type 2 diabetes. , 1999, The American journal of managed care.

[21]  Gary M Marsh,et al.  Methodology to improve data quality from chart review in the managed care setting. , 2002, The American journal of managed care.

[22]  Peter J. Haug,et al.  Research Paper: Automatic Detection of Acute Bacterial Pneumonia from Chest X-ray Reports , 2000, J. Am. Medical Informatics Assoc..

[23]  Alastair Baker,et al.  Crossing the Quality Chasm: A New Health System for the 21st Century , 2001, BMJ : British Medical Journal.

[24]  G G Youngson,et al.  Use of clinical audit for revalidation: is it sufficiently accurate? , 2001, Journal of quality in clinical practice.

[25]  Michael M Engelgau,et al.  Trends in care practices and outcomes among Medicare beneficiaries with diabetes. , 2005, American journal of preventive medicine.

[26]  Ted Pedersen,et al.  A Comparative Study of Supervised Learning as Applied to Acronym Expansion in Clinical Reports , 2006, AMIA.

[27]  Serguei V. S. Pakhomov,et al.  Quality Performance Measurement Using the Text of Electronic Medical Records , 2008, Medical decision making : an international journal of the Society for Medical Decision Making.

[28]  Dean F. Sittig,et al.  Natural language processing in the electronic medical record: assessing clinician adherence to tobacco treatment guidelines. , 2005, American journal of preventive medicine.

[29]  Hongfang Liu,et al.  Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues , 2006, BMC Bioinformatics.

[30]  J. Leahy,et al.  Improvements in Diabetes Processes of Care and Intermediate Outcomes: United States, 1988–2002Saaddine JB, Cadwell B, Gregg EW, et al (Centers for Disease Control and Prevention, Atlanta, Ga) Ann Intern Med 144:465–474, 2006§ , 2007 .