Making the most of injury surveillance data: using narrative text to identify exposure information in case-control studies.

INTRODUCTION Free-text fields in injury surveillance databases can provide detailed information beyond routinely coded data. Additional data, such as exposures and covariates can be identified from narrative text and used to conduct case-control studies. METHODS To illustrate this, we developed a text-search algorithm to identify helmet status (worn, not worn, use unknown) in the U.S. National Electronic Injury Surveillance System (NEISS) narratives for bicycling and other sports injuries from 2005 to 2011. We calculated adjusted odds ratios (ORs) for head injury associated with helmet use, with non-head injuries representing controls. For bicycling, we validated ORs against published estimates. ORs were calculated for other sports and we examined factors associated with helmet reporting. RESULTS Of 105,614 bicycling injury narratives reviewed, 14.1% contained sufficient helmet information for use in the case-control study. The adjusted ORs for head injuries associated with helmet-wearing were smaller than, but directionally consistent, with previously published estimates (e.g., 1999 Cochrane Review). ORs illustrated a protective effect of helmets for other sports as well (less than 1). CONCLUSIONS This exploratory analysis illustrates the potential utility of relatively simple text-search algorithms to identify additional variables in surveillance data. Limitations of this study include possible selection bias and the inability to identify individuals with multiple injuries. A similar approach can be applied to study other injuries, conditions, risks, or protective factors. This approach may serve as an efficient method to extend the utility of injury surveillance data to conduct epidemiological research.

[1]  Frederick P. Rivara,et al.  Effectiveness of Bicycle Safety Helmets in Preventing Head Injuries , 1996 .

[2]  Steven H. Brown,et al.  Automated identification of postoperative complications within an electronic medical record using natural language processing. , 2011, JAMA.

[3]  Kirsten McKenzie,et al.  The use of narrative text for injury surveillance research: a systematic review. , 2010, Accident; analysis and prevention.

[4]  John Shawe-Taylor,et al.  Extracting Diagnoses and Investigation Results from Unstructured Text in Electronic Health Records by Semi-Supervised Machine Learning , 2012, PloS one.

[5]  P. Zimmern,et al.  Data for free--can an electronic medical record provide outcome data for incontinence/prolapse repair procedures? , 2012, The Journal of urology.

[6]  D. Thompson,et al.  Effectiveness of bicycle safety helmets in preventing head injuries. A case-control study. , 1996, JAMA.

[7]  K. Enskär,et al.  Psychosocial health information in free text notes of Swedish children's health records. , 2013, Scandinavian journal of caring sciences.

[8]  B. Hagel,et al.  The effect of helmets on the risk of head and neck injuries among skiers and snowboarders: a meta-analysis , 2010, Canadian Medical Association Journal.

[9]  Richard L Berg,et al.  Use of an Electronic Medical Record for the Identification of Research Subjects with Diabetes Mellitus , 2007, Clinical Medicine & Research.

[10]  Frederick P Rivara,et al.  Epidemiology of bicycle injuries and risk factors for serious injury , 1997, Injury Prevention.

[11]  D. Thompson,et al.  A case-control study of the effectiveness of bicycle safety helmets. , 1989, The New England journal of medicine.

[12]  M. Kenward,et al.  Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls , 2009, BMJ : British Medical Journal.

[13]  W. Hersh Adding value to the electronic health record through secondary use of data for quality assurance, research, and surveillance. , 2007, The American journal of managed care.

[14]  D. Thompson,et al.  Cochrane Review : Helmets for preventing head and facial injuries in bicyclists , 2017 .

[15]  R G Attewell,et al.  Bicycle helmet efficacy: a meta-analysis. , 2001, Accident; analysis and prevention.

[16]  Jennifer L. Kelsey,et al.  Methods in Observational Epidemiology , 1986 .