Validation of probabilistic linkage to match de-identified ambulance records to a state trauma registry.

Objectives: To validate the accuracy of using probabilistic linkage for matching de-identified ambulance records to a state trauma registry. Methods: This was a retrospective cohort analysis. Three thousand nine hundred nineteen true matches between ambulance and state trauma registry data from 1998 to 2003 were identified by deterministic matching on trauma identification number and verified by human review. Two thousand thirty-eight ambulance records from trauma patients not meeting criteria for a true match, and an identical number of trauma registry records randomly selected from the one local county served by a different EMS provider, were included as nonmatches. There were 17 variables considered for linkage, which included the following: age, gender, race, county, hospital, date, rural setting, call and arrival times, mechanism, penetrating injury, vital signs, intubation, and intoxication. Probabilistic linkage was used to link the two data sets, using seven different combinations of common variables (maximum, 17; minimum, 4). The sensitivity and specificity of identifying true matches and nonmatches (95% confidence intervals [95% CI]) were calculated for each combination of variables. Results: Using all 17 available variables, 3,766 of 3,919 true matches were appropriately linked (sensitivity, 96.1%; 95% CI = 95.4% to 96.7%), with eight mismatches (specificity, 99.6%; 95% CI = 99.2% to 99.8%). Sensitivity fell below 95% with 98% regardless of the number of variables included. Conclusions: Probabilistic linkage is a valid method for matching ambulance records to a trauma registry without the use of patient identifiers; however, the sensitivity of identifying true matches is critically dependent on the number and type of common variables included in the analysis.

[1]  D. Clark,et al.  Hospital trauma registries linked with population-based data. , 1999, The Journal of trauma.

[2]  J M Dean,et al.  Probabilistic linkage of computerized ambulance and inpatient hospital discharge records: a potential tool for evaluation of emergency medical services. , 2001, Annals of emergency medicine.

[3]  D. Clark,et al.  Comparison of probabilistic and deterministic record linkage in the development of a statewide trauma registry. , 1995, Proceedings. Symposium on Computer Applications in Medical Care.

[4]  Matthew A. Jaro,et al.  Probabilistic linkage of large public health data files. , 1995, Statistics in medicine.

[5]  D. Clark,et al.  Practical introduction to record linkage for injury research , 2004, Injury Prevention.

[6]  T. Blakely,et al.  Probabilistic record linkage and a method to calculate the positive predictive value. , 2002, International journal of epidemiology.

[7]  J. Langley,et al.  Determining First Admissions in a Hospital Discharge File via Record Linkage , 1998, Methods of Information in Medicine.

[8]  Gregory Luke Larkin,et al.  From Hippocrates to HIPAA: Privacy and confidentiality in Emergency Medicine—Part I: Conceptual, moral, and legal foundations , 2004, Annals of Emergency Medicine.

[9]  A C Allen,et al.  An assessment of the validity of a computer system for probabilistic record linkage of birth and infant death records in Canada. The Fetal and Infant Health Study Group. , 2000, Chronic diseases in Canada.

[10]  J. Marc Overhage,et al.  Analysis of a Probabilistic Record Linkage Technique without Human Review , 2003, AMIA.

[11]  D. Clark,et al.  Evaluating an inclusive trauma system using linked population-based data. , 2004, The Journal of trauma.

[12]  S A Waien,et al.  Linking large administrative databases: a method for conducting emergency medical services cohort studies using existing data. , 1997, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[13]  L J Cook,et al.  Probabilistic Record Linkage: Relationships between File Sizes, Identifiers, and Match Weights , 2001, Methods of Information in Medicine.