This paper describes the linkage of data from three Dutch Perinatal Registries: the Dutch National Midwife Registry, the Dutch National Obstetrics Registry and the Dutch National Pediatrics Registry, for the year of 2001. All these registries are anonymous and lack a common identifier. We used probabilistic and deterministic record linkage techniques to combine data from the mother, delivery and child involving to the same pregnancy. Records of singleton and twin pregnancies were linked separately. We have developed a probabilistic close method based on maximum likelihood methods to estimate the weights of individual linking variables and the threshold value for the overall weight. Probabilistic linkage identified 80% more links than a full deterministic linkage approach. External validation revealed an error rate of less than 1%. Our method is a flexible and powerful method to link anonymous registries in the absence of a gold standard.
[1]
Howard B. Newcombe,et al.
Handbook of record linkage: methods for health and statistical studies, administration, and business
,
1988
.
[2]
Matthew A. Jaro,et al.
Probabilistic linkage of large public health data files.
,
1995,
Statistics in medicine.
[3]
Matthew A. Jaro,et al.
Advances in Record-Linkage Methodology as Applied to Matching the 1985 Census of Tampa, Florida
,
1989
.
[4]
P. Ivax,et al.
A THEORY FOR RECORD LINKAGE
,
2004
.
[5]
J. Marc Overhage,et al.
Analysis of a Probabilistic Record Linkage Technique without Human Review
,
2003,
AMIA.
[6]
D. Rubin,et al.
A method for calibrating false-match rates in record linkage
,
1995
.
[7]
References
,
1971
.