Record linkage

This article describes methods for matching duplicates within or across files using non‐unique identifiers such as first name, last name, date of birth, address, and other characteristics. Copyright © 2010 John Wiley & Sons, Inc.

[1]  W. Winkler USING THE EM ALGORITHM FOR WEIGHT COMPUTATION IN THE FELLEGI-SUNTER MODEL OF RECORD LINKAGE , 2000 .

[2]  P. Ivax,et al.  A THEORY FOR RECORD LINKAGE , 2004 .

[3]  W. Winkler Overview of Record Linkage and Current Research Directions , 2006 .

[4]  W. Deming,et al.  On the Problem of Matching Lists by Samples , 1959 .

[5]  William S. Cooper,et al.  Foundations of Probabilistic and Utility-Theoretic Indexing , 1978, JACM.

[6]  Matthew A. Jaro,et al.  Advances in Record-Linkage Methodology as Applied to Matching the 1985 Census of Tampa, Florida , 1989 .

[7]  W. Winkler IMPROVED DECISION RULES IN THE FELLEGI-SUNTER MODEL OF RECORD LINKAGE , 1993 .

[8]  Antonio Zamora,et al.  Automatic spelling correction in scientific and scholarly text , 1984, CACM.

[9]  Pradeep Ravikumar,et al.  A Comparison of String Distance Metrics for Name-Matching Tasks , 2003, IIWeb.

[10]  Howard B. Newcombe,et al.  Record linkage: making maximum use of the discriminating power of identifying information , 1962, CACM.

[11]  William E. Winkler,et al.  String Comparator Metrics and Enhanced Decision Rules in the Fellegi-Sunter Model of Record Linkage. , 1990 .

[12]  H. Newcombe,et al.  Methods for Computer Linkage of Hospital Admission-Separation Records into Cumulative Health Histories , 1975, Methods of Information in Medicine.

[13]  Pradeep Ravikumar,et al.  A Hierarchical Graphical Model for Record Linkage , 2004, UAI.

[14]  L. Getoor,et al.  A Latent Dirichlet Allocation Model for Entity Resolution , 2005 .

[15]  William E. Yancey Evaluating String Comparator Performance for Record Linkage , 2005 .

[16]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[17]  H B NEWCOMBE,et al.  Automatic linkage of vital records. , 1959, Science.

[18]  Gonzalo Navarro,et al.  A guided tour to approximate string matching , 2001, CSUR.