Tolerating spelling errors during patient validation.

Misspellings, typographical errors, and variant name forms present a considerable problem for a Clinical Information System when validating patient data. Algorithms to correct these types of errors are being used, but they are based either on a study of frequent types of errors associated with general words in an English text rather than types of errors associated with the spelling of names, or on errors that are phonologically based. This paper investigates the types of errors that are specifically associated with the spelling of patient names, and proposes an algorithm that effectively handles such types of errors. This paper also studies the effectiveness of several relaxation techniques and compares them with the one that is being proposed.

[1]  M E Fair,et al.  Application of exact ODDS for partial agreements of names in record linkage. , 1991, Computers and biomedical research, an international journal.

[2]  Fred J. Damerau,et al.  A technique for computer detection and correction of spelling errors , 1964, CACM.

[3]  R Goehring Identification of patients in medical data bases--soundex codes versus match code. , 1985, Medical informatics = Medecine et informatique.

[4]  Cyril N. Alberga,et al.  String similarity and misspellings , 1967, CACM.

[5]  Donald E. Knuth,et al.  Fast Pattern Matching in Strings , 1977, SIAM J. Comput..

[6]  Aviezri S. Fraenkel,et al.  A hash code method for detecting and correcting spelling errors , 1982, CACM.

[7]  Michael Allen Bickel,et al.  Automatic correction to misspelled names: a fourth-generation language approach , 1987, CACM.

[8]  Rudolf Thurmayr,et al.  The Use of Phonetic Code for Patient Identification , 1982 .

[9]  P R Krause,et al.  A review of algorithms for molecular sequence comparison. , 1991, Computers and biomedical research, an international journal.

[10]  James L. Peterson,et al.  Computer programs for detecting and correcting spelling errors , 1980, CACM.

[11]  Robert S. Boyer,et al.  A fast string searching algorithm , 1977, CACM.

[12]  Jr. Jm Gersting,et al.  Two Search Techniques within a Human Pedigree Database. , 1982 .

[13]  Leon Davidson,et al.  Retrieval of misspelled names in an airlines passenger record system , 1962, CACM.

[14]  G. de V. Smit,et al.  A Comparison of Three String Matching Algorithms , 1982, Softw. Pract. Exp..

[15]  D. M. Joseph,et al.  Correction of misspellings and typographical errors in a free-text medical English information storage and retrieval system. , 1979, Methods of information in medicine.

[16]  James W. Fickett,et al.  The GenBank genetic sequence databank , 1986, Nucleic Acids Res..

[17]  R. E. Nather On the compilation of subscripted variables , 1961, CACM.

[18]  Peter Weiner,et al.  Linear Pattern Matching Algorithms , 1973, SWAT.