Paper information - Fast parallel algorithms for approximate string matching


Given a text string, a much shorter pattern string, and an integer k, parallel algorithms are discussed for finding all occurrences of the pattern string in the text string with at most k differences (as defined by edit distance). First, a real-time parallel algorithm is proposed, which could be implemented on a systolic array using m very simple processing elements, where m is the length of the pattern string. Once the algorithm has started, at each time step it outputs the minimum edit distance from the pattern string to a substring of the text string. The algorithm is therefore well suited to real-time searching of text databases or biological nucleic acid sequence databases. Second, several ways of solving the same problem under different CRCW-PRAM assumptions (priority model, combination model, and common-value model) are developed. This class of algorithms uses O(m × n) or O(m × m × n) processors and achieves a time complexity of O(k).

Key words: approximate string matching, edit distance, systolic computation, CRCW-PRAM models.
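To make the per-step output concrete, here is a minimal sequential sketch of the classic dynamic-programming recurrence for approximate matching (the formulation usually attributed to Sellers). It is not the systolic or CRCW-PRAM algorithm from the paper. It only illustrates the quantity the abstract describes: for each text position, the minimum edit distance from the pattern to some substring ending there. The function names `min_distances` and `find_matches` are hypothetical, chosen for illustration.

```python
def min_distances(pattern: str, text: str) -> list[int]:
    """For each text position j, return the smallest edit distance between
    `pattern` and any substring of `text` that ends at position j."""
    m = len(pattern)
    # Column for the empty text prefix: pattern[:i] against nothing costs i edits.
    col = list(range(m + 1))
    out = []
    for ch in text:
        prev_diag = col[0]  # value of D[i-1][j-1] for i = 1
        col[0] = 0          # a match may start at any text position
        for i in range(1, m + 1):
            prev_left = col[i]  # D[i][j-1], saved before it is overwritten
            col[i] = min(
                prev_diag + (pattern[i - 1] != ch),  # match or substitution
                prev_left + 1,                       # extra text character
                col[i - 1] + 1,                      # missing pattern character
            )
            prev_diag = prev_left
        out.append(col[m])  # one output value per text character
    return out


def find_matches(pattern: str, text: str, k: int) -> list[int]:
    """End positions in `text` where the pattern matches with at most k differences."""
    return [j for j, d in enumerate(min_distances(pattern, text)) if d <= k]


print(min_distances("abc", "xabcx"))    # [3, 2, 1, 0, 1]
print(find_matches("abc", "xabcx", 1))  # [2, 3, 4]
```

Each column entry depends only on three neighbouring entries, which is the kind of local dependency that a linear array of m processing elements can exploit. This sequential version takes O(m × n) time and O(m) space.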

Yi Jiang

[1] Peter H. Sellers, et al. An Algorithm for the Distance Between Two Finite Sequences, 1974, J. Comb. Theory, Ser. A.

[2] David Sankoff, et al. Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison, 1983.

[3] Zvi Galil, et al. An Improved Algorithm for Approximate String Matching, 1989, SIAM J. Comput.

[4] Esko Ukkonen, et al. Algorithms for Approximate String Matching, 1985, Inf. Control.

[5] Udi Manber, et al. Introduction to algorithms - a creative approach, 1989.

[6] Gad M. Landau, et al. Efficient string matching in the presence of errors, 1985, 26th Annual Symposium on Foundations of Computer Science (sfcs 1985).

[7] Gad M. Landau, et al. Fast Parallel and Serial Approximate String Matching, 1989, J. Algorithms.

[8] Robert Langridge, et al. Mapping and interpreting biological information, 1991, CACM.

[9] Gad M. Landau, et al. Parallel Construction of a Suffix Tree (Extended Abstract), 1987, ICALP.