Bit-Parallel Algorithms for the Merged Longest Common Subsequence Problem

It is often a necessity to compare some sequences to find out how similar they are. There are many similarity measures that can be used, e.g., longest common subsequence, edit distance, sequence alignment. Recently a merged longest common subsequence (MergedLCS) problem was formulated with applications in bioinformatics. We propose the bit-parallel algorithms for the MergedLCS problem and evaluate them in practice showing that they are usually tens times faster than the already published methods.

[1]  Sebastian Deorowicz,et al.  Bit-Parallel Algorithm for the Constrained Longest Common Subsequence Problem , 2010, Fundam. Informaticae.

[2]  Gaston H. Gonnet,et al.  A new approach to text searching , 1992, CACM.

[3]  Heikki Hyyro Bit-Parallel LCS-length Computation Revisited , 2004 .

[4]  S. D,et al.  Fast algorithm for the constrained longest common subsequence problem , 2022 .

[5]  Trevor I. Dix,et al.  A Bit-String Longest-Common-Subsequence Algorithm , 1986, Inf. Process. Lett..

[6]  B. Birren,et al.  Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae , 2004, Nature.

[7]  Rajeev Raman,et al.  String-Matching techniques for musical similarity and melodic recognition , 1998 .

[8]  Kun-Mao Chao,et al.  A fast algorithm for computing a longest common increasing subsequence , 2005, Inf. Process. Lett..

[9]  Jorma Tarhio,et al.  Searching monophonic patterns within polyphonic sources , 2000 .

[10]  Sebastian Deorowicz Speeding up transposition-invariant string matching , 2006, Inf. Process. Lett..

[11]  Henry S. Warren,et al.  Hacker's Delight , 2002 .

[12]  Esko Ukkonen,et al.  Including Interval Encoding into Edit Distance Based Music Comparison and Retrieval , 2003 .

[13]  Sebastian Deorowicz,et al.  Fast algorithm for the constrained longest common subsequence problem , 2007 .

[14]  Hsing-Yen Ann,et al.  Dynamic programming algorithms for the mosaic longest common subsequence problem , 2007, Inf. Process. Lett..

[15]  Dan Gusfield Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[16]  Hsing-Yen Ann,et al.  Efficient algorithms for finding interleaving relationship between sequences , 2008, Inf. Process. Lett..

[17]  Ricardo Baeza-Yates,et al.  Efficient text searching , 1989 .

[18]  Yin-Te Tsai,et al.  The constrained longest common subsequence problem , 2003, Inf. Process. Lett..

[19]  Chang-Biau Yang,et al.  Efficient Sparse Dynamic Programming for the Merged LCS Problem , 2008, BIOCOMP.

[20]  Maxime Crochemore,et al.  A fast and practical bit-vector algorithm for the Longest Common Subsequence problem , 2001, Inf. Process. Lett..

[21]  Alberto Apostolico,et al.  General Pattern Matching , 2010, Algorithms and Theory of Computation Handbook.

[22]  Alfredo De Santis,et al.  A simple algorithm for the constrained sequence problems , 2004, Information Processing Letters.