Efficient Algorithms for the delta-Approximate String Matching Problem in Musical Sequences

The δ-approximate string matching problem, recently introduced in connection with applications to music retrieval, is a generalization of the exact string matching problem for alphabets of integer numbers. In the δ-approximate variant, (exact) matching between any pair of symbols/integers a and b is replaced by the notion of δ-matching =δ, where a =δ b if and only if |a− b| ≤ δ for a given value of the approximation bound δ. After surveying the state-of-the-art, we describe some new effective algorithms for the δ-matching problem, obtained by adapting existing string matching algorithms. The algorithms discussed in the paper are then compared with respect to a large set of experimental tests. From these, in particular it turns out that two of our newly proposed algorithms often achieve the best performances, especially in the case of large alphabets and short patterns, which typically occurs in practical situations in music retrieval.

[1]  Wojciech Plandowski,et al.  Three Heuristics for delta-Matching: delta-BM Algorithms , 2002, CPM.

[2]  S Karlin,et al.  Efficient algorithms for molecular sequence analysis. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Aleksandar Milosavljevic,et al.  Discovering simple DNA sequences by the algorithmic significance method , 1993, Comput. Appl. Biosci..

[4]  Wojciech Plandowski,et al.  Speeding up two string-matching algorithms , 2005, Algorithmica.

[5]  Domenico Cantone,et al.  Forward-Fast-Search: Another Fast Variant of the Boyer-Moore String Matching Algorithm , 2003, Stringology.

[6]  Maxime Crochemore,et al.  Algorithms For Computing Approximate Repetitions In Musical Sequences , 2002, Int. J. Comput. Math..

[7]  Andrew Hume,et al.  Fast string searching , 1991, USENIX Summer.

[8]  Andrew Chi-Chih Yao,et al.  The Complexity of Pattern Matching for a Random String , 1977, SIAM J. Comput..

[9]  Gaston H. Gonnet,et al.  A new approach to text searching , 1989, SIGIR '89.

[10]  Maxime Crochemore,et al.  Approximate String Matching in Musical Sequences , 2001, Stringology.

[11]  Wojciech Plandowski,et al.  Pattern-Matching Problems for Two-Dimensional Images Described by Finite Automata , 2000, Nord. J. Comput..

[12]  Robert S. Boyer,et al.  A fast string searching algorithm , 1977, CACM.

[13]  Thomas Berry,et al.  A Fast String Matching Algorithm and Experimental Results , 1999, Stringology.

[14]  Thierry Lecroq,et al.  A Very Fast String Matching Algorithm for Small Alphabeths and Long Patterns (Extended Abstract) , 1998, CPM.

[15]  Gonzalo Navarro,et al.  A Bit-Parallel Approach to Suffix Automata: Fast Extended String Matching , 1998, CPM.

[16]  Daniel Sunday,et al.  A very fast substring search algorithm , 1990, CACM.

[17]  Rajeev Raman,et al.  String-Matching techniques for musical similarity and melodic recognition , 1998 .