Optimal Computation of all Repetitions in a Weighted String
暂无分享,去创建一个
A repetition in a string of letters consists of exact concatenations of identical factors of the string. Crochemore’s repetitions algorithm, usually also referred to as Crochemore’s partitioning algorithm, was introduced in 1981, and was the first optimal O(n log n)-time algorithm to compute all repetitions in a string of length n. A weighted string is a string in which a set of letters may occur at each position with respective probabilities of occurrence. In this article, we present a new variant of Crochemore’s partitioning algorithm for weighted strings, which requires optimal time O(n log n), thus improving on the best known O(n2)-time algorithm for computing all repetitions in a weighted string of length n.
[1] Wojciech Rytter,et al. Repetitions in strings: Algorithms and combinatorics , 2009, Theor. Comput. Sci..
[2] Costas S. Iliopoulos,et al. Locating tandem repeats in weighted sequences in proteins , 2013, BMC Bioinformatics.
[3] Costas S. Iliopoulos,et al. The Weighted Suffix Tree: An Efficient Data Structure for Handling Molecular Weighted Sequences and its Applications , 2006, Fundam. Informaticae.
[4] Maxime Crochemore,et al. An Optimal Algorithm for Computing the Repetitions in a Word , 1981, Inf. Process. Lett..