Extension and use of KMRCRelat algorithm for biological problems

The main goal of this paper is to present a new extension of KMRCRelat algorithm allowing Word Train searches. First, we recall the fundamental lemma of our KMRCRelat algorithm and we focus on the concept of flexible relational repeated words. Then, we introduce the concept of Word Trains and we show, in deep, the needed modification and extension to include such features in this algorithm. To illustrate Word Train searches, we present all details about KMRCRelat extension computation steps. We also show how this new extension is applicable to the characterization of short tandem repeat in DNA sequence providing an alternative way to solve originally this known problem in sequence analysis. Before concluding by introducing other possible KMRCRelat applications, the paper expands a set of experimental results obtained when applied to locus identification problem.