Clustering Method for Repeat Analysis in DNA sequences

The paper proposes a clustering technique which utilizes this output and classifies the repeats based on similarity measures and creates Repeat Classes. The algorithm finally outputs Clusters congregated into a Repeat Data Bank which can be indexed based on Repeat Classes for future use. The algorithm efficiently organizes these repeats into classes for small and large genome sizes as well as for partial/complete sequences. II. ALGORITHM DESCRIPTION