A Biological Sequence Compression based on Look up Table (LUT) using Complementary Palindrome of Fixed Size

Data storage accounts for an appreciable proportion of the total cost of creating and analyzing DNA sequences. In particular, the volume of DNA sequence data is growing markedly faster than disk storage capacity. General-purpose text compression algorithms do not exploit the specific characteristics of DNA sequences. In this paper, we propose a compression algorithm based on the cross-complementary properties of DNA sequences. The technique also helps in comparing DNA sequences and in identifying similar subsequences, which may lead to the identification of similar structure and function. Experimental results show that it achieves better compression than other existing compression algorithms.
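To make the term "complementary palindrome" concrete, the following minimal Python sketch shows the standard reverse-complement relation (A pairs with T, C pairs with G) and a simple scan for fixed-size substrings whose reverse complement occurred earlier in the sequence. The function names and the window size `k` are illustrative assumptions. This is not the algorithm proposed in the paper, only a sketch of the property it builds on.

```python
# Watson-Crick base pairing: A<->T, C<->G.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string over A, C, G, T."""
    return seq.translate(COMPLEMENT)[::-1]


def is_complementary_palindrome(seq: str) -> bool:
    """True when the sequence equals its own reverse complement."""
    return seq == reverse_complement(seq)


def find_complementary_repeats(seq: str, k: int) -> list[tuple[int, int]]:
    """Find pairs (i, j), i < j, where the k-mer at j is the reverse
    complement of the k-mer first seen at i and the two do not overlap.

    Hypothetical helper for illustration; k is an assumed fixed size.
    """
    first_seen: dict[str, int] = {}  # k-mer -> earliest start position
    matches = []
    for j in range(len(seq) - k + 1):
        kmer = seq[j:j + k]
        rc = reverse_complement(kmer)
        # Report a match only if the earlier occurrence ends before j.
        if rc in first_seen and j - first_seen[rc] >= k:
            matches.append((first_seen[rc], j))
        first_seen.setdefault(kmer, j)
    return matches
```

A compressor could, in principle, replace the second member of each reported pair with a short reference to the first. How such references are encoded, and how the look-up table is organized, is specific to the method described in the paper and is not assumed here.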