Optimising Storage Resource using Morpheme based Text Compression Technique

paper, we present a text compression technique which utilises morpheme-based text compression to optimise storage resources. The proposed technique is designed to decompose words into their morphemes and then to produce code representations for compression. The proposed algorithm is implemented using English Language text data and applied using 30 different texts of different lengths collected from different sources with different natures. The efficiency increases with the increase in the number of long, repetitive morphemes in the input data. To the best of our knowledge, the resulting implementation is the first to demonstrate lossless compression using such a technique. We illustrate its suitability and effectiveness on a number of benchmark file sizes - small, middle-sized, large, and very large real-world application. The results indicated a good compression performance of 98% making the approach an attractive one. A further virtue of this method is its dynamic application. A degraded compression can be compensated for by appending identified morphemes within the document to the dictionary to improve compression. The evaluation experiments show that: if storage space is the primary consideration, the morpheme- based text compression technique is an efficient approach for compressing text data.