HIGH DENSITY DATA STORAGE IN DNA USING AN EFFICIENT MESSAGE ENCODING SCHEME

This paper proposes a message encoding scheme for storing small text files in nucleotide strands, aimed at ultra-high-density data storage in DNA. The scheme achieves high data density by relying on sequence transformation algorithms. Compressing small text files poses a special requirement because such files provide little context. Applying a transformation algorithm first yields better context information for the subsequent compression with Huffman encoding. We tested the scheme on a collection of small text files. The results show that the proposed scheme reduces the number of nucleotides needed to represent a text message compared with the existing method, which makes high-density data storage in DNA realizable.
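The pipeline described above (transform, then Huffman-encode, then write the bits as nucleotides) can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the abstract does not name the transformation, so a Burrows-Wheeler transform followed by move-to-front is assumed here, and the 2-bit-per-base mapping `BITS_TO_BASE`, the end-of-string marker, and all function names are hypothetical.

```python
import heapq
from collections import Counter

# Assumed mapping of bit pairs to bases; the paper's actual mapping is not stated.
BITS_TO_BASE = {"00": "A", "01": "C", "10": "G", "11": "T"}


def bwt(text, eos="\x03"):
    """Naive Burrows-Wheeler transform via sorted rotations (adequate for small files).

    Assumes the marker `eos` does not occur in `text`.
    """
    s = text + eos
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    return "".join(r[-1] for r in rotations)


def move_to_front(text):
    """Replace each character by its rank in a self-adjusting alphabet list."""
    alphabet = sorted(set(text))
    ranks = []
    for ch in text:
        idx = alphabet.index(ch)
        ranks.append(idx)
        alphabet.insert(0, alphabet.pop(idx))
    return ranks


def huffman_code(symbols):
    """Build a Huffman code table {symbol: bit string} from symbol frequencies."""
    freq = Counter(symbols)
    if len(freq) == 1:  # degenerate case: a single distinct symbol
        return {next(iter(freq)): "0"}
    # Heap entries are (weight, tie_breaker, partial code table).
    heap = [(n, i, {sym: ""}) for i, (sym, n) in enumerate(freq.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        n1, _, c1 = heapq.heappop(heap)
        n2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in c1.items()}
        merged.update({s: "1" + c for s, c in c2.items()})
        heapq.heappush(heap, (n1 + n2, tie, merged))
        tie += 1
    return heap[0][2]


def bits_to_dna(bits):
    """Map a bit string to bases, two bits per base, zero-padding an odd length."""
    if len(bits) % 2:
        bits += "0"
    return "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, len(bits), 2))


def encode(text):
    """Return (nucleotide strand, Huffman code table) for a small text message."""
    ranks = move_to_front(bwt(text))
    code = huffman_code(ranks)
    bits = "".join(code[r] for r in ranks)
    return bits_to_dna(bits), code


if __name__ == "__main__":
    message = "banana"
    strand, table = encode(message)
    # Baseline for comparison: 8 bits per character, i.e. 4 bases per character.
    print(strand, len(strand), "bases vs", 4 * len(message), "uncompressed")
```

The sketch counts only the payload. A decoder would also need the code table and the initial move-to-front alphabet, and for very small files that overhead can matter; how the paper handles it is not stated in the abstract.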
