New Trends of Digital Data Storage in DNA

With the exponential growth in the capacity of information generated and the emerging need for data to be stored for prolonged period of time, there emerges a need for a storage medium with high capacity, high storage density, and possibility to withstand extreme environmental conditions. DNA emerges as the prospective medium for data storage with its striking features. Diverse encoding models for reading and writing data onto DNA, codes for encrypting data which addresses issues of error generation, and approaches for developing codons and storage styles have been developed over the recent past. DNA has been identified as a potential medium for secret writing, which achieves the way towards DNA cryptography and stenography. DNA utilized as an organic memory device along with big data storage and analytics in DNA has paved the way towards DNA computing for solving computational problems. This paper critically analyzes the various methods used for encoding and encrypting data onto DNA while identifying the advantages and capability of every scheme to overcome the drawbacks identified priorly. Cryptography and stenography techniques have been analyzed in a critical approach while identifying the limitations of each method. This paper also identifies the advantages and limitations of DNA as a memory device and memory applications.

[1]  Menachem Ailenberg,et al.  An improved Huffman coding method for archiving text, images, and music characters in DNA. , 2009, BioTechniques.

[2]  Ichiro Matsumura,et al.  Overlap extension PCR cloning: a simple and reliable way to create recombinant plasmids. , 2010, BioTechniques.

[3]  Sanjay Goel Digital Forensics and Cyber Crime , 2010 .

[4]  Catherine Taylor Clelland,et al.  Hiding messages in DNA microdots , 1999, Nature.

[5]  D. Huffman A Method for the Construction of Minimum-Redundancy Codes , 1952 .

[6]  A. Doig Improving the efficiency of the genetic code by varying the codon length--the perfect genetic code. , 1997, Journal of theoretical biology.

[7]  Masanori Arita,et al.  Writing Information into DNA , 2004, Aspects of Molecular Computing.

[8]  Manish K. Gupta,et al.  Natural Data Storage: A Review on sending Information from now to then via Nature , 2015, ArXiv.

[9]  Jian Ma,et al.  A Rewritable, Random-Access DNA-Based Storage System , 2015, Scientific Reports.

[10]  Erik Winfree,et al.  A Sticker-Based Model for DNA Computation , 1998, J. Comput. Biol..

[11]  Pei Yun Lee,et al.  Agarose Gel Electrophoresis for the Separation of DNA Fragments , 2012, Journal of visualized experiments : JoVE.

[12]  Yunpeng Zhang,et al.  Research on DNA Cryptography , 2012 .

[13]  S. Schuster Next-generation sequencing transforms today's biology , 2008, Nature Methods.

[14]  Nigam Sangwan,et al.  Text Encryption with Huffman Compression , 2012 .

[15]  Siddhant Shrivastava,et al.  Data Storage in DNA , 2014 .

[16]  Clifford R. Johnson,et al.  Solution of a 20-Variable 3-SAT Problem on a DNA Computer , 2002, Science.

[17]  Monica Borda,et al.  Fundamentals in Information Theory and Coding , 2011 .

[18]  Olga Tornea,et al.  DNA Cryptographic Algorithms , 2009 .

[19]  L M Adleman,et al.  Molecular computation of solutions to combinatorial problems. , 1994, Science.

[20]  Pak Chung Wong,et al.  Organic data memory using the DNA approach , 2003, CACM.

[21]  Monica Borda,et al.  DNA secret writing techniques , 2010, 2010 8th International Conference on Communications.

[22]  G. Church,et al.  Next-Generation Digital Information Storage in DNA , 2012, Science.

[23]  M. Tomita,et al.  Stabilizing synthetic data in the DNA of living organisms , 2008, Systems and Synthetic Biology.

[24]  Félix Balado On the embedding capacity of DNA strands under substitution, insertion, and deletion mutations , 2010, Electronic Imaging.

[25]  Edgar N. Gilbert,et al.  Synchronization of binary messages , 1960, IRE Trans. Inf. Theory.

[26]  M Castillo,et al.  From Hard Drives to Flash Drives to DNA Drives , 2014, American Journal of Neuroradiology.

[27]  Rahul Vishwakarma,et al.  HIGH DENSITY DATA STORAGE IN DNA USING AN EFFICIENT MESSAGE ENCODING SCHEME , 2012 .

[28]  R J Lipton,et al.  DNA solution of hard computational problems. , 1995, Science.

[29]  C Bancroft,et al.  Long-Term Storage of Information in DNA , 2001, Science.

[30]  Zenon Chaczko,et al.  Review of Big Data Storage Based on DNA Computing , 2015, 2015 Asia-Pacific Conference on Computer Aided System Engineering.

[31]  D. J. Wheeler,et al.  A Block-sorting Lossless Data Compression Algorithm , 1994 .

[32]  Jack Parker Computing with DNA , 2003, EMBO reports.

[33]  J P Cox,et al.  Long-term data storage in DNA. , 2001, Trends in biotechnology.

[34]  Ewan Birney,et al.  Towards practical, high-capacity, low-maintenance information storage in synthesized DNA , 2013, Nature.

[35]  Martin Schmiedecker,et al.  Digital Forensics and Cyber Crime , 2017, Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering.

[36]  Monica Borda,et al.  SECRET WRITING BY DNA HYBRIDIZATION , 2009 .

[37]  M. Tomita,et al.  Alignment‐Based Approach for Durable Data Storage into Living Organisms , 2007, Biotechnology progress.

[38]  Geoff C. Smith,et al.  Some possible codes for encrypting data in DNA , 2003, Biotechnology Letters.

[39]  Simon Litsyn,et al.  Error-correcting codes with bounded running digital sum , 1991, IEEE Trans. Inf. Theory.