On the Shannon capacity of DNA data embedding

This paper first gives a brief overview of information embedding in deoxyribonucleic acid (DNA) sequences and its applications. DNA data embedding can be considered a particular case of communications with or without side information, depending on whether coding or noncoding DNA sequences are used, respectively. Although several DNA data embedding methods have been proposed over the last decade, the maximum amount of information that can theoretically be embedded, that is, the Shannon capacity of DNA data embedding, remains an open question. This is the main question tackled in this paper.
