Alignment‐Based Approach for Durable Data Storage into Living Organisms

The practical realization of DNA data storage is a major scientific goal. Here we introduce a simple, flexible, and robust data storage and retrieval method based on sequence alignment of the genomic DNA of living organisms. Duplicated data encoded by different oligonucleotide sequences was inserted redundantly into multiple loci of the Bacillus subtilis genome. Multiple alignment of the bit data sequences decoded by B. subtilis genome sequences enabled the retrieval of stable and compact data without the need for template DNA, parity checks, or error‐correcting algorithms. Combined with the computational simulation of data retrieval from mutated message DNA, a practical use of this alignment‐based method is discussed.

[1]  D. Dubnau,et al.  Fate of transforming DNA following uptake by competent Bacillus subtilis. I. Formation and properties of the donor-recipient complex. , 1971 .

[2]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[3]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[4]  S. Kowalczykowski,et al.  Biochemistry of homologous recombination in Escherichia coli. , 1994, Microbiological reviews.

[5]  Catherine Taylor Clelland,et al.  Hiding messages in DNA microdots , 1999, Nature.

[6]  A. Kuzminov Recombinational Repair of DNA Damage inEscherichia coli and Bacteriophage λ , 1999, Microbiology and Molecular Biology Reviews.

[7]  A. Kuzminov Recombinational repair of DNA damage in Escherichia coli and bacteriophage lambda. , 1999, Microbiology and molecular biology reviews : MMBR.

[8]  A Y3K bug , 2000, Nature Biotechnology.

[9]  J P Cox,et al.  Long-term data storage in DNA. , 2001, Trends in biotechnology.

[10]  C Bancroft,et al.  Long-Term Storage of Information in DNA , 2001, Science.

[11]  Pak Chung Wong,et al.  Organic data memory using the DNA approach , 2003, CACM.

[12]  Far different levels of gene expression provided by an oriented cloning system in Bacillus subtilis and Escherichia coli. , 2003, FEMS microbiology letters.

[13]  Masanori Arita,et al.  Comma-free design for DNA words , 2004, CACM.

[14]  Geoff C. Smith,et al.  Some possible codes for encrypting data in DNA , 2003, Biotechnology Letters.

[15]  Masanori Arita,et al.  Secret Signatures Inside Genomic DNA , 2004, Biotechnology progress.

[16]  M. Itaya,et al.  Combining two genomes in one cell: stable cloning of the Synechocystis PCC6803 genome in the Bacillus subtilis 168 genome. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Laura Bonetta,et al.  Genome sequencing in the fast lane , 2006, Nature Methods.