Embedding Permanent Watermarks in Synthetic Genes

As synthetic biology advances, labeling of genes or organisms, like other high-value products, will become important not only to pinpoint their identity, origin, or spread, but also for intellectual property, classification, bio-security or legal reasons. Ideally information should be inseparably interlaced into expressed genes. We describe a method for embedding messages within open reading frames of synthetic genes by adapting steganographic algorithms typically used for watermarking digital media files. Text messages are first translated into a binary string, and then represented in the reading frame by synonymous codon choice. To aim for good expression of the labeled gene in its host as well as retain a high degree of codon assignment flexibility for gene optimization, codon usage tables of the target organism are taken into account. Preferably amino acids with 4 or 6 synonymous codons are used to comprise binary digits. Several different messages were embedded into open reading frames of T7 RNA polymerase, GFP, human EMG1 and HIV gag, variously optimized for bacterial, yeast, mammalian or plant expression, without affecting their protein expression or function. We also introduced Vigenère polyalphabetic substitution to cipher text messages, and developed an identifier as a key to deciphering codon usage ranking stored for a specific organism within a sequence of 35 nucleotides.

[1]  M. Graf,et al.  Multiparameter RNA and Codon Optimization: A Standardized Tool to Assess and Enhance Autologous Mammalian Gene Expression , 2011, PloS one.

[2]  D. Raab,et al.  The GeneOptimizer Algorithm: using a sliding window approach to cope with the vast sequence space in multiparameter DNA sequence optimization , 2010, Systems and Synthetic Biology.

[3]  Thomas H Segall-Shapiro,et al.  Creation of a Bacterial Cell Controlled by a Chemically Synthesized Genome , 2010, Science.

[4]  Jan Kubicek,et al.  Gene optimization mechanisms: A multi-gene study reveals a high success rate of full-length human proteins expressed in Escherichia coli , 2010, Protein science : a publication of the Protein Society.

[5]  Dominik Heider,et al.  DNA watermarks in non-coding regulatory sequences , 2009, BMC Research Notes.

[6]  Sophia Sonnewald,et al.  The Xanthomonas campestris pv. vesicatoria type III effector protein XopJ inhibits protein secretion: evidence for interference with cell wall-associated defense responses. , 2009, Molecular plant-microbe interactions : MPMI.

[7]  Dominik Heider,et al.  Watermarking sexually reproducing diploid organisms , 2008, Bioinform..

[8]  R. Wagner,et al.  Importance of Protease Cleavage Sites within and Flanking Human Immunodeficiency Virus Type 1 Transframe Protein p6* for Spatiotemporal Regulation of Protease Activation , 2008, Journal of Virology.

[9]  D. Heider,et al.  DNA watermarks: A proof of concept , 2008, BMC Molecular Biology.

[10]  Dominik Heider,et al.  DNA-based watermarks using the DNA-Crypt algorithm , 2007, BMC Bioinformatics.

[11]  M. Preuss,et al.  A role for the RabA4b effector protein PI-4Kβ1 in polarized expansion of root hair cells in Arabidopsis thaliana , 2006, The Journal of cell biology.

[12]  R. D. Gietz,et al.  Yeast transformation by the LiAc/SS Carrier DNA/PEG method. , 2006, Methods in molecular biology.

[13]  Masanori Arita,et al.  Secret Signatures Inside Genomic DNA , 2004, Biotechnology progress.

[14]  Shan Ling Pan,et al.  Using e-CRM for a unified view of the customer , 2003, CACM.

[15]  Pak Chung Wong,et al.  Organic data memory using the DNA approach , 2003, CACM.

[16]  Miodrag Potkonjak,et al.  Hiding Data in DNA , 2002, Information Hiding.

[17]  M. Graf,et al.  Multiple Effects of Codon Usage Optimization on Expression and Immunogenicity of DNA Candidate Vaccines Encoding the Human Immunodeficiency Virus Type 1 Gag Protein , 2001, Journal of Virology.

[18]  Catherine Taylor Clelland,et al.  Hiding messages in DNA microdots , 1999, Nature.

[19]  A. Willems,et al.  Studies on the transformation of intact yeast cells by the LiAc/SS‐DNA/PEG procedure , 1995, Yeast.