The origin and evolution of the genetic code.

We argue that a primitive genetic code with only 20 separate words explains that there are 20 coded amino acids in modern life. The existence of 64 words on the modern genetic code requires modern life to read almost exclusively one strand of DNA in one direction. In our primitive code, both the original and the complementary sequence are read in either direction to give the same strings of amino acids. The algebra of complements forces synonymy of primitive codons so as to reduce the 64 independent codons of the modern code to exactly 20 independent separate words in the primitive condition. The synonymy in the modern code is the result of selection rather than algebraic forcing. The primitive code has almost no resilience to base mutations, unlike the third base redundancy of the modern code. Our primitive and the modern code are orthogonal. If palindromic proteins were coded by hairpin DNA or RNA, then (i) no punctuation would be needed; (ii) the reverse reading would give the same secondarily folded protein structure; and (iii) the sugar backbone would be read in the conventional 5' to 3' direction for the original arm and its complement. Modern copying of genetic material is almost always antiparallel. However, occasional parallel copying, as does occur in modern life, would give the complementary hairpin that would also read 5' to 3' along its entire length.(ABSTRACT TRUNCATED AT 250 WORDS)