Study on the genetic code: comparison with multiplexed codes

The genetic code is a key concept in genetics. This code establishes the link between the DNA (desoxyribonucleic acid), composed of nucleic acids, and proteins, composed of amino acids. The genetic code is known to be universal, indeed almost all organisms use this code, and it is degenerated, this means that several sequences of nucleic acids can be translated into the same amino acid. This feature is also present in multiplexed codes which is a new concept recently introduced for robust source coding. In this paper we present a comparison between multiplexed codes and the genetic code. We show why this comparison is relevant, and we point out the fact that the genetic code is almost optimal in the context of multiplexed codes theory.

[1]  Stéphane Grumbach,et al.  A New Challenge for Compression Algorithms: Genetic Sequences , 1994, Inf. Process. Manag..

[2]  Hervé Jégou,et al.  Robust multiplexed codes for compression of heterogeneous data , 2005, IEEE Transactions on Information Theory.

[3]  E. Szathmáry,et al.  What is the optimum size for the genetic alphabet? , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[4]  G. Battail,et al.  Should genetics get an information-theoretic education? , 2006, IEEE Engineering in Medicine and Biology Magazine.

[5]  En-Hui Yang,et al.  Estimating DNA sequence entropy , 2000, SODA '00.

[6]  M.K. Gupta,et al.  The quest for error correction in biology , 2006, IEEE Engineering in Medicine and Biology Magazine.

[7]  Benoist,et al.  On the Entropy of DNA: Algorithms and Measurements based on Memory and Rapid Convergence , 1994 .