A Striking Property of Genetic Code-like Transformations

The gene expression process in nature plays a key role in evaluating the fitness of DNA through the production of different proteins in different cells. The production of proteins from DNA goes through different stages. Among others, the transcription stage produces the mRNA from the DNA and translation produces the amino acid sequence in proteins from the mRNA. The translation process is accomplished by mapping the mRNA sequence using a transformation called the genetic code. This code considers every consequent triplet (codon) of nucleic acids in the mRNA sequence and maps it to a corresponding amino acid. This paper shows that genetic code-like transformations (GCTs) introduce very interesting properties to the representation of a genetic fitness function. It presents a Fourier1 analysis of GCTs. It points out that such transformations can convert some function representations of exponential description in Fourier basis to a description that is highly suitable for polynomial-complexity approximation. More precisely, such transformations can construct a Fourier representation with only a polynomial number of terms that are exponentially more significant than the rest. Polynomial-complexity approximation of functions from data is a fundamental problem in inductive learning, data mining, search, and optimization. Therefore the work has important implications in these areas. It is unlikely that such representations can be constructed for all functions. However, since such transformations appear to work well in nature, the class of such functions may not be trivial and should be explored further.

[1]  Hillol Kargupta,et al.  SEARCH, Computational Processes in Evolution, and Preliminary Development of the Gene Expression Messy Genetic Algorithm , 1997, Complex Syst..

[2]  Gang Wang,et al.  Revisiting the GEMGA: scalable evolutionary optimization through linkage learning , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[3]  Hillol Kargupta,et al.  Extending the class of order-k delineable problems for the gene expression messy genetic algorithm , 1996 .

[4]  Annie S. Wu,et al.  Empirical Studies of the Genetic Algorithm with Noncoding Segments , 1995, Evolutionary Computation.

[5]  Christian M. Reidys,et al.  Evolution on Random Structures , 1995 .

[6]  Kalyanmoy Deb,et al.  Messy Genetic Algorithms: Motivation, Analysis, and First Results , 1989, Complex Syst..

[7]  J. Monod,et al.  Genetic regulatory mechanisms in the synthesis of proteins. , 1961, Journal of Molecular Biology.

[8]  Hillol Kargupta,et al.  The Gene Expression Messy Genetic Algorithm , 1996, Proceedings of IEEE International Conference on Evolutionary Computation.

[9]  Gunar E. Liepins,et al.  Polynomials, Basis Sets, and Deceptiveness in Genetic Algorithms , 1991, Complex Syst..

[10]  B. Sankur,et al.  Applications of Walsh and related functions , 1986 .

[11]  Dirk Thierens Estimating the significant non-linearities in the genome problem-coding , 1999 .

[12]  Hornos Algebraic model for the evolution of the genetic code. , 1993, Physical review letters.

[13]  Hillol Kargupta,et al.  Function induction, gene expression, and evolutionary representation construction , 1999 .

[14]  Annie S. Wu,et al.  A Survey of Intron Research in Genetics , 1996, PPSN.

[15]  A. D. Bethke,et al.  Comparison of genetic algorithms and gradient-based optimizers on parallel processors : efficiency of use of processing capacity , 1976 .

[16]  Peter F. Stadler,et al.  Fast Fourier Transform for Fitness Landscapes , 2002 .

[17]  Stuart A. Kauffman,et al.  ORIGINS OF ORDER , 2019, Origins of Order.

[18]  Hillol Kargupta,et al.  A perspective on the foundation and evolution of the linkage learning genetic algorithms , 2000 .

[19]  D. R. McGregor,et al.  Designing application-specific neural networks using the structured genetic algorithm , 1992, [Proceedings] COGANN-92: International Workshop on Combinations of Genetic Algorithms and Neural Networks.

[20]  J. Walsh A Closed Set of Normal Orthogonal Functions , 1923 .

[21]  Hillol Kargupta,et al.  DNA To Protein: Transformations and Their Possible Role in Linkage Learning , 1997, ICGA.

[22]  P Béland,et al.  The origin and evolution of the genetic code. , 1994, Journal of theoretical biology.

[23]  J. Bashford,et al.  A supersymmetric model for the evolution of the genetic code. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Dirk Thierens,et al.  Scalability Problems of Simple Genetic Algorithms , 1999, Evolutionary Computation.

[25]  Alden H. Wright,et al.  The Simple Genetic Algorithm and the Walsh Transform: Part I, Theory , 1998, Evolutionary Computation.

[26]  J Otsuka,et al.  Evolution of genetic information flow from the viewpoint of protein sequence similarity. , 1994, Journal of theoretical biology.

[27]  Wolfgang Banzhaf,et al.  The evolution of genetic code in Genetic Programming , 1999 .

[28]  David E. Goldberg,et al.  Genetic Algorithms and Walsh Functions: Part II, Deception and Its Analysis , 1989, Complex Syst..

[29]  Hillol Kargupta Gene expression: The missing link in evolutionary computation , 1997 .

[30]  David E. Goldberg,et al.  Genetic Algorithms and Walsh Functions: Part I, A Gentle Introduction , 1989, Complex Syst..

[31]  P. Schuster The Role of Neutral Mutations in the Evolution of RNA Molecules , 1997 .

[32]  Alden H. Wright,et al.  The Simple Genetic Algorithm and the Walsh Transform: Part II, The Inverse , 1998, Evolutionary Computation.

[33]  J. C. Jackson The harmonic sieve: a novel application of Fourier analysis to machine learning theory and practice , 1996 .

[34]  Melanie Mitchell,et al.  The Performance of Genetic Algorithms on Walsh Polynomials: Some Anomalous Results and Their Explanation , 1991, ICGA.

[35]  L. Darrell Whitley,et al.  A Tractable Walsh Analysis of SAT and its Implications for Genetic Algorithms , 1998, AAAI/IAAI.

[36]  R. Rosenberg Simulation of genetic populations with biochemical properties : technical report , 1967 .

[37]  Sanghamitra Bandyopadhyay,et al.  Further Experimentations on the Scalability of the GEMGA , 1998, PPSN.

[38]  John Daniel. Bagley,et al.  The behavior of adaptive systems which employ genetic and correlation algorithms : technical report , 1967 .

[39]  Mike Mannion,et al.  Complex systems , 1997, Proceedings International Conference and Workshop on Engineering of Computer-Based Systems.

[40]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[41]  L. Darrell Whitley,et al.  Predicting Epistasis from Mathematical Models , 1999, Evolutionary Computation.

[42]  Anne Brindle,et al.  Genetic algorithms for function optimization , 1980 .

[43]  Eyal Kushilevitz,et al.  Learning decision trees using the Fourier spectrum , 1991, STOC '91.