Learning functions using randomized genetic code-like transformations: probabilistic properties and experimentations

Inductive learning of nonlinear functions plays an important role in constructing predictive models and classifiers from data. We explore a novel randomized approach to construct linear representations of nonlinear functions proposed elsewhere [H. Kargupta (2001)], [H. Kargupta et al., (2002)]. This approach makes use of randomized codebooks, called the genetic code-like transformations (GCTs) for constructing an approximately linear representation of a nonlinear target function. We first derive some of the results presented elsewhere [H. Kargupta et al., (2002)] in a more general context. Next, it investigates different probabilistic and limit properties of GCTs. It also presents several experimental results to demonstrate the potential of this approach.

[1]  Marvin Minsky,et al.  Perceptrons: An Introduction to Computational Geometry , 1969 .

[2]  Hillol Kargupta,et al.  Toward Machine Learning Through Genetic Code-like Transformations , 2002, Genetic Programming and Evolvable Machines.

[3]  Hillol Kargupta,et al.  Knowledge discovery from heterogeneous data streams using fourier spectrum of decision trees , 2001 .

[4]  Hornos Algebraic model for the evolution of the genetic code. , 1993, Physical review letters.

[5]  MansourYishay,et al.  Constant depth circuits, Fourier transform, and learnability , 1993 .

[6]  David E. Goldberg,et al.  The Nonuniform Walsh-Schema Transform , 1990, FOGA.

[7]  Eyal Kushilevitz,et al.  Learning decision trees using the Fourier spectrum , 1991, STOC '91.

[8]  P Béland,et al.  The origin and evolution of the genetic code. , 1994, Journal of theoretical biology.

[9]  R. Knight,et al.  The Early Evolution of the Genetic Code , 2000, Cell.

[10]  J. Bashford,et al.  A supersymmetric model for the evolution of the genetic code. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[11]  J. Monod,et al.  Genetic regulatory mechanisms in the synthesis of proteins. , 1961, Journal of molecular biology.

[12]  J Otsuka,et al.  Evolution of genetic information flow from the viewpoint of protein sequence similarity. , 1994, Journal of theoretical biology.

[13]  Hillol Kargupta,et al.  A Fourier Analysis Based Approach to Learning Decision Trees in a Distributed Environment , 2001, SDM.

[14]  P. Schuster The Role of Neutral Mutations in the Evolution of RNA Molecules , 1997 .

[15]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[16]  Hillol Kargupta,et al.  A Striking Property of Genetic Code-like Transformations , 2001, Complex Syst..

[17]  Christian M. Reidys,et al.  Evolution on Random Structures , 1995 .

[18]  Kalyanmoy Deb,et al.  Messy Genetic Algorithms: Motivation, Analysis, and First Results , 1989, Complex Syst..

[19]  Hillol Kargupta,et al.  SEARCH, Computational Processes in Evolution, and Preliminary Development of the Gene Expression Messy Genetic Algorithm , 1997, Complex Syst..

[20]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[21]  Dustin Boswell,et al.  Introduction to Support Vector Machines , 2002 .

[22]  Noam Nisan,et al.  Constant depth circuits, Fourier transform, and learnability , 1989, 30th Annual Symposium on Foundations of Computer Science.

[23]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[24]  J. C. Jackson The harmonic sieve: a novel application of Fourier analysis to machine learning theory and practice , 1996 .

[25]  Stuart A. Kauffman,et al.  ORIGINS OF ORDER , 2019, Origins of Order.

[26]  Wolfgang Banzhaf,et al.  The evolution of genetic code in Genetic Programming , 1999 .

[27]  John Daniel. Bagley,et al.  The behavior of adaptive systems which employ genetic and correlation algorithms : technical report , 1967 .

[28]  L. Hurst,et al.  Early fixation of an optimal genetic code. , 2000, Molecular biology and evolution.

[29]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[30]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.