Classification of proteomic kinetic patterns using supervised genetic programming

The rapidly emerging field of quantitative proteomics has established itself as a credible approach for understanding of the biology of whole organisms. Classification of proteins according to the level of their expression during a particular process allows discovering causal relationships among genes and proteins involved in the process. In the paper we present a supervised method of classification of proteomic kinetic patterns based on genetic programming allowing for extraction of user defined patterns from a database of kinetic profiles. The method combines robustness of genetic programming algorithm with the flexibility given by user interaction.

[1]  Hitoshi Iba,et al.  Classification of Gene Expression Profile Using Combinatory Method of Evolutionary Computation and Machine Learning , 2004, Genetic Programming and Evolvable Machines.

[2]  Chih-Jian Lih,et al.  Analysis of DNA microarrays using algorithms that employ rule-based expert knowledge , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Patrick Tan,et al.  Genetic algorithms applied to multi-class prediction for the analysis of gene expression data , 2003, Bioinform..

[4]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Lalit M. Patnaik,et al.  Application of genetic programming for multicategory pattern classification , 2000, IEEE Trans. Evol. Comput..

[6]  Jiří Vohradský,et al.  Adaptive classification of two‐dimensional gel electrophoretic spot patterns by neural networks and cluster analysis , 1997, Electrophoresis.

[7]  Venu Govindaraju,et al.  Issues in evolving GP based classifiers for a pattern recognition task , 2004, Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753).

[8]  Christian A. Rees,et al.  Distinctive gene expression patterns in human mammary epithelial cells and breast cancers. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Vic Ciesielski,et al.  Representing classification problems in genetic programming , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).

[10]  Jano I. van Hemert,et al.  A Comparison of Genetic Programming Variants for Data Classification , 1999, IDA.

[11]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Susmita Datta,et al.  Comparisons and validation of statistical clustering techniques for microarray gene expression data , 2003, Bioinform..

[13]  Alex A. Freitas,et al.  Data Mining with Constrained-syntax Genetic Programming: Applications in Medical Data Sets , 2001 .

[14]  J. Vohradský,et al.  Identification of procaryotic developmental stages by statistical analyses of two‐dimensional gel patterns , 1997, Electrophoresis.

[15]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Richard M. Karp,et al.  CLIFF: clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts , 2001, ISMB.