Evolutionary voting kernel machines for cyclooxygenase-2 inhibitor activity comparisons

With the growing interest of biological data prediction and chemical data prediction, more complicated kernels are designed to measure data similarities. In (1), we proposed a kind of evolutionary granular kernel trees (EGKTs) for drug activity comparisons. In EGKTs, feature granules and tree structures are predefined based on the possible substituent locations. In (2), we proposed a granular kernel tree structure evolving system (GKTSES) to evolve the structures of GKTs in the case that we lack knowledge to predefine kernel trees. In this paper, evolutionary voting kernel machines (EVKMs) are presented based on GKTSES. Experimental results show that EVKMs are more stable than GKTSES in cyclooxygenase-2 inhibitor activity comparisons. Index Terms—Drug activity comparisons, kernel, genetic algorithms, granular kernel trees, support vector machines.

[1]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[2]  Hisashi Kashima,et al.  Kernels for Semi-Structured Data , 2002, ICML.

[3]  Hisashi Kashima,et al.  Kernels for graph classification , 2002 .

[4]  Thomas Gärtner,et al.  A survey of kernels for structured data , 2003, SKDD.

[5]  Nello Cristianini,et al.  Classification using String Kernels , 2000 .

[6]  Gregory W. Kauffman,et al.  QSAR and k-Nearest Neighbor Classification Analysis of Selective Cyclooxygenase-2 Inhibitors Using Topologically-Based Numerical Descriptors , 2001, J. Chem. Inf. Comput. Sci..

[7]  Michael G. Madden,et al.  The Genetic Evolution of Kernels for Support Vector Machine Classifiers , 2004 .

[8]  David Haussler,et al.  Convolution kernels on discrete structures , 1999 .

[9]  C. Berg,et al.  Harmonic Analysis on Semigroups: Theory of Positive Definite and Related Functions , 1984 .

[10]  Yanqing Zhang,et al.  Granular Kernel Trees with parallel Genetic Algorithms for drug activity comparisons , 2007, Int. J. Data Min. Bioinform..

[11]  N. Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods: Kernel-Induced Feature Spaces , 2000 .

[12]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[13]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[14]  C. Berg,et al.  Harmonic Analysis on Semigroups , 1984 .

[15]  Michael Collins,et al.  Convolution Kernels for Natural Language , 2001, NIPS.

[16]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[17]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[18]  Yanqing Zhang,et al.  Evolutionary Construction of Granular Kernel Trees for Cyclooxygenase-2 Inhibitor Activity Comparison , 2006, Trans. Comp. Sys. Biology.

[19]  Thomas Gärtner,et al.  On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.