COX-2 activity prediction in Chinese medicine using neural network based ensemble learning methods

In this paper, neural network based ensemble learning methods are introduced in predicting activities of COX-2 inhibitors in Chinese medicine quantitative structure-activity relationship (QSAR) research. Three different ensemble learning methods: bagging, boosting and random subspace are tested using neural networks as basic regression rules. Experiments show that all three methods, especially boosting, are fast and effective ways in the activity prediction of Chinese medicine QSAR research, which is generally based on a small amount of training samples.

[1]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[2]  John Hallam,et al.  IEEE International Joint Conference on Neural Networks , 2005 .

[3]  L. Hall,et al.  Molecular Structure Description: The Electrotopological State , 1999 .

[4]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[5]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[6]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[7]  Yoshua Bengio,et al.  Boosting Neural Networks , 2000, Neural Computation.

[8]  Tomasz Arodz,et al.  Computational methods in developing quantitative structure-activity relationships (QSAR): a review. , 2006, Combinatorial chemistry & high throughput screening.

[9]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[10]  Zehong Yang,et al.  Feature Selection in Predicting the Activity of Cyclooxygenase-2 Inhibitors , 2006, Artificial Intelligence and Applications.

[11]  A. Gui Study on quantitative structure-activity relationships of COX/5-LO dual inhibitors , 2003 .

[12]  P. Schellhammer,et al.  Boosted decision tree analysis of surface-enhanced laser desorption/ionization mass spectral serum profiles discriminates prostate cancer from noncancer patients. , 2002, Clinical chemistry.

[13]  H. Wiener Structural determination of paraffin boiling points. , 1947, Journal of the American Chemical Society.

[14]  Ting Wang,et al.  Boosting: An Ensemble Learning Tool for Compound Classification and QSAR Modeling , 2005, J. Chem. Inf. Model..

[15]  S. Jachak Cyclooxygenase inhibitory natural products: current status. , 2006, Current medicinal chemistry.

[16]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[17]  J. Ross Quinlan,et al.  Bagging, Boosting, and C4.5 , 1996, AAAI/IAAI, Vol. 1.

[18]  Alan F. Murray,et al.  International Joint Conference on Neural Networks , 1993 .

[19]  Michel Petitjean,et al.  Applications of the radius-diameter diagram to the classification of topological and geometrical shapes of chemical compounds , 1992, J. Chem. Inf. Comput. Sci..

[20]  Ying Liu,et al.  Drug design by machine learning: ensemble learning for QSAR modeling , 2005, Fourth International Conference on Machine Learning and Applications (ICMLA'05).

[21]  Robert P. W. Duin,et al.  Bagging, Boosting and the Random Subspace Method for Linear Classifiers , 2002, Pattern Analysis & Applications.

[22]  B. Fan,et al.  CoMFA/CoMSIA/HQSAR and Docking Study of the Binding Mode of Selective Cyclooxygenase (COX‐2) Inhibitors , 2004, QSAR & combinatorial science.

[23]  Wang Jiaxin,et al.  Feature selection in predicting the activity of cyclooxygenase-2 inhibitors , 2006 .

[24]  Zehong Yang,et al.  Feature Selection and Activity Prediction in Chinese Medicine Research Using a Hybrid Model GA-SVM , 2006, MLMTA.