Using hybrid GA-ANN to predict biological activity of HIV protease inhibitors

The prediction of biological activity of a chemical compound from its structural features, representing its physico-chemical properties, plays an important role in drug discovery, design and development. Since the biological data is highly non-linear, the machine-learning techniques have been widely used for modeling it. In the present work, the clustering, genetic algorithm (GA) and artificial neural networks (ANN) are used to develop computational prediction models on a dataset of HIV protease inhibitors. The hybrid GA- ANN technique is used for feature selection. The ANN-QSAR prediction models are then developed to link the structures to their reported biological activity. These models can be useful for predicting the biological activity of new untested HIV protease inhibitors and virtual screening for identifying new lead compounds.

[1]  Julio Caballero,et al.  Ensembles of Bayesian‐regularized Genetic Neural Networks for Modeling of Acetylcholinesterase Inhibition by Huprines , 2006, Chemical biology & drug design.

[2]  Walter Cedeño,et al.  Using particle swarms for the development of QSAR models based on K-nearest neighbor and kernel regression , 2003, J. Comput. Aided Mol. Des..

[3]  Zheng Rong Yang,et al.  Bio-basis function neural network for prediction of protease cleavage sites in proteins , 2005, IEEE Transactions on Neural Networks.

[4]  A. Roli Artificial Neural Networks , 2012, Lecture Notes in Computer Science.

[5]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[6]  E. LaVoie,et al.  Bioisosterism: A Rational Approach in Drug Design. , 1996, Chemical reviews.

[7]  Pavel Berkhin,et al.  A Survey of Clustering Data Mining Techniques , 2006, Grouping Multidimensional Data.

[8]  Thomas Bäck,et al.  Theory of Genetic Algorithms , 2001, Current Trends in Theoretical Computer Science.

[9]  Andreas Persidis,et al.  Artificial intelligence for drug design , 1997, Nature Biotechnology.

[10]  T. Niwa Prediction of biological targets using probabilistic neural networks and atom-type descriptors. , 2004, Journal of medicinal chemistry.

[11]  Y.S. Hung,et al.  Gene selection for Brain Cancer Classification , 2006, 2006 International Conference of the IEEE Engineering in Medicine and Biology Society.

[12]  Didier Villemin,et al.  Neural Networks: Accurate Nonlinear QSAR Model for HEPT Derivatives , 2003, J. Chem. Inf. Comput. Sci..

[13]  Ernö Pretsch,et al.  Application of genetic algorithms in molecular modeling , 1994, J. Comput. Chem..

[14]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[15]  Philipp Slusallek,et al.  Introduction to real-time ray tracing , 2005, SIGGRAPH Courses.

[16]  Jonathan D. Hirst,et al.  Contemporary QSAR Classifiers Compared , 2007, J. Chem. Inf. Model..

[17]  Daniel Boley,et al.  Clustering and classification techniques to assess aquatic toxicity , 2000, KES'2000. Fourth International Conference on Knowledge-Based Intelligent Engineering Systems and Allied Technologies. Proceedings (Cat. No.00TH8516).

[18]  Garland R. Marshall,et al.  VALIDATE: A New Method for the Receptor-Based Prediction of Binding Affinities of Novel Ligands , 1996 .

[19]  M. Møller A Scaled Conjugate Gradient Algorithm for Fast Supervised Learning , 1990 .

[20]  David J. C. MacKay,et al.  Bayesian Interpolation , 1992, Neural Computation.

[21]  Mark Dybul,et al.  Guidelines for Using Antiretroviral Agents among HIV-Infected Adults and Adolescents: The Panel on Clinical Practices for Treatment of HIV* , 2002, Annals of Internal Medicine.

[22]  Jan A Snyman,et al.  Practical Mathematical Optimization: An Introduction to Basic Optimization Theory and Classical and New Gradient-Based Algorithms , 2005 .

[23]  Ning Qian,et al.  On the momentum term in gradient descent learning algorithms , 1999, Neural Networks.

[24]  A. Molla,et al.  Discovery of ritonavir, a potent inhibitor of HIV protease with high oral bioavailability and clinical efficacy. , 1998, Journal of medicinal chemistry.

[25]  M. D. Kelly,et al.  A new method for estimating the importance of hydrophobic groups in the binding site of a protein. , 2005, Journal of medicinal chemistry.

[26]  Martin Fodslette Møller,et al.  A scaled conjugate gradient algorithm for fast supervised learning , 1993, Neural Networks.

[27]  Martin T. Hagan,et al.  Gauss-Newton approximation to Bayesian learning , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[28]  Jens Sadowski,et al.  Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification , 2003, J. Chem. Inf. Comput. Sci..

[29]  Sean Ekins,et al.  Application of data mining approaches to drug delivery. , 2006, Advanced drug delivery reviews.

[30]  Strategy of Drug Design: A Molecular Guide to Biological Activity , 1975 .

[31]  C. Hansch,et al.  On the structure of medicinal chemistry. , 1976, Journal of medicinal chemistry.

[32]  Magnus R. Hestenes,et al.  Conjugate Direction Methods in Optimization , 1980 .