On the Use of Neural Network Ensembles in QSAR and QSPR

Despite their growing popularity among neural network practitioners, ensemble methods have not been widely adopted in structure-activity and structure-property correlation. Neural networks are inherently unstable, in that small changes in the training set and/or training parameters can lead to large changes in their generalization performance. Recent research has shown that by capitalizing on the diversity of the individual models, ensemble techniques can minimize uncertainty and produce more stable and accurate predictors. In this work, we present a critical assessment of the most common ensemble technique known as bootstrap aggregation, or bagging, as applied to QSAR and QSPR. Although aggregation does offer definitive advantages, we demonstrate that bagging may not be the best possible choice and that simpler techniques such as retraining with the full sample can often produce superior results. These findings are rationalized using Krogh and Vedelsby's decomposition of the generalization error into a term that measures the average generalization performance of the individual networks and a term that measures the diversity among them. For networks that are designed to resist over-fitting, the benefits of aggregation are clear but not overwhelming.

[1]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[2]  Dimitris K. Agrafiotis,et al.  A Novel Method for Building Regression Tree Models for QSAR Based on Artificial Ant Colony Systems , 2001, J. Chem. Inf. Comput. Sci..

[3]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[4]  Ajay A unified framework for using neural networks to build QSARs. , 1993, Journal of medicinal chemistry.

[5]  Kimito Funatsu,et al.  GA Strategy for Variable Selection in QSAR Studies: GA-Based PLS Analysis of Calcium Channel Antagonists , 1997, J. Chem. Inf. Comput. Sci..

[6]  K Bowden,et al.  Structure-activity relationships of dihydrofolate reductase inhibitors. , 1993, Journal of chemotherapy.

[7]  H Ichikawa,et al.  Neural networks applied to structure-activity relationships. , 1990, Journal of medicinal chemistry.

[8]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[9]  D. Manallack,et al.  Analysis of linear and nonlinear QSAR data using neural networks. , 1994, Journal of medicinal chemistry.

[10]  Padraig Cunningham,et al.  The NeuralBAG algorithm: optimizing generalization performance in bagged neural networks , 1999, ESANN.

[11]  L. Breiman SOME INFINITY THEORY FOR PREDICTOR ENSEMBLES , 2000 .

[12]  James Devillers,et al.  Neural Networks in QSAR and Drug Design , 1996 .

[13]  D. Livingstone,et al.  Structure-activity relationships of antifilarial antimycin analogues: a multivariate pattern recognition study. , 1990, Journal of medicinal chemistry.

[14]  S. So,et al.  Application of neural networks: quantitative structure-activity relationships of the derivatives of 2,4-diamino-5-(substituted-benzyl)pyrimidines as DHFR inhibitors. , 1992, Journal of medicinal chemistry.

[15]  Leo Breiman,et al.  Stacked regressions , 2004, Machine Learning.

[16]  A. N. Tikhonov,et al.  Solutions of ill-posed problems , 1977 .

[17]  Jonathan D. Hirst,et al.  Quantitative structure-activity relationships by neural networks and inductive logic programming. I. The inhibition of dihydrofolate reductase by pyrimidines , 1994, J. Comput. Aided Mol. Des..

[18]  T. A. Andrea,et al.  Applications of neural networks in quantitative structure-activity relationships of dihydrofolate reductase inhibitors. , 1991, Journal of medicinal chemistry.

[19]  David Hartsough,et al.  Toward an Optimal Procedure for Variable Selection and QSAR Model Building , 2001, J. Chem. Inf. Comput. Sci..

[20]  Brian T. Luke,et al.  Evolutionary Programming Applied to the Development of Quantitative Structure-Activity Relationships and Quantitative Structure-Property Relationships , 1994, J. Chem. Inf. Comput. Sci..

[21]  Nils J. Nilsson,et al.  Learning Machines: Foundations of Trainable Pattern-Classifying Systems , 1965 .

[22]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[23]  Hxugo Kubiny Variable Selection in QSAR Studies. I. An Evolutionary Algorithm , 1994 .

[24]  Anders Krogh,et al.  Neural Network Ensembles, Cross Validation, and Active Learning , 1994, NIPS.

[25]  Tom Heskes Balancing Between Bagging and Bumping , 1996, NIPS.

[26]  Padraig Cunningham,et al.  Confidence and prediction intervals for neural network ensembles , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[27]  Johann Gasteiger,et al.  Multivariate structure‐activity relationships between data from a battery of biological tests and an ensemble of structure descriptors: The PLS method , 1984 .

[28]  Anton J. Hopfinger,et al.  Application of Genetic Function Approximation to Quantitative Structure-Activity Relationships and Quantitative Structure-Property Relationships , 1994, J. Chem. Inf. Comput. Sci..

[29]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[30]  D. Agrafiotis,et al.  Variable selection for QSAR by artificial ant colony systems , 2002, SAR and QSAR in environmental research.

[31]  Peter C. Jurs,et al.  Automated Descriptor Selection for Quantitative Structure-Activity Relationships Using Generalized Simulated Annealing , 1995, J. Chem. Inf. Comput. Sci..

[32]  Eric Bauer,et al.  An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants , 1999, Machine Learning.

[33]  James H. Wikel,et al.  The use of neural networks for variable selection in QSAR , 1993 .

[34]  George M. Whitesides,et al.  Feed-Forward Neural Networks in Chemistry: Mathematical Systems for Classification and Pattern Recognition , 1994 .

[35]  D. Opitz,et al.  Popular Ensemble Methods: An Empirical Study , 1999, J. Artif. Intell. Res..

[36]  Frank R. Burden,et al.  Use of Automatic Relevance Determination in QSAR Studies Using Bayesian Neural Networks , 2000, J. Chem. Inf. Comput. Sci..

[37]  D. Maddalena,et al.  Prediction of receptor properties and binding affinity of ligands to benzodiazepine/GABAA receptors using artificial neural networks. , 1995, Journal of medicinal chemistry.

[38]  Y. L. Loukas,et al.  Adaptive neuro-fuzzy inference system: an instant and architecture-free predictor for improved QSAR studies. , 2001, Journal of medicinal chemistry.

[39]  F. Burden,et al.  Robust QSAR models using Bayesian regularized neural networks. , 1999, Journal of medicinal chemistry.

[40]  F. Girosi,et al.  Networks for approximation and learning , 1990, Proc. IEEE.

[41]  Johann Gasteiger,et al.  Neural Networks for Chemists: An Introduction , 1993 .

[42]  David W. Opitz,et al.  An Empirical Evaluation of Bagging and Boosting , 1997, AAAI/IAAI.

[43]  D. Agrafiotis,et al.  Feature selection for structure-activity correlation using binary particle swarms. , 2002, Journal of medicinal chemistry.

[44]  Jie Zhang,et al.  Developing robust non-linear models through bootstrap aggregated neural networks , 1999, Neurocomputing.

[45]  M Karplus,et al.  Evolutionary optimization in quantitative structure-activity relationship: an application of genetic neural networks. , 1996, Journal of medicinal chemistry.