Boosted regression trees, multivariate adaptive regression splines and their two-step combinations with multiple linear regression or partial least squares to predict blood-brain barrier passage: a case study.

The use of some unconventional non-linear modeling techniques, i.e. classification and regression trees and multivariate adaptive regression splines-based methods, was explored to model the blood-brain barrier (BBB) passage of drugs and drug-like molecules. The data set contains BBB passage values for 299 structural and pharmacological diverse drugs, originating from a structured knowledge-based database. Models were built using boosted regression trees (BRT) and multivariate adaptive regression splines (MARS), as well as their respective combinations with stepwise multiple linear regression (MLR) and partial least squares (PLS) regression in two-step approaches. The best models were obtained using combinations of MARS with either stepwise MLR or PLS. It could be concluded that the use of combinations of a linear with a non-linear modeling technique results in some improved properties compared to the individual linear and non-linear models and that, when the use of such a combination is appropriate, combinations using MARS as non-linear technique should be preferred over those with BRT, due to some serious drawbacks of the BRT approaches.

[1]  Y Vander Heyden,et al.  Evaluation of chromatographic descriptors for the prediction of gastro-intestinal absorption of drugs. , 2007, Journal of chromatography. A.

[2]  Harpreet S. Chadha,et al.  Hydrogen bonding. 33. Factors that influence the distribution of solutes between blood and brain. , 1994, Journal of pharmaceutical sciences.

[3]  D L Massart,et al.  Multivariate adaptive regression splines (MARS) in chromatographic quantitative structure-retention relationship studies. , 2004, Journal of chromatography. A.

[4]  D L Massart,et al.  Classification and regression tree analysis for molecular descriptor selection and retention prediction in chromatographic quantitative structure-retention relationship studies. , 2003, Journal of chromatography. A.

[5]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[6]  Antonio Chana,et al.  CODES/neural network model: A useful tool for in silico prediction of oral absorption and blood-brain barrier permeability of structurally diverse drugs , 2004 .

[7]  D L Massart,et al.  Classification of drugs in absorption classes using the classification and regression trees (CART) methodology. , 2005, Journal of pharmaceutical and biomedical analysis.

[8]  Johanna Smeyers-Verbeke,et al.  Handbook of Chemometrics and Qualimetrics: Part A , 1997 .

[9]  J. Friedman Multivariate adaptive regression splines , 1990 .

[10]  Yi-Zeng Liang,et al.  Monte Carlo cross validation , 2001 .

[11]  Menghui H. Zhang,et al.  Evaluation of boosted regression trees (BRTs) and two‐step BRT procedures to model and predict blood‐brain barrier passage , 2007 .

[12]  U Norinder,et al.  Theoretical calculation and prediction of brain-blood partitioning of organic solutes using MolSurf parametrization and PLS statistics. , 1998, Journal of pharmaceutical sciences.

[13]  J. Platts,et al.  Correlation and prediction of a large blood-brain distribution data set--an LFER study. , 2001, European journal of medicinal chemistry.

[14]  B Testa,et al.  Predicting blood-brain barrier permeation from three-dimensional molecular structure. , 2000, Journal of medicinal chemistry.

[15]  Douglas B. Kitchen,et al.  Computational models to predict blood–brain barrier permeation and CNS activity , 2003, J. Comput. Aided Mol. Des..

[16]  Eric R. Ziegel,et al.  Handbook of Chemometrics and Qualimetrics, Part B , 2000, Technometrics.

[17]  D. Coomans,et al.  Exploration of linear modelling techniques and their combination with multivariate adaptive regression splines to predict gastro-intestinal absorption of drugs. , 2007, Journal of pharmaceutical and biomedical analysis.

[18]  Roberto Todeschini,et al.  Handbook of Molecular Descriptors , 2002 .

[19]  Harris Drucker,et al.  Improving Regressors using Boosting Techniques , 1997, ICML.

[20]  Yi-Zeng Liang,et al.  Two-step multivariate adaptive regression splines for modeling a quantitative relationship between gas chromatography retention indices and molecular descriptors. , 2003, Journal of chromatography. A.

[21]  Thomas Richardson,et al.  Boosting methodology for regression problems , 1999, AISTATS.

[22]  Bahram Hemmateenejad,et al.  Correlation ranking procedure for factor selection in PC-ANN modeling and application to ADMETox evaluation , 2005 .

[23]  R A Morrison,et al.  Current methodologies used for evaluation of intestinal permeability and absorption. , 2000, Journal of pharmacological and toxicological methods.

[24]  I. Hidalgo,et al.  Assessing the absorption of new pharmaceuticals. , 2001, Current topics in medicinal chemistry.

[25]  Yvan Vander Heyden,et al.  Classification Tree Models for the Prediction of Blood-Brain Barrier Passage of Drugs , 2006, J. Chem. Inf. Model..

[26]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machines , 2002 .

[27]  Ramamurthi Narayanan,et al.  In silico ADME modelling: prediction models for blood-brain barrier permeation using a systematic variable selection method. , 2005, Bioorganic & medicinal chemistry.

[28]  D. E. Clark,et al.  Rapid calculation of polar molecular surface area and its application to the prediction of transport phenomena. 2. Prediction of blood-brain barrier penetration. , 1999, Journal of pharmaceutical sciences.

[29]  Ronald D. Snee,et al.  Validation of Regression Models: Methods and Examples , 1977 .

[30]  Michał J. Markuszewski,et al.  Brain/blood distribution described by a combination of partition coefficient and molecular mass , 1996 .

[31]  D L Massart,et al.  Prediction of gastro-intestinal absorption using multivariate adaptive regression splines. , 2005, Journal of pharmaceutical and biomedical analysis.

[32]  G Mannens,et al.  Strategies for absorption screening in drug discovery and development. , 2001, Current topics in medicinal chemistry.

[33]  M. Abraham,et al.  Air to brain, blood to brain and plasma to brain distribution of volatile organic compounds: linear free energy analyses. , 2006, European journal of medicinal chemistry.