Use of Classification Regression Tree in Predicting Oral Absorption in Humans

The purpose of this study is to explore the use of classification regression trees (CART) in predicting, in the dose-independent range, the fraction dose absorbed in humans. Since the results from clinical formulations in humans were used for training the model, a hypothetical state of drug molecules already dissolved in the intestinal fluid was adopted. Therefore, the molecular attributes affecting dissolution were not considered in the model. As a result, the model projects the highest achievable fraction dose absorbed, providing a reference point for manipulating the formulations or solid states to optimize oral clinical efficacy. A set of approximately 1260 structures and their human oral pharmacokinetic data, including bioavailability and/or absorption and/or radio-labeled studies, were used, with 899 compounds as the training set and 362 the test set. The numerical range of the fraction dose absorbed, 0 to 1, was divided into 6 classes with each class having a size of approximately 0.16. A set of 28 structural descriptors was used for modeling oral absorption without considering active transport. Then, a separate branch was created for modeling oral absorption involving active transport. The AAE of the training set was 0.12 and those of five test sets ranged from 0.17 to 0.2. In terms of classification, two test sets of unpublished, proprietary compounds showed 79% to 86% prediction when the predicted values fallen within +/- one class of real values were considered predicted. Overall, the computational errors from all the test sets of diverse structures were similar and reasonably acceptable. As compared to artificial membranes for ranking drug absorption potential, prediction by the CART model is considered fast and reasonably accurate for accelerating drug discovery. One can not only improve continuously the accuracy of CART computations by expanding the chemical space of the training set but also calculate the statistical errors associated with individual decision paths resulting from the training set to determine whether to accept individual computations of any test sets.

[1]  Veerabahu Shanmugasundaram,et al.  Estimation of Aqueous Solubility of Organic Compounds with QSPR Approach , 2004, Pharmaceutical Research.

[2]  G. W. Wheland,et al.  Advanced Organic Chemistry , 1951, Nature.

[3]  Peter C. Jurs,et al.  Prediction of Aqueous Solubility of Heteroatom‐Containing Organic Compounds from Molecular Structure. , 2001 .

[4]  G. Amidon,et al.  Gastrointestinal Transport of Peptide and Protein Drugs and Prodrugs , 1994 .

[5]  Gordon M. Crippen,et al.  Prediction of Physicochemical Parameters by Atomic Contributions , 1999, J. Chem. Inf. Comput. Sci..

[6]  Kin-Kai Hwang,et al.  A comparative study of artificial membrane permeability assay for high throughput profiling of drug absorption potential. , 2002, European journal of medicinal chemistry.

[7]  M. Hümpel,et al.  Pharmacokinetics of proterguride in rat and cynomolgus monkey. , 1988, Xenobiotica; the fate of foreign compounds in biological systems.

[8]  A. Doherty,et al.  Structure-activity relationships in a series of orally active gamma-hydroxy butenolide endothelin antagonists. , 1997, Journal of medicinal chemistry.

[9]  A. Molla,et al.  Discovery of ritonavir, a potent inhibitor of HIV protease with high oral bioavailability and clinical efficacy. , 1998, Journal of medicinal chemistry.

[10]  E. Roth,et al.  Predicting stroke inpatient rehabilitation outcome using a classification tree approach. , 1994, Archives of physical medicine and rehabilitation.

[11]  G. Amidon,et al.  Characterization of the Oral Absorption of β-Lactam Antibiotics. I. Cephalosporins: Determination of Intrinsic Membrane Absorption Parameters in the Rat Intestine In Situ , 1988, Pharmaceutical Research.

[12]  H. Kim,et al.  Dose-dependent pharmacokinetics of a new neuroprotective agent for ischemia-reperfusion damage, KR-31378, in rats. , 2000, Biopharmaceutics & drug disposition.

[13]  Ismael J. Hidalgo,et al.  The Role of an α‐Amino Group on H+‐dependent Transepithelial Transport of Cephalosporins in Caco‐2 Cells , 1999 .

[14]  J. Mair,et al.  A decision tree for the early diagnosis of acute myocardial infarction in nontraumatic chest pain patients at hospital admission. , 1995, Chest.

[15]  R. Dixon,et al.  Discovery of TBC11251, a potent, long acting, orally active endothelin receptor-A selective antagonist. , 1997, Journal of Medicinal Chemistry.

[16]  R. Conradi,et al.  The relationship between peptide structure and transport across epithelial cell monolayers , 1992 .

[17]  P. Selzer,et al.  Fast calculation of molecular polar surface area as a sum of fragment-based contributions and its application to the prediction of drug transport properties. , 2000, Journal of medicinal chemistry.

[18]  John Bradshaw,et al.  Identification of Biological Activity Profiles Using Substructural Analysis and Genetic Algorithms , 1998, J. Chem. Inf. Comput. Sci..

[19]  Shaomeng Wang,et al.  Estimation of aqueous solubility of organic molecules by the group contribution approach. Application to the study of biodegradation , 1992, J. Chem. Inf. Comput. Sci..

[20]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. , 2001, Advanced drug delivery reviews.

[21]  M. Merino Sanjuán,et al.  Intestinal transport of cefuroxime axetil in rats: absorption and hydrolysis processes. , 2002, International journal of pharmaceutics.

[22]  P. Gros,et al.  Lipophilic cations: a group of model substrates for the multidrug-resistance transporter. , 1992, Biochemistry.

[23]  H. van de Waterbeemd,et al.  Property-based design: optimization of drug absorption and pharmacokinetics. , 2001, Journal of medicinal chemistry.

[24]  G. Amidon,et al.  Passive and Carrier-Mediated Intestinal Absorption Components of Two Angiotensin Converting Enzyme (ACE) Inhibitor Prodrugs in Rats: Enalapril and Fosinopril , 1989, Pharmaceutical Research.

[25]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[26]  Luhua Lai,et al.  A New Atom-Additive Method for Calculating Partition Coefficients , 1997, J. Chem. Inf. Comput. Sci..