In silico prediction of volume of distribution in human using linear and nonlinear models on a 669 compound data set.

The prediction of human pharmacokinetics early in the drug discovery cycle has become of paramount importance, aiding candidate selection and benefit-risk assessment. We present herein computational models to predict human volume of distribution at steady state (VD(ss)) entirely from in silico structural descriptors. Using both linear and nonlinear statistical techniques, partial least-squares (PLS), and random forest (RF) modeling, a data set of human VD(ss) values for 669 drug compounds recently published ( Drug Metab. Disp. 2008 , 36 , 1385 - 1405 ) was explored. Descriptors covering 2D and 3D molecular topology, electronics, and physical properties were calculated using MOE and Volsurf+. Model evaluation was accomplished using a leave-class-out approach on nine therapeutic or structural classes. The models were assessed using an external test set of 29 additional compounds. Our analysis generated models, both via a single method or consensus which were able to predict human VD(ss) within geometric mean 2-fold error, a predictive accuracy considered good even for more resource-intensive approaches such as those requiring data generated from studies in multiple animal species.