Improved Naïve Bayesian Modeling of Numerical Data for Absorption, Distribution, Metabolism and Excretion (ADME) Property Prediction

We have implemented a naïve Bayesian classifier that models continuous numerical data using a Gaussian distribution. We present several cases of interest in absorption, distribution, metabolism, and excretion (ADME) prediction which demonstrate that this approach is superior to naïve Bayesian implementations in which continuous chemical descriptors are modeled as binary data. We show that this improvement over other implementations is independent of the descriptor set chosen. We also compare the performance of three naïve Bayesian classifier implementations with that of other previously described models.
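
To make the central idea concrete, the following is a minimal sketch of a Gaussian naïve Bayesian classifier, not the implementation evaluated in this work. It assumes one Gaussian per descriptor per class, fitted by maximum likelihood, with a small variance floor for numerical stability. The function names, the `var_floor` parameter, the two descriptors, and the toy values are all hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(X, y, var_floor=1e-9):
    """Estimate a log prior and per-descriptor mean/variance for each class.

    var_floor is an illustrative assumption: it keeps variances positive
    when a descriptor is constant within a class.
    """
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)

    model = {}
    n_total = len(X)
    for label, rows in by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        variances = [
            sum((v - m) ** 2 for v in col) / n + var_floor
            for col, m in zip(columns, means)
        ]
        model[label] = (math.log(n / n_total), means, variances)
    return model


def log_posteriors(model, x):
    """Return the unnormalized log posterior of each class for one sample."""
    scores = {}
    for label, (log_prior, means, variances) in model.items():
        score = log_prior
        for v, m, s2 in zip(x, means, variances):
            # Log of the Gaussian density; descriptors are treated as
            # conditionally independent given the class (the naive assumption).
            score += -0.5 * math.log(2 * math.pi * s2) - (v - m) ** 2 / (2 * s2)
        scores[label] = score
    return scores


def predict(model, x):
    """Pick the class with the highest log posterior."""
    scores = log_posteriors(model, x)
    return max(scores, key=scores.get)


# Made-up toy data: two continuous descriptors per compound, two classes.
X = [[1.0, 200.0], [1.5, 220.0], [1.2, 210.0],
     [4.0, 500.0], [4.5, 520.0], [4.2, 480.0]]
y = ["high", "high", "high", "low", "low", "low"]

model = fit_gaussian_nb(X, y)
print(predict(model, [1.1, 205.0]))
```

Working in log space avoids numerical underflow when many descriptors are multiplied together. A binary-descriptor variant would instead threshold or bin each continuous value before estimating class-conditional probabilities, discarding the information about how far a value lies from the class mean.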