Estimating the domain of applicability for machine learning QSAR models: a study on aqueous solubility of drug discovery molecules