A framework based on multivariate distribution-based virtual sample generation and DNN for predicting water quality with small data