Consequences of sample size, variable selection, and model validation and optimisation, for predicting classification ability from analytical data