Machine-learning-guided library design cycle for directed evolution of enzymes: the effects of training data composition on sequence space exploration