Optimisation and Control of Fed-Batch Production using Q-Learning