Using the proximal policy optimisation algorithm for solving the stochastic capacitated lot sizing problem