A Simulation Environment and Reinforcement Learning Method for Waste Reduction