Managing chance-constrained hydropower with reinforcement learning and backoffs