Simulation-based optimization of Markov decision processes