Fitted Policy Search: Direct Policy Search using a Batch Reinforcement Learning Approach