Reinforcement Learning For Field Development Policy Optimization