Optimization of Steam Injection for Heavy Oil Reservoirs Using Reinforcement Learning