Synthesis of reinforcement learning, neural networks and PI control applied to a simulated heating coil