Disturbance Observable Reinforcement Learning that Compensates for Changes in Environment