Dynamic metasurface control using Deep Reinforcement Learning