Energy-efficient VM scheduling based on deep reinforcement learning