Safe resource management of non-cooperative microgrids based on deep reinforcement learning