Deep hierarchical reinforcement learning to manage the trade-off between sustainability and profitability in common pool resources systems