Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks