Interbank Market Formation through Reinforcement Learning and Risk Aversion