Bidirectional Reinforcement Learning for Asset Allocation