A Deep Reinforcement Learning Approach to Rare Event Estimation