Stable and Efficient Reinforcement Learning Method for Avoidance Driving of Unmanned Vehicles