STOPS: Short-Term-based Volatility-controlled Policy Search and its Global Convergence