Cost-Efficient Reinforcement Learning for Optimal Trade Execution on Dynamic Market Environment