Reinforced Deterministic and Probabilistic Load Forecasting via $Q$ -Learning Dynamic Model Selection