A Novel Model-Based Reinforcement Learning for Online Anomaly Detection in Smart Power Grid