RLTD: A reinforcement learning-based truth data discovery scheme for decision support systems under sustainable environments