Adaptable Claim Rewriting with Offline Reinforcement Learning for Effective Misinformation Discovery