Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data