Minimax Value Interval for Off-Policy Evaluation and Policy Optimization