Adaptive Trade-Offs in Off-Policy Learning