Pupil dilation and response slowing distinguish deliberate explorative choices in the probabilistic learning task