Nonprehensile Planar Manipulation through Reinforcement Learning with Multimodal Categorical Exploration