Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping