Learning-based attacks in Cyber-Physical Systems: Exploration, Detection, and Control Cost trade-offs