AutoCAT: Reinforcement Learning for Automated Exploration of Cache-Timing Attacks