Reinforcement Learning-Based UAVs Resource Allocation for Integrated Sensing and Communication (ISAC) System