Initial investigation of UAV swarm behaviors in a search-and-rescue scenario using reinforcement learning