DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization