Reinforcement learning based metric filtering for evolutionary distance metric learning