Hybrid System of Reinforcement Learning and Flocking Control in Multi-robot Domain