Automatic collective motion tuning using actor-critic deep reinforcement learning