Imitation Learning from Visual Data with Multiple Intentions