Video Grounding and Its Generalization