Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training