Understanding Video Transformers for Segmentation: A Survey of Application and Interpretability