Robust Facial Expression Recognition with Convolutional Visual Transformers