Exploring Facial Expression Recognition through Semi-Supervised Pretraining and Temporal Modeling