Pre-training strategies and datasets for facial representation learning