Cross-Lingual Supervision improves Large Language Models Pre-training