Unified Speech-Text Pre-training for Speech Translation and Recognition