LocVTP: Video-Text Pre-training for Temporal Localization