Temporal Context Aggregation for Video Retrieval with Contrastive Learning