Scene-Text Aware Image and Text Retrieval with Dual-Encoder