How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?