Inflate and Shrink: Enriching and Reducing Interactions for Fast Text-Image Retrieval