Injecting Image Details into CLIP's Feature Space