Channel and spatial attention mechanism for fashion image captioning