Bridging images and natural language with deep learning