Deep Visual-Semantic Alignments for Generating Image Descriptions
暂无分享,去创建一个
We present a model that generates natural language descriptions of images and their regions. Our approach leverages datasets of images and their sentence descriptions to learn about the inter-modal...