A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets