Vision and Language: Bridging Vision and Language with Deep Learning