Tutorial on Bridging Video and Language with Deep Learning