Rich Visual and Language Representation with Complementary Semantics for Video Captioning
暂无分享,去创建一个
It is interesting and challenging to translate a video to natural description sentences based on the video content. In this work, an advanced framework is built to generate sentences with coherence...