文
论文分享
演练场
杂货铺
论文推荐
字
编辑器下载
登录
注册
Linagzhe Yuan
发表
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
pdf
Shih-Fu Chang, Wei-Hong Chuang, Boqing Gong, 2021, NeurIPS.