论文信息 - AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR - 字舞流文

AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR

C. Schmid | Arsha Nagrani | P. H. Seo