论文信息 - Video-LLaVA: Learning United Visual Representation by Alignment Before Projection - 字舞流文

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Peng Jin | Munan Ning | Bin Lin | Yang Ye | Bin Zhu | Li Yuan