论文信息 - Cross-modal supervision for learning active speaker detection in video - 字舞流文

Cross-modal supervision for learning active speaker detection in video

Cees Snoek | Tinne Tuytelaars | Amir Ghodrati | Zhenyang Li | Roeland De Geest | Stratis Gavves