VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders

Siteng Huang | Yachen Kang | Donglin Wang | Xuyang Liu | Honggang Chen