论文信息 - A Theoretical Understanding of shallow Vision Transformers: Learning, Generalization, and Sample Complexity - 字舞流文

A Theoretical Understanding of shallow Vision Transformers: Learning, Generalization, and Sample Complexity

M. Wang | Sijia Liu | Pin-Yu Chen | Hongkang Li