Evaluating visual "common sense" using fine-grained classification and captioning tasks
暂无分享,去创建一个
Ingo Bax | Roland Memisevic | Waseem Gharbieh | Raghav Goyal | Farzaneh Mahdisoltani | Guillaume Berger | R. Memisevic | I. Bax | W. Gharbieh | F. Mahdisoltani | Raghav Goyal | Guillaume Berger
[1] Abhinav Gupta,et al. What Actions are Needed for Understanding Human Actions in Videos? , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[2] Fabio Viola,et al. The Kinetics Human Action Video Dataset , 2017, ArXiv.
[3] Susanne Westphal,et al. The “Something Something” Video Database for Learning and Evaluating Visual Common Sense , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[4] Andrew Zisserman,et al. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).