SeeHear: Signer Diarisation and a New Dataset
暂无分享,去创建一个
Gül Varol | Andrew Zisserman | Bencie Woll | Triantafyllos Afouras | Samuel Albanie | Ben Saunders | Richard Bowden | Liliane Momeni | Necati Cihan Camgöz | Andrew Brown | Chuhan Zhang | Ernesto Coto | Abhishek Dutta | Neil Fox | Andrew Zisserman | R. Bowden | Triantafyllos Afouras | Samuel Albanie | B. Woll | Gül Varol | Liliane Momeni | Neil Fox | Abhishek Dutta | Chuhan Zhang | Ben Saunders | Ernesto Coto | Andrew Brown
[1] Hermann Ney,et al. Neural Sign Language Translation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[2] Yang Liu,et al. Synthetically Supervised Feature Learning for Scene Text Recognition , 2018, ECCV.
[3] Roee Aharoni,et al. Real-Time Sign Language Detection using Human Pose Estimation , 2020, ECCV Workshops.
[4] Andrew Zisserman,et al. Seeing wake words: Audio-visual Keyword Spotting , 2020, BMVC.
[5] J. Coates,et al. Turn‐taking patterns in deaf conversation , 2001 .
[6] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[7] Shuo Yang,et al. WIDER FACE: A Face Detection Benchmark , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[8] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[9] Andrew Zisserman,et al. Watch, read and lookup: learning to spot signs from multiple supervisors , 2020, ACCV.
[10] Guanghan Ning,et al. LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[11] Meredith Ringel Morris,et al. Sign Language Recognition, Generation, and Translation: An Interdisciplinary Perspective , 2019, ASSETS.
[12] Yuxiao Hu,et al. MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition , 2016, ECCV.
[13] Bo Chen,et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.
[14] Xin Yu,et al. Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison , 2020, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).
[15] Themos Stafylakis,et al. Zero-shot keyword spotting for visual speech recognition in-the-wild , 2018, ECCV.
[16] Hermann Ney,et al. Continuous sign language recognition: Towards large vocabulary statistical recognition systems handling multiple signers , 2015, Comput. Vis. Image Underst..
[17] Peter Wittenburg,et al. Automatic Signer Diarization - The Mover Is the Signer Approach , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.
[18] Enhua Wu,et al. Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[19] N. Schiavetti,et al. Temporal characteristics of speech in simultaneous communication. , 1995, Journal of speech and hearing research.
[20] Oscar Koller,et al. Quantitative Survey of the State of the Art in Sign Language Recognition , 2020, ArXiv.
[21] Stefanos Zafeiriou,et al. RetinaFace: Single-stage Dense Face Localisation in the Wild , 2019, ArXiv.
[22] Andrew Zisserman,et al. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[23] R. Wilbur,et al. Modality interactions of speech and signing in simultaneous communication. , 1998, Journal of speech, language, and hearing research : JSLHR.
[24] Myle Ott,et al. Facebook FAIR’s WMT19 News Translation Task Submission , 2019, WMT.