论文信息 - Domain-Adaptive Discriminative One-Shot Learning of Gestures - 字舞流文

Domain-Adaptive Discriminative One-Shot Learning of Gestures

The objective of this paper is to recognize gestures in videos – both localizing the gesture and classifying it into one of multiple classes.

Andrew Zisserman | Tomas Pfister | James Charles | Andrew Zisserman | Tomas Pfister | James Charles

[1] Alexei A. Efros,et al. Ensemble of exemplar-SVMs for object detection and beyond , 2011, 2011 International Conference on Computer Vision.

[2] Andrew Blake,et al. "GrabCut" , 2004, ACM Trans. Graph..

[3] Danica Kragic,et al. The Path Kernel , 2013, ICPRAM.

[4] Wei Li,et al. One-shot learning gesture recognition from RGB-D data using bag of features , 2013, J. Mach. Learn. Res..

[5] Jean Ponce,et al. Automatic annotation of human actions in video , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[6] Fernando De la Torre,et al. Generalized time warping for multi-modal alignment of human motion , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[7] Jitendra Malik,et al. Discriminative Decorrelation for Clustering and Classification , 2012, ECCV.

[8] Ali Farhadi,et al. Transfer Learning in Sign language , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[9] Andrew Zisserman,et al. Learning sign language by watching TV (using weakly aligned subtitles) , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[10] Cordelia Schmid,et al. Finding Actors and Actions in Movies , 2013, 2013 IEEE International Conference on Computer Vision.

[11] Marco Cuturi,et al. Fast Global Alignment Kernels , 2011, ICML.

[12] Marie-Pierre Jolly,et al. Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in N-D Images , 2001, ICCV.

[13] S. Chiba,et al. Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[14] Andrew Zisserman,et al. Domain Adaptation for Upper Body Pose Tracking in Signed TV Broadcasts , 2013, BMVC.

[15] Guang Li,et al. Sign Language Recognition and Translation with Kinect , 2013 .

[16] Isabelle Guyon,et al. ChaLearn gesture challenge: Design and first results , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[17] Cordelia Schmid,et al. A time series kernel for action recognition , 2011, BMVC.

[18] Isabelle Guyon,et al. Results and Analysis of the ChaLearn Gesture Challenge 2012 , 2012, WDIA.

[19] Richard Bowden,et al. Learning signs from subtitles: A weakly supervised approach to sign language recognition , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[20] Andrew Zisserman,et al. Large-scale Learning of Sign Language by Watching TV (Using Co-occurrences) , 2013, BMVC.

[21] Sudeep Sarkar,et al. Similarity Measure between Two Gestures Using Triplets , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[22] Matthieu Guillaumin,et al. Segmentation Propagation in ImageNet , 2012, ECCV.

[23] Hanqing Lu,et al. Fusing multi-modal features for gesture recognition , 2013, ICMI '13.

[24] Tomoko Matsui,et al. A Kernel for Time Series Based on Global Alignments , 2006, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[25] Charles Markham,et al. Weakly Supervised Training of a Sign Language Recognition System Using Multiple Instance Learning Density Matrices , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[26] Andrew Zisserman,et al. Automatic and Efficient Long Term Arm and Hand Tracking for Continuous Sign Language TV Broadcasts , 2012, BMVC.

[27] Martial Hebert,et al. Event Detection in Crowded Videos , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[28] Sudeep Sarkar,et al. Finding recurrent patterns from continuous sign language sentences for automated extraction of signs , 2012, J. Mach. Learn. Res..

[29] Shigeki Sagayama,et al. Dynamic Time-Alignment Kernel in Support Vector Machine , 2001, NIPS.

[30] Marie-Pierre Jolly,et al. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[31] Mubarak Shah,et al. Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32] Andrew Zisserman,et al. Automatic and Efficient Human Pose Estimation for Sign Language Videos , 2013, International Journal of Computer Vision.

[33] Giorgio Metta,et al. Keep it simple and sparse: real-time action recognition , 2013, J. Mach. Learn. Res..

[34] Sergio Escalera,et al. ChaLearn multi-modal gesture recognition 2013: grand challenge and workshop summary , 2013, ICMI '13.