Inferring the Structure of Action Movies
暂无分享,去创建一个
Cordelia Schmid | Danila Potapov | Matthijs Douze | Zaïd Harchaoui | Jérôme Revaud | C. Schmid | M. Douze | Z. Harchaoui | Jérôme Revaud | D. Potapov | Matthijs Douze
[1] Thomas S. Huang,et al. Exploring video structure beyond the shots , 1998, Proceedings. IEEE International Conference on Multimedia Computing and Systems (Cat. No.98TB100241).
[2] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.
[3] Rémi Ronfard,et al. A framework for aligning and indexing movies with their script , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).
[4] Ying Li,et al. Content-based movie analysis and indexing based on audiovisual cues , 2004, IEEE Transactions on Circuits and Systems for Video Technology.
[5] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .
[6] Blake Snyder. Save the Cat: The Last Book on Screenwriting You'll Ever Need , 2005 .
[7] Andrew Zisserman,et al. Hello! My name is... Buffy'' -- Automatic Naming of Characters in TV Video , 2006, BMVC.
[8] Bertrand Chupeau,et al. A Video Fingerprint Based on Visual Digest and Local Fingerprints , 2006, 2006 International Conference on Image Processing.
[9] Ronald W. Schafer,et al. Introduction to Digital Speech Processing , 2007, Found. Trends Signal Process..
[10] Patrick Pérez,et al. Retrieving actions in movies , 2007, 2007 IEEE 11th International Conference on Computer Vision.
[11] Ben Taskar,et al. Movie/Script: Alignment and Parsing of Video and Text Transcription , 2008, ECCV.
[12] Luc Van Gool,et al. Exemplar-based Action Recognition in Video , 2009, BMVC.
[13] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.
[14] C. Schmid,et al. Actions in context , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[15] Thomas Mensink,et al. Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.
[16] Georges Quénot,et al. TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.
[17] Luc Van Gool,et al. A Hough transform-based voting framework for action recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[18] Thomas Serre,et al. HMDB: A large video database for human motion recognition , 2011, 2011 International Conference on Computer Vision.
[19] Fernando De la Torre,et al. Joint segmentation and classification of human actions in video , 2011, CVPR 2011.
[20] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[21] Fernando De la Torre,et al. Max-Margin Early Event Detectors , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[22] Mubarak Shah,et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.
[23] Cordelia Schmid,et al. Action and Event Recognition with Fisher Vectors on a Compact Feature Set , 2013, 2013 IEEE International Conference on Computer Vision.
[24] Cordelia Schmid,et al. Temporal Localization of Actions with Actoms. , 2013, IEEE transactions on pattern analysis and machine intelligence.
[25] Cordelia Schmid,et al. Finding Actors and Actions in Movies , 2013, 2013 IEEE International Conference on Computer Vision.
[26] Leonid Sigal,et al. Poselet Key-Framing: A Model for Human Activity Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[27] Cordelia Schmid,et al. The AXES submissions at TRECVID 2013 , 2013, TRECVID.
[28] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.
[29] Andrew Zisserman,et al. Fisher Vector Faces in the Wild , 2013, BMVC.
[30] C. Schmid,et al. Category-Specific Video Summarization , 2014, ECCV.
[31] Cordelia Schmid,et al. The LEAR submission at Thumos 2014 , 2014 .
[32] Mohammad Soleymani,et al. VSD, a public dataset for the detection of violent scenes in movies: design, annotation, analysis and evaluation , 2014, Multimedia Tools and Applications.
[33] Trevor Darrell,et al. DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.
[34] Kristen Grauman,et al. Efficient Activity Detection in Untrimmed Video with Max-Subgraph Search , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.