The INRIA-LIM-VocR and AXES submissions to TrecVid 2014 Multimedia Event Detection
Cordelia Schmid | Danila Potapov | Matthijs Douze | Jean-Luc Gauvain | Lori Lamel | Dan Oneaţă | Karteek Alahari | Zaïd Harchaoui | Nicolas Chesneau | Jakob Verbeek | Mattis Paulin | Clement Leray | Christoph Andreas Schmidt
[1] Frédéric Jurie,et al. Modeling spatial layout with Fisher vectors for image categorization , 2011, 2011 International Conference on Computer Vision.
[2] Hynek Hermansky. Perceptual linear predictive (PLP) analysis of speech , 1990, The Journal of the Acoustical Society of America.
[3] Thomas Mensink,et al. Image Classification with the Fisher Vector: Theory and Practice , 2013, International Journal of Computer Vision.
[4] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.
[5] Cordelia Schmid,et al. Label-Embedding for Attribute-Based Classification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[6] Lori Lamel. Multilingual Speech Processing Activities in Quaero: Application to Multimedia Search in Unstructured Data , 2012, Baltic HLT.
[7] Jiri Matas,et al. Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput.
[8] Joakim Andén,et al. Multiscale Scattering for Audio Classification , 2011, ISMIR.
[9] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[10] Georges Quénot,et al. TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.
[11] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun..
[12] Pavel Matejka,et al. Towards Lower Error Rates in Phoneme Recognition , 2004, TSD.
[13] Thomas Serre,et al. HMDB: A large video database for human motion recognition , 2011, 2011 International Conference on Computer Vision.
[14] Jean-Luc Gauvain,et al. Speech Processing for Audio Indexing , 2008, GoTAL.
[15] Jean-Luc Gauvain,et al. On the Use of MLP Features for Broadcast News Transcription , 2008, TSD.
[16] Cordelia Schmid,et al. Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.
[17] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.
[18] Andreas Stolcke,et al. Using MLP features in SRI's conversational speech recognition system , 2005, INTERSPEECH.
[19] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.
[20] Jean-Luc Gauvain,et al. Partitioning and transcription of broadcast news data , 1998, ICSLP.
[21] Gabriela Csurka,et al. Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval , 2008, CLEF.
[22] David G. Lowe. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .
[23] Cordelia Schmid,et al. The AXES submissions at TRECVID 2013 , 2013, TRECVID.
[24] Cordelia Schmid,et al. AXES at TRECVID 2012: KIS, INS, and MED , 2012, TRECVID.
[25] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.
[26] Cordelia Schmid,et al. Action and Event Recognition with Fisher Vectors on a Compact Feature Set , 2013, 2013 IEEE International Conference on Computer Vision.
[27] Jean-Luc Gauvain,et al. Rapid development of a Latvian speech-to-text system , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.