Semantic Video Retrieval using Deep Learning Techniques
暂无分享,去创建一个
[1] Wolfram Burgard,et al. Multimodal deep learning for robust RGB-D object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[2] Naokazu Yokoya,et al. Video Summarization Using Deep Semantic Features , 2016, ACCV.
[3] Heng Tao Shen,et al. Video Captioning With Attention-Based LSTM and Semantic Consistency , 2017, IEEE Transactions on Multimedia.
[4] Benoit Huet,et al. An ontology-based evidential framework for video indexing using high-level multimodal fusion , 2011, Multimedia Tools and Applications.
[5] Xinlei Chen,et al. Microsoft COCO Captions: Data Collection and Evaluation Server , 2015, ArXiv.
[6] Jiwen Lu,et al. Deep Video Hashing , 2017, IEEE Transactions on Multimedia.
[7] Sergey Ioffe,et al. Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.
[8] Heiko Schuldt,et al. IMOTION - A Content-Based Video Retrieval Engine , 2015, MMM.
[9] Ali Farhadi,et al. You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[10] Usman Ghani Khan,et al. Video Retrieval System Using Parallel Multi-Class Recurrent Neural Network Based on Video Description , 2018, 2018 14th International Conference on Emerging Technologies (ICET).
[11] Dumitru Erhan,et al. Show and Tell: Lessons Learned from the 2015 MSCOCO Image Captioning Challenge , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[12] Beng Chin Ooi,et al. Effective deep learning-based multi-modal retrieval , 2015, The VLDB Journal.
[13] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.
[14] Elena Stringa,et al. Image Retrieval by Example: Techniques and Demonstrations , 2001 .
[15] Heng Tao Shen,et al. Attention-based LSTM with Semantic Consistency for Videos Captioning , 2016, ACM Multimedia.
[16] Alex Pentland,et al. Photobook: Content-based manipulation of image databases , 1996, International Journal of Computer Vision.
[17] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[18] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.
[19] Ji Wan,et al. Deep Learning for Content-Based Image Retrieval: A Comprehensive Study , 2014, ACM Multimedia.
[20] Shih-Fu Chang,et al. VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.
[21] Mubarak Shah,et al. Action and Object Detection for TRECVID , 2018, TRECVID.
[22] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[23] Subhashini Venugopalan,et al. Translating Videos to Natural Language Using Deep Recurrent Neural Networks , 2014, NAACL.
[24] TefasAnastasios,et al. Deep convolutional learning for Content Based Image Retrieval , 2018 .
[25] Xiaolin Hu,et al. Recurrent convolutional neural network for object recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[26] Bernd Freisleben,et al. Content-based video retrieval in historical collections of the German Broadcasting Archive , 2016, International Journal on Digital Libraries.
[27] Qiang Wu,et al. A 3D-CNN based video hashing method , 2018, International Conference on Digital Image Processing.
[28] Anastasios Tefas,et al. Deep convolutional learning for Content Based Image Retrieval , 2018, Neurocomputing.
[29] Ross B. Girshick,et al. Mask R-CNN , 2017, 1703.06870.