论文信息 - Character Detection in Animation Movies Using Multi-Style Adaptation and Visual Attention - 字舞流文

Character Detection in Animation Movies Using Multi-Style Adaptation and Visual Attention

In-Kwon Lee | Dong-Hyuck Im | Ha Yeon Kim | Eun Cheol Lee | Yong-Seok Seo

[1] Florian Heimerl,et al. Visual Movie Analytics , 2016, IEEE Transactions on Multimedia.

[2] Abhishek Das,et al. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[3] Nuno Vasconcelos,et al. Towards Universal Object Detection by Domain Attention , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5] Jean-Christophe Burie,et al. Multi-task Model for Comic Book Image Analysis , 2019, MMM.

[6] Jürgen Beyerer,et al. Fast Deep Vehicle Detection in Aerial Images , 2017, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[7] Rob Fergus,et al. Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[8] Changsheng Xu,et al. Robust Face-Name Graph Matching for Movie Character Identification , 2012, IEEE Transactions on Multimedia.

[9] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[10] Yongdong Zhang,et al. Enhancing Video Event Recognition Using Automatically Constructed Semantic-Visual Knowledge Base , 2015, IEEE Transactions on Multimedia.

[11] Michèle Sebag,et al. Multi-Domain Adversarial Learning , 2019, ICLR.

[12] Nuno Vasconcelos,et al. Cascade R-CNN: Delving Into High Quality Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[13] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[14] Arash Vahdat,et al. A Robust Learning Approach to Domain Adaptive Object Detection , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[15] Xiaogang Wang,et al. Residual Attention Network for Image Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Andrea Vedaldi,et al. Universal representations: The missing link between faces, text, planktons, and cat breeds , 2017, ArXiv.

[17] Christopher D. Manning,et al. Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[18] Yi Yang,et al. Attention to Scale: Scale-Aware Semantic Image Segmentation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Patrick Lambert,et al. Animation movies trailer computation , 2006, MM '06.

[20] Paul L. Rosin,et al. Visual Sentiment Prediction Based on Automatic Discovery of Affective Regions , 2018, IEEE Transactions on Multimedia.

[21] Lifeng Sun,et al. A Matrix-Based Approach to Unsupervised Human Action Categorization , 2012, IEEE Transactions on Multimedia.

[22] Yi Li,et al. R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.

[23] Tanaya Guha,et al. Unsupervised Discovery of Character Dictionaries in Animation Movies , 2018, IEEE Transactions on Multimedia.

[24] Enhua Wu,et al. Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[26] Ross B. Girshick,et al. Mask R-CNN , 2017, 1703.06870.

[27] Ronald A. Rensink. The Dynamic Representation of Scenes , 2000 .

[28] Wei-Ta Chu,et al. Manga FaceNet: Face Detection in Manga based on Deep Neural Network , 2017, ICMR.

[29] Ross B. Girshick,et al. Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30] Xiaogang Wang,et al. Learning Deep Feature Representations with Domain Guided Dropout for Person Re-identification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31] Ali Farhadi,et al. You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32] Tobias Egner,et al. Feature-Based Attention and Feature-Based Expectation , 2016, Trends in Cognitive Sciences.

[33] Kiyoharu Aizawa,et al. Cross-Domain Weakly-Supervised Object Detection Through Progressive Domain Adaptation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[34] Shih-Fu Chang,et al. Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification , 2017, IEEE Transactions on Multimedia.

[35] Jorma Laaksonen,et al. Content-Based Prediction of Movie Style, Aesthetics, and Affect: Data Set and Baseline Experiments , 2014, IEEE Transactions on Multimedia.

[36] Shifeng Zhang,et al. Single-Shot Refinement Neural Network for Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[37] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[38] Hongan Wang,et al. An Interactive SpiralTape Video Summarization , 2016, IEEE Transactions on Multimedia.

[39] Dhruv Batra,et al. Human Attention in Visual Question Answering: Do Humans and Deep Networks look at the same regions? , 2016, EMNLP.

[40] Kin K. Leung,et al. Cloud-Based Actor Identification With Batch-Orthogonal Local-Sensitive Hashing and Sparse Representation , 2016, IEEE Transactions on Multimedia.

[41] Luc Van Gool,et al. Domain Adaptive Faster R-CNN for Object Detection in the Wild , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42] Zhi Tang,et al. A Faster R-CNN Based Method for Comic Characters Face Detection , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[43] Liang Lin,et al. Is Faster R-CNN Doing Well for Pedestrian Detection? , 2016, ECCV.

[44] Bolei Zhou,et al. Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45] Jinhui Tang,et al. CAD: Scale Invariant Framework for Real-Time Object Detection , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[46] Yunde Jia,et al. Content-Attention Representation by Factorized Action-Scene Network for Action Recognition , 2018, IEEE Transactions on Multimedia.

[47] R. Desimone,et al. Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.

[48] Bohyung Han,et al. Learning Multi-domain Convolutional Neural Networks for Visual Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49] Petros Maragos,et al. Multimodal Saliency and Fusion for Movie Summarization Based on Aural, Visual, and Textual Attention , 2013, IEEE Transactions on Multimedia.

[50] Jiebo Luo,et al. Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection , 2012, IEEE Transactions on Multimedia.

[51] Mubarak Shah,et al. Face Recognition in Movie Trailers via Mean Sequence Sparse Representation-Based Classification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[52] Andrea Vedaldi,et al. Learning multiple visual domains with residual adapters , 2017, NIPS.

[53] Yi Li,et al. Deformable Convolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[54] Delbert Dueck,et al. Clustering by Passing Messages Between Data Points , 2007, Science.

[55] Ross B. Girshick,et al. Fast R-CNN , 2015, 1504.08083.

[56] Jian Sun,et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[57] Maneesh Singh,et al. Progressive Domain Adaptation for Object Detection , 2019, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[58] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[59] Antonio Torralba,et al. Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[60] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[61] Huaizu Jiang,et al. Face Detection with the Faster R-CNN , 2016, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[62] Li Chen,et al. Visualization-Based Active Learning for Video Annotation , 2016, IEEE Transactions on Multimedia.

[63] Wei Liu,et al. SSD: Single Shot MultiBox Detector , 2015, ECCV.

[64] Thomas Brox,et al. Striving for Simplicity: The All Convolutional Net , 2014, ICLR.

[65] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.