Multi-scale relation reasoning for multi-modal Visual Question Answering
