Artificial intelligence in the creative industries: a review

This paper reviews the current state of the art in Artificial Intelligence (AI) technologies and applications in the context of the creative industries. A brief background of AI, and specifically Machine Learning (ML) algorithms, is provided, including Convolutional Neural Networks (CNNs), Generative Adversarial Networks (GANs), Recurrent Neural Networks (RNNs) and Deep Reinforcement Learning (DRL). We categorise creative applications into five groups related to how AI technologies are used: i) content creation, ii) information analysis, iii) content enhancement and post-production workflows, iv) information extraction and enhancement, and v) data compression. We critically examine the successes and limitations of this rapidly advancing technology in each of these areas. We further differentiate between the use of AI as a creative tool and its potential as a creator in its own right. We foresee that, in the near future, machine learning-based AI will be adopted widely as a tool or collaborative assistant for creativity. In contrast, we observe that the successes of machine learning in domains with fewer constraints, where AI is the 'creator', remain modest. The potential of AI (or its developers) to win awards for its original creations in competition with human creatives is also limited, based on contemporary technologies. We therefore conclude that, in the context of creative industries, maximum benefit from AI will be derived where its focus is human-centric, that is, where it is designed to augment, rather than replace, human creativity.


[243]  Jason Weston,et al.  A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.

[244]  Fan Zhang,et al.  Gan-Based Effective Bit Depth Adaptation for Perceptual Video Compression , 2020, 2020 IEEE International Conference on Multimedia and Expo (ICME).

[245]  Yong-Sheng Chen,et al.  Pyramid Stereo Matching Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[246]  Quoc V. Le,et al.  Recurrent Neural Networks for Noise Reduction in Robust ASR , 2012, INTERSPEECH.

[247]  Saeid Nahavandi,et al.  Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications , 2018, IEEE Transactions on Cybernetics.

[248]  Jia Xu,et al.  Learning to See in the Dark , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[249]  Dong Yang,et al.  Proximal Dehaze-Net: A Prior Learning-Based Deep Network for Single Image Dehazing , 2018, ECCV.

[250]  Guang Yang,et al.  SaliencyGAN: Deep Learning Semisupervised Salient Object Detection in the Fog of IoT , 2020, IEEE Transactions on Industrial Informatics.

[251]  Tao Lei,et al.  A review of Convolutional-Neural-Network-based action recognition , 2019, Pattern Recognit. Lett..

[252]  Jan Kautz,et al.  Video-to-Video Synthesis , 2018, NeurIPS.

[253]  Yi Wang,et al.  Scale-Recurrent Network for Deep Image Deblurring , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[254]  R. Manmatha,et al.  Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.

[255]  Pengfei Xiong,et al.  Deep Fusion Network for Image Completion , 2019, ACM Multimedia.

[256]  Aggelos K. Katsaggelos,et al.  Using Deep Neural Networks for Inverse Problems in Imaging: Beyond Analytical Methods , 2018, IEEE Signal Processing Magazine.

[257]  Santanu Chaudhury,et al.  Visual saliency guided video compression algorithm , 2013, Signal Process. Image Commun..

[258]  Qian Chen,et al.  Single infrared image enhancement using a deep convolutional neural network , 2019, Neurocomputing.

[259]  David Bull,et al.  Application of Machine Learning to Classification of Volcanic Deformation in Routinely Generated InSAR Data , 2018, Journal of Geophysical Research: Solid Earth.

[260]  Lei Zhang,et al.  Object-Driven Text-To-Image Synthesis via Adversarial Training , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[261]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[262]  Xinfeng Zhang,et al.  Image and Video Compression With Neural Networks: A Review , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[263]  Shu-Tao Xia,et al.  Second-Order Attention Network for Single Image Super-Resolution , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[264]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[265]  Matthias Nießner,et al.  Intrinsic3D: High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[266]  Kristen Grauman,et al.  2.5D Visual Sound , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[267]  R. Mersereau,et al.  Iterative methods for image deblurring , 1990 .

[268]  Xiaoou Tang,et al.  Learning a Deep Convolutional Network for Image Super-Resolution , 2014, ECCV.

[269]  Jian Sun,et al.  Single image haze removal using dark channel prior , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[270]  Nicholas Peretti,et al.  Rotoscope Automation with Deep Learning , 2020 .

[271]  Seunghoon Hong,et al.  Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[272]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[273]  Yuning Jiang,et al.  MegDet: A Large Mini-Batch Object Detector , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[274]  Dima Damen,et al.  Scaling Egocentric Vision: The Open image in new window Dataset , 2018 .

[275]  Georgios Tzimiropoulos,et al.  Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[276]  Aggelos K. Katsaggelos,et al.  Video Super-Resolution With Convolutional Neural Networks , 2016, IEEE Transactions on Computational Imaging.

[277]  Wendy Hall,et al.  Growing the artificial intelligence industry in the UK , 2017 .

[278]  Philip S. Yu,et al.  Joint Slot Filling and Intent Detection via Capsule Neural Networks , 2018, ACL.

[279]  Fan Zhang,et al.  CVEGAN: A Perceptually-inspired GAN for Compressed Video Enhancement , 2020, ArXiv.

[280]  Leonidas J. Guibas,et al.  ShapeNet: An Information-Rich 3D Model Repository , 2015, ArXiv.

[281]  Khaled Alsaih,et al.  Machine learning techniques for diabetic macular edema (DME) classification on SD-OCT images , 2017, BioMedical Engineering OnLine.

[282]  Farid Saberi Movahed,et al.  Regularizing extreme learning machine by dual locally linear embedding manifold learning for training multi-label neural network classifiers , 2021, Eng. Appl. Artif. Intell..

[283]  Zengchang Qin,et al.  Emotion Classification with Data Augmentation Using Generative Adversarial Networks , 2018, PAKDD.

[284]  Christian Ledig,et al.  Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[285]  Rabab Kreidieh Ward,et al.  Deep learning for pixel-level image fusion: Recent advances and future prospects , 2018, Inf. Fusion.

[286]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[287]  Fan Zhang,et al.  BVI-DVC: A Training Database for Deep Video Compression , 2021, IEEE Transactions on Multimedia.

[288]  Lingfeng Wang,et al.  Local-Aggregation Graph Networks , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[289]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[290]  Dong Liu,et al.  One-for-All: Grouped Variation Network-Based Fractional Interpolation in Video Coding , 2019, IEEE Transactions on Image Processing.

[291]  Yaser Sheikh,et al.  VR facial animation via multiview image translation , 2019, ACM Trans. Graph..

[292]  John Ahmet Erkoyuncu,et al.  A systematic review of augmented reality applications in maintenance , 2018 .

[293]  R. S. Rajesh,et al.  A Deep Convolutional Neural Network Approach for Static Hand Gesture Recognition , 2020, Procedia Computer Science.

[294]  Yongdong Zhang,et al.  Learning Multimodal Attention LSTM Networks for Video Captioning , 2017, ACM Multimedia.

[295]  Thomas S. Huang,et al.  Free-Form Image Inpainting With Gated Convolution , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[296]  Jehee Lee,et al.  Interactive character animation by learning multi-objective control , 2018, ACM Trans. Graph..

[297]  Jan Kotera,et al.  Convolutional Neural Networks for Direct Text Deblurring , 2015, BMVC.

[298]  Jonathan P. Rowe,et al.  Interactive Narrative Personalization with Deep Reinforcement Learning , 2017, IJCAI.

[299]  Han Zhang,et al.  Self-Attention Generative Adversarial Networks , 2018, ICML.

[300]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[301]  Dani Lischinski,et al.  Deep photo: model-based photograph enhancement and viewing , 2008, SIGGRAPH 2008.

[302]  Hendrik P. A. Lensch,et al.  Infrared Colorization Using Deep Convolutional Neural Networks , 2016, 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA).

[303]  Christine Guillemot,et al.  Learning Fused Pixel and Feature-Based View Reconstructions for Light Fields , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[304]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[305]  Mohammed Bennamoun,et al.  Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[306]  Wen Gao,et al.  Enhanced Motion-Compensated Video Coding With Deep Virtual Reference Frame Generation , 2019, IEEE Transactions on Image Processing.

[307]  David R. Bull,et al.  Defectnet: Multi-Class Fault Detection on Highly-Imbalanced Datasets , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[308]  Luc Van Gool,et al.  Crossing Nets: Combining GANs and VAEs with a Shared Latent Space for Hand Pose Estimation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[309]  Vadim Bulitko,et al.  Interactive Narrative: A Novel Application of Artificial Intelligence for Computer Games , 2012, AAAI.

[310]  Dima Damen,et al.  EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[311]  Masakazu Matsugu,et al.  Subject independent facial expression recognition with robust face detection using a convolutional neural network , 2003, Neural Networks.

[312]  Yury Kartynnik,et al.  Real-time Facial Surface Geometry from Monocular Video on Mobile GPUs , 2019, ArXiv.

[313]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[314]  Tom Eccles,et al.  Learning to Play No-Press Diplomacy with Best Response Policy Iteration , 2020, NeurIPS.

[315]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[316]  Ping Tan,et al.  DualGAN: Unsupervised Dual Learning for Image-to-Image Translation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[317]  Fan Zhang,et al.  Enhancing VVC Through Cnn-Based Post-Processing , 2020, 2020 IEEE International Conference on Multimedia and Expo (ICME).

[318]  A. G. Mohapatra,et al.  World of Virtual Reality (VR) in Healthcare , 2019 .

[319]  Thomas Brox,et al.  DeMoN: Depth and Motion Network for Learning Monocular Stereo , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[320]  Xiaoyun Zhang,et al.  DVC: An End-To-End Deep Video Compression Framework , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[321]  Dacheng Tao,et al.  DehazeNet: An End-to-End System for Single Image Haze Removal , 2016, IEEE Transactions on Image Processing.

[322]  Albert Gordo,et al.  Deep Image Retrieval: Learning Global Representations for Image Search , 2016, ECCV.

[323]  Enhong Chen,et al.  Image Denoising and Inpainting with Deep Neural Networks , 2012, NIPS.

[324]  ヒルデブランド、ハロルド・エイ Pitch detection and intonation correction apparatus and method , 1998 .

[325]  Bob L. Sturm,et al.  Music transcription modelling and composition using deep learning , 2016, ArXiv.

[326]  Garrison W. Cottrell,et al.  DeepJ: Style-Specific Music Generation , 2018, 2018 IEEE 12th International Conference on Semantic Computing (ICSC).

[327]  Jaakko Lehtinen,et al.  Noise2Noise: Learning Image Restoration without Clean Data , 2018, ICML.

[328]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[329]  Chris Donahue,et al.  Adversarial Audio Synthesis , 2018, ICLR.

[330]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[331]  Jiajun Wu,et al.  Video Enhancement with Task-Oriented Flow , 2018, International Journal of Computer Vision.

[332]  Xindong Wu,et al.  Object Detection With Deep Learning: A Review , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[333]  Shengping Zhang,et al.  Pix2Vox: Context-Aware 3D Reconstruction From Single and Multi-View Images , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[334]  Yu Liu,et al.  Multi-focus image fusion with a deep convolutional neural network , 2017, Inf. Fusion.

[335]  Yuan Xie,et al.  Removing Turbulence Effect via Hybrid Total Variation and Deformation-Guided Kernel Regression , 2016, IEEE Transactions on Image Processing.

[336]  Rynson W. H. Lau,et al.  Dynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[337]  Alexei A. Efros,et al.  Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[338]  Younggun Cho,et al.  Estimation of ambient light and transmission map with common convolutional architecture , 2016, OCEANS 2016 MTS/IEEE Monterey.

[339]  Hans-Peter Seidel,et al.  Deep Shading: Convolutional Neural Networks for Screen Space Shading , 2016, Comput. Graph. Forum.

[340]  Luc Van Gool,et al.  Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds , 2020, ECCV.

[341]  Cihan Kaleli,et al.  A review on deep learning for recommender systems: challenges and remedies , 2018, Artificial Intelligence Review.

[342]  Georgios Tzimiropoulos,et al.  How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks) , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[343]  Steffen Staab,et al.  Bias in data‐driven artificial intelligence systems—An introductory survey , 2020, WIREs Data Mining Knowl. Discov..

[344]  Konstantin Dörr,et al.  Mapping the field of Algorithmic Journalism , 2016 .

[345]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[346]  Hao Li,et al.  paGAN: real-time avatars using dynamic textures , 2019, ACM Trans. Graph..

[347]  Ruigang Yang,et al.  Learning Depth with Convolutional Spatial Propagation Network , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[348]  Steven C. H. Hoi,et al.  Deep Learning for Image Super-Resolution: A Survey , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[349]  Ira Kemelmacher-Shlizerman,et al.  Synthesizing Obama , 2017, ACM Trans. Graph..

[350]  Kaiqi Huang,et al.  GP-GAN: Towards Realistic High-Resolution Image Blending , 2017, ACM Multimedia.

[351]  Xiaoyong Shen,et al.  Dynamic Scene Deblurring With Parameter Selective Sharing and Nested Skip Connections , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[352]  Joseph Paul Cohen,et al.  Deep semantic segmentation of natural and medical images: a review , 2019, Artificial Intelligence Review.

[353]  Bernt Schiele,et al.  "Best-of-Many-Samples" Distribution Matching , 2019, ArXiv.

[354]  Edward J. Delp,et al.  Deepfake Video Detection Using Recurrent Neural Networks , 2018, 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[355]  Bo Yang,et al.  3D Object Reconstruction from a Single Depth View with Adversarial Learning , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[356]  Feng Jiang,et al.  An End-to-End Compression Framework Based on Convolutional Neural Networks , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[357]  Michael Evans,et al.  AI IN PRODUCTION: VIDEO ANALYSIS AND MACHINE LEARNING FOR EXPANDED LIVE EVENTS COVERAGE , 2020 .

[358]  Donald E. Brown,et al.  Text Classification Algorithms: A Survey , 2019, Inf..

[359]  Kuan Fang,et al.  Track-RNN : Joint Detection and Tracking Using Recurrent Neural Networks , 2016 .

[360]  Paul E. Debevec,et al.  The relightables , 2019, ACM Trans. Graph..

[361]  Jiajun Wu,et al.  Deep multiple instance learning for image classification and auto-annotation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[362]  Oh-Wook Kwon,et al.  EMOTION RECOGNITION BY SPEECH SIGNAL , 2003 .

[363]  Qiang Wang,et al.  Fast Online Object Tracking and Segmentation: A Unifying Approach , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[364]  Sanja Fidler,et al.  Learning to Simulate Dynamic Environments With GameGAN , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[365]  Wei Wang,et al.  Deep Learning for Single Image Super-Resolution: A Brief Review , 2018, IEEE Transactions on Multimedia.

[366]  Qian Chen,et al.  Thermal Infrared Colorization via Conditional Generative Adversarial Network , 2018, Infrared Physics & Technology.

[367]  Tao Zhou,et al.  Light field salient object detection: A review and benchmark , 2020, Computational Visual Media.

[368]  Hossein Mobahi,et al.  Learning with a Wasserstein Loss , 2015, NIPS.

[369]  Yu-Chiang Frank Wang,et al.  Order-Free RNN with Visual Attention for Multi-Label Classification , 2017, AAAI.

[370]  Francesco Ricci,et al.  Contextual music information retrieval and recommendation: State of the art and challenges , 2012, Comput. Sci. Rev..

[371]  Jiaya Jia,et al.  Single Image Motion Deblurring Using Transparency , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[372]  Daiheng Gao,et al.  DeepFaceLab: A simple, flexible and extensible face swapping framework , 2020, ArXiv.

[373]  Bernhard Schölkopf,et al.  Online Video Deblurring via Dynamic Temporal Blending Network , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[374]  Yu Tian,et al.  CR-GAN: Learning Complete Representations for Multi-view Generation , 2018, IJCAI.

[375]  Jaakko Lehtinen,et al.  Progressive Growing of GANs for Improved Quality, Stability, and Variation , 2017, ICLR.

[376]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[377]  Dong Liu,et al.  CNN-Based DCT-Like Transform for Image Compression , 2018, MMM.

[378]  Jizheng Xu,et al.  AOD-Net: All-in-One Dehazing Network , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[379]  Fan Zhang,et al.  ViSTRA2: Video Coding using Spatial Resolution and Effective Bit Depth Adaptation , 2019, ArXiv.

[380]  W. Zuo,et al.  Deep Learning on Image Denoising: An overview , 2019, Neural Networks.

[381]  Luc Van Gool,et al.  The 2005 PASCAL Visual Object Classes Challenge , 2005, MLCW.

[382]  Lok Ming Lui,et al.  Subsampled Turbulence Removal Network , 2018, Mathematics, Computation and Geometry of Data.

[383]  J. Bouchaud An introduction to statistical finance , 2002 .

[384]  Atsushi Shimada,et al.  Sparse Cost Volume for Efficient Stereo Matching , 2018, Remote. Sens..

[385]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[386]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[387]  Jiucang Hao,et al.  Emotion recognition by speech signals , 2003, INTERSPEECH.

[388]  Taku Komura,et al.  Learning motion manifolds with convolutional autoencoders , 2015, SIGGRAPH Asia Technical Briefs.

[389]  Xinfeng Zhang,et al.  Enhanced Bi-Prediction With Convolutional Neural Network for High-Efficiency Video Coding , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[390]  Chen Change Loy,et al.  EDVR: Video Restoration With Enhanced Deformable Convolutional Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[391]  Andrea Cavallaro,et al.  Resource Allocation for Personalized Video Summarization , 2014, IEEE Transactions on Multimedia.

[392]  Tong Zhang,et al.  Effective Use of Word Order for Text Categorization with Convolutional Neural Networks , 2014, NAACL.

[393]  Lei Zhang,et al.  FFDNet: Toward a Fast and Flexible Solution for CNN-Based Image Denoising , 2017, IEEE Transactions on Image Processing.

[394]  Stuart Russell Artificial Intelligence: A Binary Approach , 2020 .

[395]  Gregory Shakhnarovich,et al.  Recurrent Back-Projection Network for Video Super-Resolution , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[396]  Hui Chen,et al.  Temporal-Difference Learning With Sampling Baseline for Image Captioning , 2018, AAAI.

[397]  Richard Souvenir,et al.  Evaluation of Image Inpainting for Classification and Retrieval , 2020, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[398]  Bin Sheng,et al.  Deep Colorization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[399]  Jean-Michel Morel,et al.  A Non-Local CNN for Video Denoising , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[400]  Lucas Theis,et al.  Faster gaze prediction with dense networks and Fisher pruning , 2018, ArXiv.

[401]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[402]  P. Milgram,et al.  A Taxonomy of Mixed Reality Visual Displays , 1994 .

[403]  Kyoung Mu Lee,et al.  Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).