Deep Cross-Modal Audio-Visual Generation
暂无分享,去创建一个
Chenliang Xu | Zhiyao Duan | Lele Chen | Sudhanshu Srivastava | Chenliang Xu | Lele Chen | Sudhanshu Srivastava | Z. Duan
[1] Russell L. Storms. Auditory-visual cross-modal perception phenomena , 1998 .
[2] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[3] Nitish Srivastava,et al. Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..
[4] Noëlle Carbonell,et al. An experimental study of future “natural” multimodal human-computer interaction , 1993, CHI '93.
[5] Roger Levy,et al. A new approach to cross-modal multimedia retrieval , 2010, ACM Multimedia.
[6] Wei-Lun Chao,et al. An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild , 2016, ECCV.
[7] Rob Fergus,et al. Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks , 2015, NIPS.
[8] Roger Levy,et al. On the Role of Correlation and Abstraction in Cross-Modal Multimedia Retrieval , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[9] Gaurav Sharma,et al. Creating A Musical Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications , 2016, ArXiv.
[10] Antonio Bonafonte,et al. SEGAN: Speech Enhancement Generative Adversarial Network , 2017, INTERSPEECH.
[11] Hang Zhang,et al. Multi-style Generative Network for Real-time Transfer , 2017, ECCV Workshops.
[12] Mubarak Shah,et al. Semi and Weakly Supervised Semantic Segmentation Using Generative Adversarial Network , 2017, ArXiv.
[13] R. K. Davenport,et al. CROSS‐MODAL PERCEPTION IN APES * , 1976, Annals of the New York Academy of Sciences.
[14] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[15] Bernt Schiele,et al. Learning Deep Representations of Fine-Grained Visual Descriptions , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[16] Jason J. Corso,et al. Learning Compositional Sparse Models of Bimodal Percepts , 2014, AAAI.
[17] Brian D. Ziebart,et al. Adversarial Methods Improve Object Localization , 2016 .
[18] J. Vroomen,et al. Sound enhances visual perception: cross-modal effects of auditory organization on vision. , 2000, Journal of experimental psychology. Human perception and performance.
[19] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.
[20] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.
[21] Sidney S. Simon,et al. Merging of the Senses , 2008, Front. Neurosci..
[22] Juhan Nam,et al. Multimodal Deep Learning , 2011, ICML.
[23] Gaurav Sharma,et al. See and listen: Score-informed association of sound tracks to players in chamber music performance videos , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[24] Ruifan Li,et al. Cross-modal Retrieval with Correspondence Autoencoder , 2014, ACM Multimedia.
[25] Camille Couprie,et al. Semantic Segmentation using Adversarial Networks , 2016, NIPS 2016.
[26] Wei Wang,et al. A Comprehensive Survey on Cross-modal Retrieval , 2016, ArXiv.
[27] Gaurav Sharma,et al. Creating a Multitrack Classical Music Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications , 2016, IEEE Transactions on Multimedia.
[28] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.
[29] Ji Liu,et al. Unsupervised Extraction of Human-Interpretable Nonverbal Behavioral Cues in a Public Speaking Scenario , 2015, ACM Multimedia.
[30] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[31] C. Krumhansl,et al. Cross-modal interactions in the perception of musical performance , 2006, Cognition.
[32] Navdeep Jaitly,et al. Adversarial Autoencoders , 2015, ArXiv.
[33] Concetto Spampinato,et al. Semi Supervised Semantic Segmentation Using Generative Adversarial Network , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[34] Bernt Schiele,et al. Generative Adversarial Text to Image Synthesis , 2016, ICML.
[35] Andrew Owens,et al. Visually Indicated Sounds , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).