Cross-media analysis and reasoning: advances and directions
暂无分享,去创建一个
Wen Gao | Yao Zhao | Qing-ming Huang | Yu-xin Peng | Wen-wu Zhu | Chang-sheng Xu | Han-qing Lu | Qing-hua Zheng | Tie-jun Huang
[1] Hua Xu,et al. Applying active learning to high-throughput phenotyping algorithms for electronic health records data. , 2013, Journal of the American Medical Informatics Association : JAMIA.
[2] Eugene Garfield,et al. Historiographic Mapping of Knowledge Domains Literature , 2004, J. Inf. Sci..
[3] Beng Chin Ooi,et al. Effective Multi-Modal Retrieval based on Stacked Auto-Encoders , 2014, Proc. VLDB Endow..
[4] Vicente Ordonez,et al. Im2Text: Describing Images Using 1 Million Captioned Photographs , 2011, NIPS.
[5] Wei Zhang,et al. Knowledge vault: a web-scale approach to probabilistic knowledge fusion , 2014, KDD.
[6] Kira Radinsky,et al. Learning causality for news events prediction , 2012, WWW.
[7] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[8] Gerhard Weikum,et al. Knowledge Bases in the Age of Big Data Analytics , 2014, Proc. VLDB Endow..
[9] Yang Yang,et al. Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes , 2014, ACM Multimedia.
[10] Qingming Huang,et al. Location-Based Parallel Tag Completion for Geo-Tagged Social Image Retrieval , 2017, ACM Trans. Intell. Syst. Technol..
[11] Wenwu Zhu,et al. Learning Compact Hash Codes for Multimodal Representations Using Orthogonal Deep Structure , 2015, IEEE Transactions on Multimedia.
[12] Yuxin Peng,et al. Cross-Media Shared Representation by Hierarchical Learning with Multiple Deep Networks , 2016, IJCAI.
[13] Roger Levy,et al. A new approach to cross-modal multimedia retrieval , 2010, ACM Multimedia.
[14] Jeremy Ginsberg,et al. Detecting influenza epidemics using search engine query data , 2009, Nature.
[15] Ahmet Uyar,et al. Evaluating search features of Google Knowledge Graph and Bing Satori: Entity types, list searches and query interfaces , 2015, Online Inf. Rev..
[16] D. Lazer,et al. The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.
[17] Jeff A. Bilmes,et al. Deep Canonical Correlation Analysis , 2013, ICML.
[18] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[19] Feng-Hsiung Hsu,et al. Behind Deep Blue: Building the Computer that Defeated the World Chess Champion , 2002 .
[20] Aron Culotta,et al. Estimating county health statistics with twitter , 2014, CHI.
[21] M. Shamim Hossain,et al. Folksonomy-Based Visual Ontology Construction and Its Applications , 2016, IEEE Transactions on Multimedia.
[22] Margaret Mitchell,et al. VQA: Visual Question Answering , 2015, International Journal of Computer Vision.
[23] Agnar Aamodt,et al. Case-Based Reasoning: Foundational Issues, Methodological Variations, and System Approaches , 1994, AI Commun..
[24] G H Land,et al. Evidence-based decision making in public health. , 1999, Journal of public health management and practice : JPHMP.
[25] Michael Gamon,et al. Active objects: actions for entity-centric search , 2012, WWW.
[26] Nikhil Rasiwasia,et al. Cluster Canonical Correlation Analysis , 2014, AISTATS.
[27] Erik T. Mueller,et al. Watson: Beyond Jeopardy! , 2013, Artif. Intell..
[28] Xinlei Chen,et al. NEIL: Extracting Visual Knowledge from Web Data , 2013, 2013 IEEE International Conference on Computer Vision.
[29] Andrew Y. Ng,et al. Parsing Natural Scenes and Natural Language with Recursive Neural Networks , 2011, ICML.
[30] Ruifan Li,et al. Cross-modal Retrieval with Correspondence Autoencoder , 2014, ACM Multimedia.
[31] Sabine Schulte im Walde,et al. A Multimodal LDA Model integrating Textual, Cognitive and Visual Modalities , 2013, EMNLP.
[32] Yao Zhao,et al. Cross-Modal Retrieval With CNN Visual Features: A New Baseline , 2017, IEEE Transactions on Cybernetics.
[33] Frédo Durand,et al. Capturing the human figure through a wall , 2015, ACM Trans. Graph..
[34] Quoc V. Le,et al. Grounded Compositional Semantics for Finding and Describing Images with Sentences , 2014, TACL.
[35] Nitish Srivastava,et al. Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..
[36] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[37] Yunhe Pan,et al. Heading toward Artificial Intelligence 2.0 , 2016 .
[38] Yiannis Aloimonos,et al. Corpus-Guided Sentence Generation of Natural Images , 2011, EMNLP.
[39] Christopher Ré,et al. Ringtail: A Generalized Nowcasting System , 2013, Proc. VLDB Endow..
[40] Nicu Sebe,et al. Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.
[41] J. Pearl. Causality: Models, Reasoning and Inference , 2000 .
[42] Estevam R. Hruschka,et al. Toward an Architecture for Never-Ending Language Learning , 2010, AAAI.
[43] Xiaohua Zhai,et al. Semi-Supervised Cross-Media Feature Learning With Unified Patch Graph Regularization , 2016, IEEE Transactions on Circuits and Systems for Video Technology.
[44] Vanessa E. Gray,et al. Evolutionary diagnosis method for variants in personal exomes , 2012, Nature Methods.
[45] Alessandro Lazaric,et al. Transfer in Reinforcement Learning: A Framework and a Survey , 2012, Reinforcement Learning.
[46] Fei-Fei Li,et al. Deep visual-semantic alignments for generating image descriptions , 2015, CVPR.
[47] José Ruíz Ascencio,et al. Visual simultaneous localization and mapping: a survey , 2012, Artificial Intelligence Review.
[48] Yueting Zhuang,et al. Multi-modal Mutual Topic Reinforce Modeling for Cross-media Retrieval , 2014, ACM Multimedia.
[49] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.
[50] Peter Young,et al. Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics , 2013, J. Artif. Intell. Res..
[51] Elizabeth Gibney. DeepMind algorithm beats people at classic video games. , 2015 .
[52] Yejin Choi,et al. TreeTalk: Composition and Compression of Trees for Image Descriptions , 2014, TACL.
[53] Li Fei-Fei,et al. Building a Large-scale Multimodal Knowledge Base System for Answering Visual Queries , 2015 .
[54] Yi Yang,et al. A Multimedia Retrieval Framework Based on Semi-Supervised Ranking and Relevance Feedback , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[55] Yi Yang,et al. Harmonizing Hierarchical Manifolds for Multimedia Document Semantics Understanding and Cross-Media Retrieval , 2008, IEEE Transactions on Multimedia.
[56] Tat-Seng Chua,et al. Learning from Collective Intelligence , 2016, ACM Trans. Multim. Comput. Commun. Appl..
[57] Xu Jia,et al. Guiding Long-Short Term Memory for Image Caption Generation , 2015, ArXiv.
[58] Junsong Yuan,et al. Boosting cross-media retrieval via visual-auditory feature analysis and relevance feedback , 2014, ACM Multimedia.
[59] Paul M. B. Vitányi,et al. The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.
[60] Cheng Pan,et al. Automated annotation of developmental stages of Drosophila embryos in images containing spatial patterns of expression , 2013, Bioinform..
[61] Victor S. Lempitsky,et al. Neural Codes for Image Retrieval , 2014, ECCV.
[62] H. Hotelling. Relations Between Two Sets of Variates , 1936 .
[63] R. Venkatesh Babu,et al. Attribute-Graph: A Graph Based Approach to Image Ranking , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[64] H. McGurk,et al. Hearing lips and seeing voices , 1976, Nature.
[65] Anupam Agrawal,et al. Vision based hand gesture recognition for human computer interaction: a survey , 2012, Artificial Intelligence Review.
[66] Ali Farhadi,et al. VisKE: Visual knowledge extraction and question answering by visual verification of relation phrases , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[67] Elizabeth Gibney,et al. Game-playing software holds lessons for neuroscience , 2015, Nature.
[68] Michael S. Bernstein,et al. Image retrieval using scene graphs , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[69] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[70] Xiaohua Zhai,et al. Learning Cross-Media Joint Representation With Sparse and Semisupervised Regularization , 2014, IEEE Transactions on Circuits and Systems for Video Technology.
[71] Atul J. Butte,et al. Clinical Arrays of Laboratory Measures, or "Clinarrays", Built from an Electronic Health Record Enable Disease Subtyping by Severity , 2007, AMIA.
[72] Chunqiang Tang,et al. On iterative intelligent medical search , 2008, SIGIR '08.
[73] Yejin Choi,et al. Baby talk: Understanding and generating simple image descriptions , 2011, CVPR 2011.
[74] Juhan Nam,et al. Multimodal Deep Learning , 2011, ICML.
[75] Petros Daras,et al. Search and Retrieval of Rich Media Objects Supporting Multiple Multimodal Queries , 2012, IEEE Transactions on Multimedia.
[76] Thomas H. Davenport,et al. Book review:Working knowledge: How organizations manage what they know. Thomas H. Davenport and Laurence Prusak. Harvard Business School Press, 1998. $29.95US. ISBN 0‐87584‐655‐6 , 1998 .
[77] Hang Li,et al. Learning Similarity Function between Objects in Heterogeneous Spaces , 2010 .
[78] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[79] Christiane Fellbaum,et al. Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.
[80] Jian Pei,et al. Parallel field alignment for cross media retrieval , 2013, ACM Multimedia.
[81] Michael Isard,et al. A Multi-View Embedding Space for Modeling Internet Images, Tags, and Their Semantics , 2012, International Journal of Computer Vision.