Learning Compositional Sparse Models of Bimodal Percepts
暂无分享,去创建一个
[1] Chenliang Xu,et al. A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[2] H. B. Barlow,et al. Possible Principles Underlying the Transformations of Sensory Messages , 2012 .
[3] David J. Field,et al. Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.
[4] Luke S. Zettlemoyer,et al. A Joint Model of Language and Perception for Grounded Attribute Learning , 2012, ICML.
[5] Ross A. Knepper,et al. Single assembly robot in search of human partner: Versatile grounded language generation , 2013, 2013 8th ACM/IEEE International Conference on Human-Robot Interaction (HRI).
[6] Jake Porway,et al. Object Categorization: Learning Compositional Models for Object Categories from Small Sample Sets , 2008 .
[7] Herbert Freeman,et al. Computer Processing of Line-Drawing Images , 1974, CSUR.
[8] Kate Saenko,et al. Generating Natural-Language Video Descriptions Using Text-Mined Knowledge , 2013, AAAI.
[9] David A. Forsyth,et al. Matching Words and Pictures , 2003, J. Mach. Learn. Res..
[10] Paul Vogt,et al. The physical symbol grounding problem , 2002, Cognitive Systems Research.
[11] Thomas S. Huang,et al. Image Super-Resolution Via Sparse Representation , 2010, IEEE Transactions on Image Processing.
[12] Charles R. Giardina,et al. Elliptic Fourier features of a closed contour , 1982, Comput. Graph. Image Process..
[13] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .
[14] Raymond J. Mooney,et al. Learning to Interpret Natural Language Navigation Instructions from Observations , 2011, Proceedings of the AAAI Conference on Artificial Intelligence.
[15] Beth Logan,et al. Mel Frequency Cepstral Coefficients for Music Modeling , 2000, ISMIR.
[16] Sanja Fidler,et al. Towards Scalable Representations of Object Categories: Learning a Hierarchy of Parts , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.
[17] Alex Pentland,et al. Learning words from sights and sounds: a computational model , 2002, Cogn. Sci..
[18] James Elder,et al. The effect of contour closure on the rapid discrimination of two-dimensional shapes , 1993, Vision Research.
[19] Sabine Schulte im Walde,et al. A Multimodal LDA Model integrating Textual, Cognitive and Visual Modalities , 2013, EMNLP.
[20] Francesco Orilia,et al. Semantics and Cognition , 1991 .
[21] Deb Roy,et al. Grounded Situation Models for Robots: Where words and percepts meet , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[22] Sven J. Dickinson,et al. Video In Sentences Out , 2012, UAI.
[23] Quan Pan,et al. Semi-coupled dictionary learning with applications to image super-resolution and photo-sketch synthesis , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[24] H. Kuhn. The Hungarian method for the assignment problem , 1955 .
[25] Julia Hirschberg,et al. V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure , 2007, EMNLP.
[26] Michael S. Lewicki,et al. Efficient coding of natural sounds , 2002, Nature Neuroscience.
[27] Jeffrey Mark Siskind,et al. Grounded Language Learning from Video Described with Sentences , 2013, ACL.
[28] E. Knudsen,et al. Creating a unified representation of visual and auditory space in the brain. , 1995, Annual review of neuroscience.
[29] Antonis A. Argyros,et al. Physically Plausible 3D Scene Tracking: The Single Actor Hypothesis , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[30] Matthew R. Walter,et al. Understanding Natural Language Commands for Robotic Navigation and Mobile Manipulation , 2011, AAAI.
[31] Guillermo Sapiro,et al. Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..
[32] Antonio Torralba,et al. HOGgles: Visualizing Object Detection Features , 2013, 2013 IEEE International Conference on Computer Vision.