Learning to search for images without annotations
暂无分享,去创建一个
[1] Alexander C. Berg,et al. Automatic Attribute Discovery and Characterization from Noisy Web Data , 2010, ECCV.
[2] W. Bruce Croft,et al. Predicting query performance , 2002, SIGIR '02.
[3] Mark Craven,et al. Supervised versus multiple instance learning: an empirical comparison , 2005, ICML.
[4] Hanhui Li,et al. BAP: Bimodal Attribute Prediction for Zero-Shot Image Categorization , 2014, ACM Multimedia.
[5] Rob Fergus,et al. Visualizing and Understanding Convolutional Networks , 2013, ECCV.
[6] Shih-Fu Chang,et al. Short-term audio-visual atoms for generic video concept classification , 2009, ACM Multimedia.
[7] M. de Rijke,et al. Adding semantics to microblog posts , 2012, WSDM '12.
[8] Andrew Y. Ng,et al. Zero-Shot Learning Through Cross-Modal Transfer , 2013, NIPS.
[9] Paul Over,et al. Evaluation campaigns and TRECVid , 2006, MIR '06.
[10] Dragomir Anguelov,et al. Capturing Long-Tail Distributions of Object Subcategories , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[11] James Hays,et al. SUN attribute database: Discovering, annotating, and recognizing scene attributes , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[12] Koen E. A. van de Sande,et al. All vehicles are cars: subclass preferences in container concepts , 2012, ICMR '12.
[13] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Thomas Mensink,et al. Image Classification with the Fisher Vector: Theory and Practice , 2013, International Journal of Computer Vision.
[15] Krista A. Ehinger,et al. SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[16] Xirong Li,et al. Evaluating sources and strategies for learning video concepts from social media , 2013, 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI).
[17] Meng Wang,et al. Learning concept bundles for video search with complex queries , 2011, MM '11.
[18] Thomas Deselaers,et al. Measuring the Objectness of Image Windows , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[19] Koen E. A. van de Sande,et al. Selective Search for Object Recognition , 2013, International Journal of Computer Vision.
[20] Rong Yan,et al. Negative pseudo-relevance feedback in content-based video retrieval , 2003, MULTIMEDIA '03.
[21] Stephen Gould,et al. Decomposing a scene into geometric and semantically consistent regions , 2009, 2009 IEEE 12th International Conference on Computer Vision.
[22] Cordelia Schmid,et al. Learning Color Names for Real-World Applications , 2009, IEEE Transactions on Image Processing.
[23] Dong Liu,et al. Tag ranking , 2009, WWW '09.
[24] Jitendra Malik,et al. Recognition using regions , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[25] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.
[26] Chen Xu,et al. The SUN Attribute Database: Beyond Categories for Deeper Scene Understanding , 2014, International Journal of Computer Vision.
[27] Cees G. M. Snoek,et al. Best practices for learning video concept detectors from social media examples , 2014, Multimedia Tools and Applications.
[28] Marcel Worring,et al. Bootstrapping Visual Categorization With Relevant Negatives , 2013, IEEE Transactions on Multimedia.
[29] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[30] Cyrus Rashtchian,et al. Every Picture Tells a Story: Generating Sentences from Images , 2010, ECCV.
[31] Akira Kojima,et al. A novel method for semantic video concept learning using web images , 2011, MM '11.
[32] Nenghai Yu,et al. Learning to tag , 2009, WWW '09.
[33] Subhransu Maji,et al. Classification using intersection kernel support vector machines is efficient , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[34] Cees Snoek,et al. Exploring the Long Tail of Social Media Tags , 2016, MMM.
[35] Vidit Jain,et al. Learning to re-rank: query-dependent image re-ranking using click data , 2011, WWW.
[36] A. Oliva,et al. From Blobs to Boundary Edges: Evidence for Time- and Spatial-Scale-Dependent Scene Recognition , 1994 .
[37] Vicente Ordonez,et al. Im2Text: Describing Images Using 1 Million Captioned Photographs , 2011, NIPS.
[38] Mark Craven,et al. Multiple-Instance Active Learning , 2007, NIPS.
[39] Cees Snoek,et al. Can social tagged images aid concept-based video search? , 2009, 2009 IEEE International Conference on Multimedia and Expo.
[40] Marcel Worring,et al. Learning Social Tag Relevance by Neighbor Voting , 2009, IEEE Transactions on Multimedia.
[41] Jiebo Luo,et al. Large-scale multimodal semantic concept detection for consumer video , 2007, MIR '07.
[42] Daphne Koller,et al. Learning Spatial Context: Using Stuff to Find Things , 2008, ECCV.
[43] Antonio Torralba,et al. Recognizing indoor scenes , 2009, CVPR.
[44] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.
[45] Michelle R. Greene. Statistics of high-level scene context , 2013, Front. Psychol..
[46] Alexei A. Efros,et al. Mid-level Visual Element Discovery as Discriminative Mode Seeking , 2013, NIPS.
[47] Kristen Grauman,et al. Zero-shot recognition with unreliable attributes , 2014, NIPS.
[48] Chong-Wah Ngo,et al. Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study , 2010, IEEE Transactions on Multimedia.
[49] Cees Snoek,et al. Image2Emoji: Zero-shot Emoji Prediction for Visual Media , 2015, ACM Multimedia.
[50] Derek Hoiem,et al. Category Independent Object Proposals , 2010, ECCV.
[51] Antonio Torralba,et al. Semantic Label Sharing for Learning with Many Categories , 2010, ECCV.
[52] Ali Farhadi,et al. Describing objects by their attributes , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[53] Bogdan Ionescu,et al. Toward an Estimation of User Tagging Credibility for Social Image Retrieval , 2014, ACM Multimedia.
[54] David A. Shamma,et al. YFCC100M , 2015, Commun. ACM.
[55] Svetlana Lazebnik,et al. Multi-scale Orderless Pooling of Deep Convolutional Activation Features , 2014, ECCV.
[56] Marcel Worring,et al. Unsupervised multi-feature tag relevance learning for social image retrieval , 2010, CIVR '10.
[57] Baoxin Li,et al. YouTubeCat: Learning to categorize wild web videos , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[58] Martha Larson,et al. Reading between the tags to predict real-world size-class for visually depicted objects in images , 2011, MM '11.
[59] I. Biederman. Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.
[60] Cees Snoek,et al. Pooling Objects for Recognizing Scenes without Examples , 2016, ICMR.
[61] Florent Perronnin,et al. Textual Similarity with a Bag-of-Embedded-Words Model , 2013, ICTIR.
[62] Luc Van Gool,et al. The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.
[63] Lorenzo Torresani,et al. Classemes and Other Classifier-Based Features for Efficient Object Categorization , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[64] Alberto Del Bimbo,et al. Enriching and localizing semantic tags in internet videos , 2011, ACM Multimedia.
[65] Alexei A. Efros,et al. Scene Semantics from Long-Term Observation of People , 2012, ECCV.
[66] Thorsten Joachims,et al. Optimizing search engines using clickthrough data , 2002, KDD.
[67] Cees Snoek,et al. Objects2action: Classifying and Localizing Actions without Any Video Example , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[68] Samy Bengio,et al. Zero-Shot Learning by Convex Combination of Semantic Embeddings , 2013, ICLR.
[69] Koen E. A. van de Sande,et al. Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[70] Shih-Fu Chang,et al. To search or to label?: predicting the performance of search-based automatic image classifiers , 2006, MIR '06.
[71] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[72] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[73] ChengXiang Zhai,et al. Statistical Language Models for Information Retrieval: A Critical Review , 2008, Found. Trends Inf. Retr..
[74] Hao Su,et al. Object Bank: An Object-Level Image Representation for High-Level Visual Recognition , 2014, International Journal of Computer Vision.
[75] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[76] Fei-Fei Li,et al. Novel Dataset for Fine-Grained Image Categorization : Stanford Dogs , 2012 .
[77] James M. Rehg,et al. CENTRIST: A Visual Descriptor for Scene Categorization , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[78] Mingjing Li. Texture Moment for Content-Based Image Retrieval , 2007, 2007 IEEE International Conference on Multimedia and Expo.
[79] Irving Biederman,et al. On the Semantics of a Glance at a Scene , 2017 .
[80] Rongrong Ji,et al. Large-scale visual sentiment ontology and detectors using adjective noun pairs , 2013, ACM Multimedia.
[81] Vladimir Vapnik,et al. Statistical learning theory , 1998 .
[82] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[83] Vladlen Koltun,et al. Geodesic Object Proposals , 2014, ECCV.
[84] C. V. Jawahar,et al. Blocks That Shout: Distinctive Parts for Scene Classification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[85] Christoph H. Lampert,et al. Attribute-Based Classification for Zero-Shot Visual Object Categorization , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[86] Philip H. S. Torr,et al. An embarrassingly simple approach to zero-shot learning , 2015, ICML.
[87] Alberto Del Bimbo,et al. An evaluation of nearest-neighbor methods for tag refinement , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).
[88] Jun Yang,et al. (Un)Reliability of video concept detection , 2008, CIVR '08.
[89] Yiannis Aloimonos,et al. Corpus-Guided Sentence Generation of Natural Images , 2011, EMNLP.
[90] Adrian Ulges,et al. A System That Learns to Tag Videos by Watching Youtube , 2008, ICVS.
[91] C. Lawrence Zitnick,et al. Zero-Shot Learning via Visual Abstraction , 2014, ECCV.
[92] Nenghai Yu,et al. Multiple-instance ranking: Learning to rank images for image retrieval , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[93] Adrian Ulges,et al. Identifying relevant frames in weakly labeled videos for training concept detectors , 2008, CIVR '08.
[94] Antonio Torralba,et al. LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.
[95] Tamara L. Berg,et al. Baby Talk : Understanding and Generating Image Descriptions , 2011 .
[96] Christoph H. Lampert,et al. Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[97] Cees G. M. Snoek,et al. The MediaMill at TRECVID 2013: : Searching concepts, Objects, Instances and events in video , 2013, TRECVID.
[98] H. Hayne,et al. The effect of drawing on memory performance in young children. , 1995 .
[99] Bolei Zhou,et al. Object Detectors Emerge in Deep Scene CNNs , 2014, ICLR.
[100] Markus Koch,et al. Linking visual concept detection with viewer demographics , 2012, ICMR '12.
[101] Kristen Grauman,et al. Learning the Relative Importance of Objects from Tagged Images for Retrieval and Cross-Modal Search , 2011, International Journal of Computer Vision.
[102] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .
[103] Vladimir Pavlovic,et al. Attribute rating for classification of visual objects , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).
[104] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[105] Marcel Worring,et al. This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Harvesting Social Images for Bi-Concept Search , 2022 .
[106] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.
[107] George A. Miller,et al. WordNet: A Lexical Database for English , 1995, HLT.
[108] C. Lawrence Zitnick,et al. Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.
[109] Chong-Wah Ngo,et al. Sampling and Ontologically Pooling Web Images for Visual Concept Learning , 2012, IEEE Transactions on Multimedia.
[110] Thomas Deselaers,et al. What is an object? , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[111] I. Biederman,et al. Scene perception: Detecting and judging objects undergoing relational violations , 1982, Cognitive Psychology.
[112] Santiago Manen,et al. Prime Object Proposals with Randomized Prim's Algorithm , 2013, 2013 IEEE International Conference on Computer Vision.
[113] Martha Larson,et al. SocialZap: Catch-up on Interesting Television Fragments Discovered from Social Media , 2014, ICMR.
[114] Antonio Torralba,et al. Depth Estimation from Image Structure , 2002, IEEE Trans. Pattern Anal. Mach. Intell..
[115] Roelof van Zwol,et al. Flickr tag recommendation based on collective knowledge , 2008, WWW.
[116] Sourav S. Bhowmick,et al. Content is still king: the effect of neighbor voting schemes on tag relevance for social image retrieval , 2012, ICMR.
[117] Marcel Worring,et al. Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..
[118] Gang Wang,et al. On the sampling of web images for learning visual concept classifiers , 2010, CIVR '10.
[119] Qi Tian,et al. Image Classification and Retrieval are ONE , 2015, ICMR.
[120] Grant Schindler,et al. Internet video category recognition , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.
[121] Meng Wang,et al. Harvesting visual concepts for image search with complex queries , 2012, ACM Multimedia.
[122] Meng Wang,et al. ShotTagger: tag location for internet videos , 2011, ICMR.
[123] Ivor W. Tsang,et al. Textual Query of Personal Photos Facilitated by Large-Scale Web Data , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[124] Hao Su,et al. Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification , 2010, NIPS.
[125] C. Lawrence Zitnick,et al. Adopting Abstract Images for Semantic Scene Understanding , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[126] Cyrus Rashtchian,et al. Collecting Image Annotations Using Amazon’s Mechanical Turk , 2010, Mturk@HLT-NAACL.
[127] Yueting Zhuang,et al. Jointly Discovering Fine-grained and Coarse-grained Sentiments via Topic Modeling , 2014, ACM Multimedia.
[128] Hayit Greenspan,et al. Finding Pictures of Objects in Large Collections of Images , 1996, Object Representation in Computer Vision.
[129] Kristen Grauman,et al. Relative attributes , 2011, 2011 International Conference on Computer Vision.
[130] Matthew B. Blaschko,et al. Learning a category independent object detection cascade , 2011, 2011 International Conference on Computer Vision.
[131] Chun Chen,et al. Personalized automatic image annotation based on reinforcement learning , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).
[132] Arnold W. M. Smeulders,et al. c ○ 2005 Springer Science + Business Media, Inc. Manufactured in The Netherlands. A Six-Stimulus Theory for Stochastic Texture , 2002 .
[133] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .
[134] Antonio Torralba,et al. Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.
[135] C. Lawrence Zitnick,et al. Fast Edge Detection Using Structured Forests , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[136] Antonio Criminisi,et al. Harvesting Image Databases from the Web , 2007, 2007 IEEE 11th International Conference on Computer Vision.
[137] Jianping Fan,et al. Harvesting large-scale weakly-tagged image databases from the web , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[138] Marko Heikkilä,et al. Description of interest regions with local binary patterns , 2009, Pattern Recognit..
[139] Yun Yang,et al. Emotionally Representative Image Discovery for Social Events , 2014, ICMR.
[140] Thorsten Joachims,et al. Evaluation methods for unsupervised word embeddings , 2015, EMNLP.
[141] Kilian Q. Weinberger,et al. Resolving tag ambiguity , 2008, ACM Multimedia.
[142] Chong-Wah Ngo,et al. On the Annotation of Web Videos by Efficient Near-Duplicate Search , 2010, IEEE Transactions on Multimedia.