Handling imperfections for multimodal image annotation

This thesis deals with multimodal image annotation in the context of social media. We seek to take advantage of textual (tags) and visual information in order to enhance the image annotation performances. However, these tags are often noisy, overly personalized and only a few of them are related to the semantic visual content of the image. In addition, when combining prediction scores from different classifiers learned on different modalities, multimodal image annotation faces their imperfections (uncertainty, imprecision and incompleteness). Consequently, we consider that multimodal image annotation is subject to imperfections at two levels: the representation and the decision. Inspired from the information fusion theory, we focus in this thesis on defining, identifying and handling imperfection aspects in order to improve image annotation.

[1]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[2]  Jean Dezert,et al.  General Combination Rules for Qualitative and Quantitative Beliefs , 2008, J. Adv. Inf. Fusion.

[3]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[4]  Ming Yang,et al.  Discovery of Collocation Patterns: from Visual Words to Visual Phrases , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Marc Najork,et al.  Computing Information Retrieval Performance Measures Efficiently in the Presence of Tied Scores , 2008, ECIR.

[6]  Vladimir Kolmogorov,et al.  Spatially coherent clustering using graph cuts , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[7]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[8]  Tieniu Tan,et al.  Salient coding for image classification , 2011, CVPR 2011.

[9]  Shuicheng Yan,et al.  Learning to rank tags , 2010, CIVR '10.

[10]  Jerome J. Braun Dempster-Shafer theory and Bayesian reasoning in multisensor data fusion , 2000, SPIE Defense + Commercial Sensing.

[11]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Didier Dubois,et al.  New Semantics for Quantitative Possibility Theory , 2001, ECSQARU.

[13]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[14]  Emmanuel Dellandréa,et al.  LIRIS-Imagine at ImageCLEF 2011 Photo Annotation Task , 2011, CLEF.

[15]  Stéphane Marchand-Maillet,et al.  Effective multimodal information fusion by structure learning , 2011, 14th International Conference on Information Fusion.

[16]  Gabriele Moser,et al.  Combining Support Vector Machines and Markov Random Fields in an Integrated Framework for Contextual Image Classification , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[17]  Fakhri Karray,et al.  Multisensor data fusion: A review of the state-of-the-art , 2013, Inf. Fusion.

[18]  L. Zadeh Fuzzy sets as a basis for a theory of possibility , 1999 .

[19]  Meng Wang,et al.  Tag Tagging: Towards More Descriptive Keywords of Image Content , 2011, IEEE Transactions on Multimedia.

[20]  Roelof van Zwol,et al.  Flickr tag recommendation based on collective knowledge , 2008, WWW.

[21]  Lucas Paletta,et al.  A Comparison of Probabilistic, Possibilistic and Evidence Theoretic Fusion Schemes for Active Object Recognition , 1999, Computing.

[22]  Stéphane Marchand-Maillet,et al.  Interactive Representations of Multimodal Databases , 2010 .

[23]  Hao Xu,et al.  Tag refinement by regularized LDA , 2009, ACM Multimedia.

[24]  Motoaki Kawanabe,et al.  Multi-modal visual concept classification of images via Markov random walk over tags , 2011, 2011 IEEE Workshop on Applications of Computer Vision (WACV).

[25]  Cordelia Schmid,et al.  Aggregating Local Image Descriptors into Compact Codes , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[27]  Andrew Y. Ng,et al.  The Importance of Encoding Versus Training with Sparse Coding and Vector Quantization , 2011, ICML.

[28]  Hervé Le Borgne,et al.  Fast shared boosting for large-scale concept detection , 2012, Multimedia Tools and Applications.

[29]  Thierry Denoeux,et al.  Classifier fusion in the Dempster-Shafer framework using optimized t-norm based combination rules , 2011, Int. J. Approx. Reason..

[30]  Yves Peirsman,et al.  Modelling Word Similarity: an Evaluation of Automatic Synonymy Extraction Algorithms , 2008, LREC.

[31]  Dong Liu,et al.  Image retagging , 2010, ACM Multimedia.

[32]  Matthieu Cord,et al.  Pooling in image representation: The visual codeword point of view , 2013, Comput. Vis. Image Underst..

[33]  Alberto Del Bimbo,et al.  Social media annotation , 2013, 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI).

[34]  C. V. Jawahar,et al.  Multi modal semantic indexing for image retrieval , 2010, CIVR '10.

[35]  Tom Ziemke,et al.  On the Definition of Information Fusion as a Field of Research , 2007 .

[36]  Isabelle Bloch,et al.  Fusion of Image Information under Imprecision and Uncertainty: Numerical Methods , 2001, Data Fusion and Perception.

[37]  Gang Wang,et al.  Automatic Generation of Semantic Fields for Annotating Web Images , 2010, COLING.

[38]  Wayne D. Gray,et al.  Be Wary of What Your Computer Reads: The Effects of Corpus Selection on Measuring Semantic Relatedness , 2007 .

[39]  Céline Hudelot,et al.  Tag completion based on belief theory and neighbor voting , 2013, ICMR.

[40]  Ian H. Witten,et al.  Issues in Stacked Generalization , 2011, J. Artif. Intell. Res..

[41]  Lingling Meng,et al.  A Review of Semantic Similarity Measures in WordNet 1 , 2013 .

[42]  Isabelle Bloch,et al.  Information Fusion in Signal and Image Processing , 2008 .

[43]  Miguel Á. Carreira-Perpiñán,et al.  Multiscale conditional random fields for image labeling , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[44]  Mor Naaman,et al.  HT06, tagging paper, taxonomy, Flickr, academic article, to read , 2006, HYPERTEXT '06.

[45]  Paul M. B. Vitányi,et al.  The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.

[46]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[47]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[48]  Cordelia Schmid,et al.  TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[49]  Dong Liu,et al.  Tag quality improvement for social images , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[50]  Daniel P. Huttenlocher,et al.  Landmark classification in large-scale image collections , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[51]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[52]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[53]  Hinrich Schütze,et al.  Word Space , 1992, NIPS.

[54]  Franklin E White,et al.  Data Fusion Lexicon , 1991 .

[55]  Nikos Paragios,et al.  Bag-of-multimedia-words for image classification , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[56]  Mor Naaman,et al.  Why we tag: motivations for annotation in mobile and online media , 2007, CHI.

[57]  D. Dubois,et al.  Possibility theory and data fusion in poorly informed environments , 1994 .

[58]  Richard Bellman,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[59]  Kilian Q. Weinberger,et al.  Reliable tags using image similarity: mining specificity and expertise from large-scale multimedia databases , 2009, WSMC '09.

[60]  Stefanie Nowak,et al.  The CLEF 2011 Photo Annotation and Concept-based Retrieval Tasks , 2011, CLEF.

[61]  Siddharth Patwardhan,et al.  Incorporating Dictionary and Corpus Information into a Context Vector Measure of Semantic Relatednes , 2003 .

[62]  Haojie Li,et al.  Towards tags ranking for social images , 2013, Neurocomputing.

[63]  Philippe Smets,et al.  Constructing the Pignistic Probability Function in a Context of Uncertainty , 1989, UAI.

[64]  Shu-Yuan Chen,et al.  Image classification using color, texture and regions , 2003, Image Vis. Comput..

[65]  Marcel Worring,et al.  Learning Social Tag Relevance by Neighbor Voting , 2009, IEEE Transactions on Multimedia.

[66]  Yihong Gong,et al.  Nonlinear Learning using Local Coordinate Coding , 2009, NIPS.

[67]  Adrian Popescu,et al.  Social media driven image retrieval , 2011, ICMR.

[68]  Ted Pedersen,et al.  Extended Gloss Overlaps as a Measure of Semantic Relatedness , 2003, IJCAI.

[69]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[70]  Antonio Torralba,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 80 Million Tiny Images: a Large Dataset for Non-parametric Object and Scene Recognition , 2022 .

[71]  P. Walley Statistical Reasoning with Imprecise Probabilities , 1990 .

[72]  Bernardo A. Huberman,et al.  Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[73]  Adrian Popescu,et al.  CEA LIST's Participation to the Concept Annotation Task of ImageCLEF 2012 , 2012, CLEF.

[74]  Ahmad Abdollahzadeh Barforoush,et al.  A new word sense similarity measure in wordnet , 2008, 2008 International Multiconference on Computer Science and Information Technology.

[75]  Bernard Mérialdo,et al.  Fusion methods for multi-modal indexing of web data , 2013, 2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS).

[76]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[77]  Gustavo Carneiro,et al.  Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[78]  Tieniu Tan,et al.  Feature Coding in Image Classification: A Comprehensive Study , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[79]  Danushka Bollegala,et al.  Measuring semantic similarity between words using web search engines , 2007, WWW '07.

[80]  Mark Sanderson,et al.  Seven Years of Image Retrieval Evaluation , 2010, ImageCLEF.

[81]  Haojie Li,et al.  Tag ranking by propagating relevance over tag and image graphs , 2012, ICIMCS '12.

[82]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[83]  De Xu,et al.  Beyond tag relevance: integrating visual attention model and multi-instance learning for tag saliency ranking , 2010, CIVR '10.

[84]  Adrian Popescu,et al.  Multimodal feature generation framework for semantic image classification , 2012, ICMR.

[85]  Florentin Smarandache,et al.  Advances and Applications of DSmT for Information Fusion (Collected Works) , 2004 .

[86]  Robert P. W. Duin,et al.  The combining classifier: to train or not to train? , 2002, Object recognition supported by user interaction for service robots.

[87]  Cordelia Schmid,et al.  Multimodal semi-supervised learning for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[88]  Hervé Le Borgne,et al.  Locality-constrained and spatially regularized coding for scene categorization , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[89]  D. Dubois,et al.  When upper probabilities are possibility measures , 1992 .

[90]  Lei Wang,et al.  In defense of soft-assignment coding , 2011, 2011 International Conference on Computer Vision.

[91]  Emmanuel Dellandréa,et al.  Multimodal recognition of visual concepts using histograms of textual concepts and selective weighted late fusion scheme , 2013, Comput. Vis. Image Underst..

[92]  Petros Maragos,et al.  Adaptive Multimodal Fusion by Uncertainty Compensation With Application to Audiovisual Speech Recognition , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[93]  Gang Wang,et al.  Building text features for object image classification , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[94]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[95]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[96]  Matthieu Cord,et al.  Image classification using object detectors , 2013, 2013 IEEE International Conference on Image Processing.

[97]  Thomas S. Huang,et al.  Image Classification Using Super-Vector Coding of Local Image Descriptors , 2010, ECCV.

[98]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[99]  Amandine Bellenger Semantic Decision Support for Information Fusion Applications , 2013 .

[100]  Liang-Tien Chia,et al.  Local features are not lonely – Laplacian sparse coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[101]  Olivier Ferret,et al.  Testing Semantic Similarity Measures for Extracting Synonyms from a Corpus , 2010, LREC.

[102]  Daniel Gatica-Perez,et al.  PLSA-based image auto-annotation: constraining the latent space , 2004, MULTIMEDIA '04.

[103]  Yiannis Kompatsiaris,et al.  High order pLSA for indexing tagged images , 2013, Signal Process..

[104]  Mohamed A. Deriche,et al.  A New Technique for Combining Multiple Classifiers using The Dempster-Shafer Theory of Evidence , 2002, J. Artif. Intell. Res..

[105]  Gabriela Csurka,et al.  Semantic combination of textual and visual information in multimedia retrieval , 2011, ICMR.

[106]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[107]  Don Tapscott,et al.  Wikinomics: How Mass Collaboration Changes Everything , 2006 .

[108]  Ata Kabán,et al.  On an equivalence between PLSI and LDA , 2003, SIGIR.

[109]  Dinan Gunawardena,et al.  Social tags: meaning and suggestions , 2008, CIKM '08.

[110]  Didier Dubois,et al.  Possibilistic information fusion using maximal coherent subsets , 2007, 2007 IEEE International Fuzzy Systems Conference.

[111]  Mingjing Li,et al.  Color texture moments for content-based image retrieval , 2002, Proceedings. International Conference on Image Processing.

[112]  Cynthia Brandt,et al.  Semantic similarity in the biomedical domain: an evaluation across knowledge sources , 2012, BMC Bioinformatics.

[113]  Chong-Wah Ngo,et al.  Keyframe Retrieval by Keypoints: Can Point-to-Point Matching Help? , 2006, CIVR.

[114]  John Riedl,et al.  Tagommenders: connecting users to items through tags , 2009, WWW '09.

[115]  Changhu Wang,et al.  Scalable search-based image annotation , 2008, Multimedia Systems.

[116]  Didier Schwab,et al.  Antonymy and Conceptual Vectors , 2002, COLING.

[117]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[118]  Emmanuel Dellandréa,et al.  Associating Textual Features with Visual Ones to Improve Affective Image Classification , 2011, ACII.

[119]  Stefanie Nowak,et al.  The Fraunhofer IDMT at ImageCLEF 2011 Photo Annotation Task , 2011, CLEF.

[120]  Meng Wang,et al.  Visual tag dictionary: interpreting tags with visual words , 2009, WSMC '09.

[121]  Simone Paolo Ponzetto,et al.  WikiRelate! Computing Semantic Relatedness Using Wikipedia , 2006, AAAI.

[122]  Didier Dubois,et al.  Possibility theory , 2018, Scholarpedia.

[123]  Liang-Tien Chia,et al.  Web image concept annotation with better understanding of tags and visual features , 2010, J. Vis. Commun. Image Represent..

[124]  Didier Dubois,et al.  Uncertainty Theories: a Unified View , 2007 .

[125]  Daniel Gatica-Perez,et al.  On image auto-annotation with latent space models , 2003, ACM Multimedia.

[126]  Arthur P. Dempster,et al.  Upper and Lower Probabilities Induced by a Multivalued Mapping , 1967, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[127]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[128]  Shih-Fu Chang,et al.  To search or to label?: predicting the performance of search-based automatic image classifiers , 2006, MIR '06.

[129]  Alexander Panchenko Similarity measures for semantic relation extraction , 2013 .

[130]  Florent Perronnin,et al.  Large-scale image retrieval with compressed Fisher vectors , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[131]  Rainer Lienhart,et al.  Multimodal Image Retrieval , 2012, International Journal of Multimedia Information Retrieval.

[132]  Kilian Q. Weinberger,et al.  Resolving tag ambiguity , 2008, ACM Multimedia.

[133]  Junzhong Gu,et al.  New model of semantic similarity measuring in wordnet , 2008, 2008 3rd International Conference on Intelligent System and Knowledge Engineering.

[134]  Dong Liu,et al.  Tag ranking , 2009, WWW '09.

[135]  Lei Wu,et al.  Tag Completion for Image Retrieval , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[136]  Fabrice Souvannavong,et al.  Multi-modal classifier fusion for video shot content retrieval , 2005 .

[137]  Bart Thomee,et al.  Overview of the ImageCLEF 2012 Flickr Photo Annotation and Retrieval Task , 2012, CLEF.

[138]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[139]  Markus Strohmaier,et al.  Understanding why users tag: A survey of tagging motivation literature and results from an empirical study , 2012, J. Web Semant..

[140]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[141]  Philippe Smets,et al.  Practical Uses of Belief Functions , 1999, UAI.

[142]  Nello Cristianini,et al.  A statistical framework for genomic data fusion , 2004, Bioinform..

[143]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[144]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[145]  A. Tversky Features of Similarity , 1977 .

[146]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[147]  C. Schmid,et al.  Object Class Recognition Using Discriminative Local Features , 2005 .

[148]  Shuicheng Yan,et al.  Image tag refinement towards low-rank, content-tag prior and error sparsity , 2010, ACM Multimedia.

[149]  Dong Liu,et al.  Content-based tag processing for Internet social images , 2010, Multimedia Tools and Applications.

[150]  Latifur Khan,et al.  Image annotations by combining multiple evidence & wordNet , 2005, ACM Multimedia.

[151]  Ioannis Konstas,et al.  Categorising social tags to improve folksonomy-based recommendations , 2011, J. Web Semant..

[152]  Shuicheng Yan,et al.  Inferring semantic concepts from community-contributed images and noisy tags , 2009, ACM Multimedia.

[153]  Rainer Lienhart,et al.  Multilayer pLSA for multimodal image retrieval , 2009, CIVR '09.

[154]  Frédéric Jurie,et al.  Creating efficient codebooks for visual recognition , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[155]  Tony Veale,et al.  An Intrinsic Information Content Metric for Semantic Similarity in WordNet , 2004, ECAI.

[156]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[157]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[158]  Stéphane Ayache,et al.  Evaluation of active learning strategies for video indexing , 2007, Signal Process. Image Commun..

[159]  Gustavo Carneiro,et al.  Formulating semantic image annotation as a supervised learning problem , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[160]  H. L. Borgne,et al.  Prise en compte de l'imperfection des tags pour la classification sémantique d'images , 2012 .

[161]  Gareth J. F. Jones,et al.  A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task , 2010, CLEF.

[162]  Motoaki Kawanabe,et al.  The Joint Submission of the TU Berlin and Fraunhofer FIRST (TUBFI) to the ImageCLEF2011 Photo Annotation Task , 2011, CLEF.

[163]  Florent Perronnin,et al.  Fisher Kernels on Visual Vocabularies for Image Categorization , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[164]  Martin Chodorow,et al.  Combining local context and wordnet similarity for word sense identification , 1998 .

[165]  Steven C. H. Hoi,et al.  A two-view learning approach for image tag ranking , 2011, WSDM '11.

[166]  Sophie M. Wuerger,et al.  Continuous audio-visual digit recognition using N-best decision fusion , 2004, Inf. Fusion.

[167]  Vladimir Pavlovic,et al.  A New Baseline for Image Annotation , 2008, ECCV.

[168]  Jean Ponce,et al.  Learning mid-level features for recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[169]  Sourav S. Bhowmick,et al.  Image tag clarity: in search of visual-representative tags for social images , 2009, WSM@MM.

[170]  Grigorios Tsoumakas,et al.  MLKD's Participation at the CLEF 2011 Photo Annotation and Concept-Based Retrieval Tasks , 2011, CLEF.

[171]  Tat-Seng Chua,et al.  Fusion of AV features and external information sources for event detection in team sports video , 2006, TOMCCAP.

[172]  Md. Monirul Islam,et al.  A review on automatic image annotation techniques , 2012, Pattern Recognit..

[173]  Vedat Coskun,et al.  A new semantic similarity measure evaluated in word sense disambiguation , 2005, NODALIDA.

[174]  Adrian Popescu,et al.  CEA LIST's Participation to Visual Concept Detection Task of ImageCLEF 2011 , 2011, CLEF.

[175]  Hugo Jair Escalante,et al.  Late fusion of heterogeneous methods for multimedia image retrieval , 2008, MIR '08.

[176]  Florentin Smarandache,et al.  Advances and applications of DSmT for information fusion - Collected works - Volume 3 , 2009 .

[177]  Jon Atli Benediktsson,et al.  Decision Fusion for the Classification of Urban Remote Sensing Images , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[178]  Stefanie Nowak,et al.  New Strategies for Image Annotation: Overview of the Photo Annotation Task at ImageCLEF 2010 , 2010, CLEF.

[179]  Arcot Sowmya,et al.  Geometry Aware Local Kernels for Object Recognition , 2010, ACCV.

[180]  David A. Forsyth,et al.  Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[181]  Michael I. Jordan,et al.  Modeling annotated data , 2003, SIGIR.

[182]  Hichem Maaref,et al.  New fusion methodology approach and application to mobile robotics: investigation in the framework of possibility theory , 2001, Inf. Fusion.

[183]  Liming Chen,et al.  Semantic Bag-of-Words Models for Visual Concept Detection and Annotation , 2012, 2012 Eighth International Conference on Signal Image Technology and Internet Based Systems.

[184]  Driss Aboutajdine,et al.  Score Fusion in Multibiometric Identification Based on Fuzzy Set Theory , 2012, ICISP.

[185]  Andrew W. Fitzgibbon,et al.  Efficient Object Category Recognition Using Classemes , 2010, ECCV.

[186]  Graeme Hirst,et al.  Lexical chains as representations of context for the detection and correction of malapropisms , 1995 .

[187]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[188]  Mario A. Nascimento,et al.  A compact and efficient image retrieval approach based on border/interior pixel classification , 2002, CIKM '02.

[189]  Hao Su,et al.  Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification , 2010, NIPS.

[190]  Céline Hudelot,et al.  Belief Theory for Large-Scale Multi-label Image Classification , 2012, Belief Functions.

[191]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[192]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[193]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[194]  Mohan S. Kankanhalli,et al.  Multimodal fusion for multimedia analysis: a survey , 2010, Multimedia Systems.

[195]  Céline Hudelot,et al.  Codage des modèles de tags , 2013, Rev. d'Intelligence Artif..