The State of the Art in Image and Video Retrieval

Image and video retrieval continues to be one of the most exciting and fastest-growing research areas in the field of multimedia technology. What are the main challenges in image and video retrieval? Despite the sustained efforts in the last years, we think that the paramount challenge remains bridging the semantic gap. By this we mean that low level features are easily measured and computed, but the starting point of the retrieval process is typically the high level query from a human. Translating or converting the question posed by a human to the low level features seen by the computer illustrates the problem in bridging the semantic gap. However, the semantic gap is not merely translating high level features to low level features. The essence of a semantic query is understanding the meaning behind the query. This can involve understanding both the intellectual and emotional sides of the human, not merely the distilled logical portion of the query but also the personal preferences and emotional subtons of the query and the preferential form of the results.

[1]  Yan Liu,et al.  Fast Video Retrieval under Sparse Training Data , 2003, CIVR.

[2]  Bo Zhang,et al.  Constructive Learning Algorithm-Based RBF Network for Relevance Feedback in Image Retrieval , 2003, CIVR.

[3]  Ichiro Ide,et al.  Associating Cooking Video Segments with Preparation Steps , 2003, CIVR.

[4]  Peter G. B. Enser,et al.  Towards a Comprehensive Survey of the Semantic Gap in Visual Image Retrieval , 2003, CIVR.

[5]  Harriet J. Nock,et al.  Speaker Localisation Using Audio-Visual Synchrony: An Empirical Study , 2003, CIVR.

[6]  John P. Eakins,et al.  Shape Feature Matching for Trademark Image Retrieval , 2003, CIVR.

[7]  Mika Rummukainen,et al.  An Efficiency Comparison of Two Content-Based Image Retrieval Systems, GIFT and PicSOM , 2003, CIVR.

[8]  Luc Van Gool,et al.  HPAT Indexing for Fast Object/Scene Recognition Based on Local Appearance , 2003, CIVR.

[9]  Yu Cao,et al.  Audio-Assisted Scene Segmentation for Story Browsing , 2003, CIVR.

[10]  Jean-Marc Odobez,et al.  Spectral Structuring of Home Videos , 2003, CIVR.

[11]  Chi-Ren Shyu,et al.  EBS k-d Tree: An Entropy Balanced Statistical k-d Tree for Image Databases with Ground-Truth Labels , 2003, CIVR.

[12]  Yuechen Qian,et al.  Formal Development of a Distributed Logging Mechanism Supporting Disconnected Updates , 2003, ICFEM.

[13]  Jake K. Aggarwal,et al.  Video Retrieval of Human Interactions Using Model-Based Motion Tracking and Multi-layer Finite State Automata , 2003, CIVR.

[14]  Hisashi Miyamori Automatic Annotation of Tennis Action for Content-Based Retrieval by Integrated Audio and Visual Information , 2003, CIVR.

[15]  Paul H. Lewis,et al.  Integrated Image Content and Metadata Search and Retrieval across Multiple Databases , 2003, CIVR.

[16]  Stefan M. Rüger,et al.  Performance Comparison of Different Similarity Models for CBIR with Relevance Feedback , 2003, CIVR.

[17]  Fatos T. Yarman-Vural,et al.  Selection of the Best Representative Feature and Membership Assignment for Content-Based Fuzzy Image Database , 2003, CIVR.

[18]  Nevenka Dimitrova Multimedia Content Analysis: The Next Wave , 2003, CIVR.

[19]  John R. Kender,et al.  Multiple Features in Temporal Models for the Representation of Visual Contents in Video , 2003, CIVR.

[20]  Antonio C. Siochi,et al.  A State Transition Analysis of Image Search Patterns on the web , 2003, CIVR.

[21]  Guangyou Xu,et al.  Fast Search in Large-Scale Image Database Using Vector Quantization , 2003, CIVR.

[22]  John R. Smith,et al.  A Hybrid Framework for Detecting the Semantics of Concepts and Context , 2003, CIVR.

[23]  Olivier Buisson,et al.  Robust Content-Based Video Copy Identification in a Large Reference Database , 2003, CIVR.

[24]  Rong Yan,et al.  Multimedia Search with Pseudo-relevance Feedback , 2003, CIVR.

[25]  Anup Basu,et al.  Improving Fractal Codes Based Image Retrieval Using Histogram , 2003, CIVR.

[26]  Sai-Ping Li,et al.  A guided Monte Carlo approach to optimization problems , 2003 .

[27]  Paul Over,et al.  TRECVID: Benchmarking the Effectivenss of Information Retrieval Tasks on Digital Video , 2003, CIVR.

[28]  Lawrence Wai-Choong Wong,et al.  ANSES: Summarisation of News Video , 2003, CIVR.

[29]  John P. Eakins,et al.  Content-Based Retrieval of Historical Watermark Images: II - Electron Radiographs , 2003, CIVR.

[30]  Bo Zhang,et al.  Learning in Region-Based Image Retrieval , 2003, CIVR.

[31]  Nicu Sebe,et al.  Evaluation of Expression Recognition Techniques , 2003, CIVR.

[32]  Ling Guan,et al.  Concept-Based Retrieval of Art Documents , 2003, CIVR.

[33]  Jae-Woo Chang,et al.  Efficient Similar Trajectory-Based Retrieval for Moving Objects in Video Databases , 2003, CIVR.

[34]  Sungyoung Kim,et al.  Central Object Extraction for Object-Based Retrieval , 2003, CIVR.

[35]  Joo-Hwee Lim,et al.  Home Photo Retrieval: Time Matters , 2003, CIVR.

[36]  Hanqing Lu,et al.  Multilevel Relevance Judgement, Loss Function, and Performance Measure in Image Retrieval , 2003, CIVR.

[37]  Nicholas R. Howe,et al.  A Closer Look at Boosted Image Retrieval , 2003, CIVR.

[38]  Chong-Wah Ngo,et al.  Detection of Documentary Scene Changes by Audio-Visual Fusion , 2003, CIVR.

[39]  Xavier Binefa,et al.  Spatio-Temporal Decomposition of Sport Events for Video Indexing , 2003, CIVR.

[40]  Joemon M. Jose,et al.  Audio-Based Event Detection for Sports Video , 2003, CIVR.

[41]  John R. Kender,et al.  Spatial-Temporal Semantic Grouping of Instructional Video Content , 2003, CIVR.

[42]  John R. Smith,et al.  Modal Keywords, Ontologies, and Reasoning for Video Understanding , 2003, CIVR.

[43]  Mika Rautiainen,et al.  Detecting Semantic Concepts from Video Using Temporal Gradients and Audio Classification , 2003, CIVR.

[44]  Gary Marchionini,et al.  Text or Pictures? An Eyetracking Study of How People View Digital Video Surrogates , 2003, CIVR.

[45]  Mu Zhang,et al.  Hierarchical Clustering-Merging for Multidimensional Index Structures , 2003, CIVR.

[46]  Fatos T. Yarman-Vural,et al.  BAS: a perceptual shape descriptor based on the beam angle statistics , 2003, Pattern Recognit. Lett..

[47]  Anuj Srivastava,et al.  Learning Optimal Representations for Image Retrieval Applications , 2003, CIVR.

[48]  Heung-Kyu Lee,et al.  Majority Based Ranking Approach in Web Image Retrieval , 2003, CIVR.

[49]  Kiyoharu Aizawa,et al.  Indexing of Personal Video Captured by a Wearable Imaging , 2003, CIVR.

[50]  Sriram K. Rajamani,et al.  Model Checking Software , 2003, Lecture Notes in Computer Science.

[51]  Michael R. Lyu,et al.  A Novel Scheme for Video Similarity Detection , 2003, CIVR.