论文引用

Jiong Yu, Li Tan, Yuanda Cao et al.,
2009,
J. Softw.

Semantic concept classification is a critical task for content-based video retrieval. Traditional methods of machine learning focus on increasing the accuracy of classifiers or models, and face the pr...

Hai-Miao Hu, Bo Li, Fan Jiang et al.,
2016,
Neurocomputing

Retrieving images with multiple features is an active research topic on boosting the performance of existing content-based image retrieval methods. The promising bags-of-words (BoW) models involve mul...

Li Tan, Yuanda Cao, Minghua Yang et al.,
2008 Fourth International Conference on Natural Computation

Recent work in visual retrieval shows that bag-of-features (BoF) has appeared promising for object recognition and categorization. Local descriptors such as SIFT have shown impressive results on objec...

Sheng Tang, Yongdong Zhang, Tat-Seng Chua et al.,
2011,
IEEE Transactions on Circuits and Systems for Video Technology

Realistic human action recognition in videos has been a useful yet challenging task. Video shots of same actions may present huge intra-class variations in terms of visual appearance, kinetic patterns...

Akramul Azim, Nayreet Islam, Nayreet Islam et al.,
2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

Real-time applications are usually well-defined and operate based on a particular system model. However, in practical scenarios, the applications can perform differently because of the uncertainties i...

Meng Wang, Tat-Seng Chua, Richang Hong et al.,
2012,
TOMCCAP

Product annotation in videos is of great importance for video browsing, search, and advertisement. However, most of the existing automatic video annotation research focuses on the annotation of high-l...

Meng Wang, Tat-Seng Chua, Xiangdong Zhou et al.,
2011,
IEEE Transactions on Multimedia

One of the main challenges in interactive concept-based video search is the problem of insufficient relevant samples, especially for queries with complex semantics. In this paper, “related samples” ar...

Sheng Tang, Yongdong Zhang, Shaoxi Xu et al.,
2010 Fourth Pacific-Rim Symposium on Image and Video Technology

Multi-modality, the unique and important property of video data, is typically ignored in existing video adaptation processes. To solve this problem, we propose a novel approach, named multi-modality t...

Latent Dirichlet allocation (LDA) topic model has taken a center stage in multimedia information retrieval, for example, LDA model was used by several participants in the recent TRECVid evaluation “Se...

Haojie Li, Yukinobu Taniguchi, Kyoko Sudo et al.,
2013,
TRECVID

In this report, we describe the approaches and experiments on TRECVid 2013 video concept detection conducted by NTT Media Intelligence Laboratories in collaboration with Dalian University of Technolog...

Sheng Tang, Yongdong Zhang, Jintao Li et al.,
2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops

In this paper, we specially propose a hierarchical framework for movie content analysis. The purpose of our work is trying to realize computerspsila understanding for movie content, especially ldquowh...

Marcel Worring, Cees G. M. Snoek, M. Worring,
2009,
Found. Trends Inf. Retr.

In this paper, we review 300 references on video retrieval, indicating when text-only solutions are unsatisfactory and showing the promising alternatives which are in majority concept-based. Therefore...

Anan Liu, Zhaoxuan Yang, Anan Liu et al.,
2010,
J. Digit. Content Technol. its Appl.

In this paper, we propose a human-centered framework, “Watching, Thinking, Reacting”, for movie content analysis. The framework consists of a hierarchy of three levels. The low level represents human ...

Sheng Tang, Yongdong Zhang, Jintao Li et al.,
2008,
WWW

In this paper, we highlight the use of multimedia technology in generating intrinsic summaries of tourism related information. The system utilizes an automated process to gather, filter and classify i...

Masashi Morimoto, Go Irie, Akira Kojima et al.,
2011,
TRECVID

In this paper, we describe the TRECVid 2011 semantic indexing system first developed at the NTT Cyber Communication Laboratory Group in collaboration with Zhejiang University. In addition to adopting ...

John R. Smith, Wei Jiang, Michele Merler et al.,
2009,
TRECVID

In this paper, we describe the IBM Research system for indexing, analysis, and retrieval of video as applied to the TREC-2007 video retrieval benchmark. This year, focus of the system improvement was ...

Joemon M. Jose, P. Punitha,
2006,
TRECVID

In this paper we describe our experiments in the automatic search task of TRECVid 2007. For this we have implemented a new video search technique based on SIFT features and manual annotation. We submi...

Marcel Worring, M. de Rijke, Josef Kittler et al.,
2009,
TRECVID

In this paper we describe our TRECVID 2009 video retrieval experiments. The MediaMill team participated in three tasks: concept detection, automatic search, and interactive search. Starting point for ...

In this paper we describe our TRECVID 2008 video retrieval experiments. The MediaMill team participated in three tasks: concept detection, automatic search, and interactive search. Rather than continu...

Dennis Koelma, Xirong Li, Masoud Mazloom et al.,
2008,
TRECVID

In this paper we describe our TRECVID 2008 video retrieval experiments. The MediaMill team participated in three tasks: concept detection, automatic search, and interac- tive search. Rather than conti...